Building with Patterns: The Attribute Pattern

Daniel Coupal and Ken W. Alger
January 24, 2019 | Updated: February 13, 2019
#University

Welcome back to the Building with Patterns series. Last time we looked at the Polymorphic Pattern which covers situations when all documents in a collection are of similar, but not identical, structure. In this post, we’ll take a look at the Attribute Pattern. The Attribute Pattern is particularly well suited when:

We have big documents with many similar fields but there is a subset of fields that share common characteristics and we want to sort or query on that subset of fields, or
The fields we need to sort on are only found in a small subset of documents, or
Both of the above conditions are met within the documents.

For performance reasons, to optimize our search we’d likely need many indexes to account for all of the subsets. Creating all of these indexes could reduce performance. The Attribute Pattern provides a good solution for these cases.

The Attribute Pattern

Let’s think about a collection of movies. The documents will likely have similar fields involved across all of the documents: title, director, producer, cast, etc. Let’s say we want to search on the release date. A challenge that we face when doing so, is which release date? Movies are often released on different dates in different countries.


{
    title: "Star Wars",
    director: "George Lucas",
    ...
    release_US: ISODate("1977-05-20T01:00:00+01:00"),
    release_France: ISODate("1977-10-19T01:00:00+01:00"),
    release_Italy: ISODate("1977-10-20T01:00:00+01:00"),
    release_UK: ISODate("1977-12-27T01:00:00+01:00"),
    ...
}

A search for a release date will require looking across many fields at once. In order to quickly do searches for release dates, we’d need several indexes on our movies collection:

{release_US: 1}
{release_France: 1}
{release_Italy: 1}
...

By using the Attribute Pattern, we can move this subset of information into an array and reduce the indexing needs. We turn this information into an array of key-value pairs:

{
    title: "Star Wars",
    director: "George Lucas",
    …
    releases: [
        {
        location: "USA",
        date: ISODate("1977-05-20T01:00:00+01:00")
        },
        {
        location: "France",
        date: ISODate("1977-10-19T01:00:00+01:00")
        },
        {
        location: "Italy",
        date: ISODate("1977-10-20T01:00:00+01:00")
        },
        {
        location: "UK",
        date: ISODate("1977-12-27T01:00:00+01:00")
        },
        … 
    ],
    … 
}

Indexing becomes much more manageable by creating one index on the elements in the array:

{ "releases.location": 1, "releases.date": 1}

By using the Attribute Pattern we can add organization to our documents for common characteristics and account for rare/unpredictable fields. For example, a movie released in a new or small festival. Further, moving to a key/value convention allows for the use of non-deterministic naming and the easy addition of qualifiers. For example, if our data collection was on bottles of water, our attributes might look something like:

"specs": [
    { k: "volume", v: "500", u: "ml" },
    { k: "volume", v: "12", u: "ounces" }
]

Here we break the information out into keys and values, “k” and “v”, and add in a third field, “u” which allows for the units of measure to be stored separately.

{"specks.k": 1, "specs.v": 1, "specs.u": 1}

Sample Use Case

The Attribute Pattern is well suited for schemas that have sets of fields that have the same value type, such as lists of dates. It also works well when working with the characteristics of products. Some products, such as clothing, may have sizes that are expressed in small, medium, or large. Other products in the same collection may be expressed in volume. Yet others may be expressed in physical dimensions or weight.

A customer in the domain of asset management recently deployed their solution using the Attribute Pattern. The customer uses the pattern to store all characteristics of a given asset. These characteristics are seldom common across the assets or are simply difficult to predict at design time. Relational models typically use a complicated design process to express the same idea in the form of user-defined fields.

While many of the fields in the product catalog are similar, such as name, vendor, manufacturer, country of origin, etc., the specifications, or attributes, of the item may differ. If your application and data access patterns rely on searching through many of these different fields at once, the Attribute Pattern provides a good structure for the data.

Conclusion

The Attribute Pattern provides for easier indexing the documents, targeting many similar fields per document. By moving this subset of data into a key-value sub-document, we can use non-deterministic field names, add additional qualifiers to the information, and more clearly state the relationship of the original field and value. When we use the Attribute Pattern, we need fewer indexes, our queries become simpler to write, and our queries become faster.

The next pattern we’ll discuss is the Bucket Design Pattern.

If you have questions, please leave comments below.

← Previous

The Future Will Be Documented

MongoDB was born out of frustration from using relational databases not designed for today's modern applications. After 40 years of no real alternatives, we pioneered a new way to work with data -- the MongoDB document model and the associated query language.

January 22, 2019

Next →

Empower Financial Services Developers with the Document Model

In financial services , having a proper data modeling strategy is crucial. The volume of data that banks collect and create is continuously growing, partly due to the expansion of digital banking and payments. Financial institutions rely heavily on data to power applications, analyze risks, and make decisions. The way data is modeled can significantly impact the performance, scalability, and accuracy of these systems, as well as the speed of development for new financial products or services. According to Celent, 62% of banks say the “competitive threat from fintechs and other challengers is increasing.” This is no surprise, as the convenience of managing all financial affairs in one centralized app or on a digital-first platform is pulling more and more customers away from traditional financial institutions. In this blog, we will explore why 24% of retail banks ranked “data platforms and management” as one of their top three IT spending priorities for 2024–2025 and how data modeling plays a critical role in this strategic focus. Data modeling and the document model In financial services, data modeling is a critical process that underpins effective data management, enabling institutions to harness the full potential of their data. This process involves identifying relevant financial data and determining how it should be visualized, including its structure, relationships, and management. A well-structured data model leads to better application performance, simpler implementations, and lower costs. It also enhances adaptability and maintainability over time. The principle that data accessed together should be stored together is fundamental for optimizing performance. MongoDB is a document-oriented NoSQL database that offers flexibility and scalability, making it an ideal choice for the industry. It runs with unparalleled reliability, security, and flexibility in a multi-cloud or on-premises environment. Unlike traditional relational databases, MongoDB allows for a more dynamic schema, enabling organizations to adapt to changing business needs without the constraints of rigid table structures. Document databases have the following key features: Document model: Data is stored in documents (unlike other databases that store data in structures like tables or graphs). The documents map to objects in the most popular programming languages, enabling developers to rapidly develop their applications. Flexible schema: MongoDB’s schema-less design enables developers to store data in a way that reflects the real-world relationships between entities, making it easier to manage complex data structures. Data distribution and resilience: Document databases are distributed, which allows for horizontal scaling (typically cheaper than vertical scaling) and data distribution. Queryability through an API or query language: Document databases have an API or query language that allows developers to execute the CRUD operations on the database. Developers have the ability to query for documents based on unique identifiers or field values. MongoDB’s document model enables an organization to design document structures that mirror its application’s access patterns. By embedding related data as subdocuments and arrays within a single document, it can make sure that data that is frequently accessed together is stored together. This alignment simplifies the mapping between the application and the database, enhancing both development efficiency and performance. In contrast to relational databases, document databases are better suited to the needs of modern applications because of their ability to store diverse data types (both structured and unstructured) in binary JSON (BSON) format. This flexibility essentially eliminates the middle layer necessary to convert to a SQL-like format, resulting in easier-to-maintain applications, lower development times, and faster responses to changes. As a document’s schema is dynamic and self-describing, developers don’t need to predefine it in the database. They can modify it at any time, avoiding disruptive schema migrations and improving their own productivity and experience. Watch now: Intro to Data Modeling for Financial Services and Insurance explains: The definition of data modeling Relational database management systems vs MongoDB Relationships: Linking vs Embedding Design Patterns MongoDB’s document model for financial services Leading financial institutions across the world are increasingly adopting MongoDB with seven of the top ten banks in the world (per the 2024 Forbes Global 2000) utilizing MongoDB in their application architectures. MongoDB Atlas is an integrated suite of data services centered on a document database designed to accelerate and simplify how users build with data. MongoDB enables banks to employ an iterative approach to banking modernization . This approach preserves legacy components for as long as they’re needed. By deploying MongoDB as an operational data layer (ODL) in a phased approach, banks can embark on their digital transformation journeys iteratively, without the risk of an all-or-nothing, rip-and-replace approach. Once the new architecture is in place, development teams can build new business functionality faster and scale new services to millions of users. Here are some of the ways that MongoDB helps financial institutions meet their industry data challenges: Open finance: MongoDB’s flexible schema, with the ability to handle various data types including structured and unstructured data, seamlessly integrates with modern technologies and frameworks, making it a great fit for orchestrating open API ecosystems. Personalized banking experiences: Retail banks aim to deliver hyper-personalized services, such as tailored financial advice or customized product recommendations. A flexible and accurate document model ensures platforms can integrate data from multiple channels (mobile apps, branches, and ATMs) for a seamless experience. Encryption and access control: MongoDB provides security capabilities like field-level encryption, role-based access control (RBAC), and auditing—key features to enhance data security and facilitate compliance with regulations like Financial Data Access (FiDA). By encrypting sensitive customer data both at rest and in transit, MongoDB helps keep data secure and tamper-proof throughout its lifecycle. Data sovereignty and global deployment: Financial data must adhere to strict geographic regulations, with certain jurisdictions enforcing the requirement that data remain within specific regions. MongoDB Atlas offers multi-cloud and multi-region deployments, enabling financial institutions to store data in specific regions while maintaining compliance with data sovereignty laws. Multidocument ACID transactions: Transactions in MongoDB feel just like transactions developers are familiar with in relational databases. With multi-document atomicity, consistency, isolation, and durability (ACID) transactions, developers can address transactional use cases across multiple documents within the same cluster. Payment system scalability and flexibility: MongoDB’s document database excels in unifying an organization’s data, from backend payment processing to customer interactions, surfacing insights to create a seamless, connected, and personalized customer journey. Payment systems must accommodate fluctuating transaction volumes and evolving business needs. MongoDB Atlas makes scaling as easy as setting the right configuration. It supports both horizontal and vertical scaling. Fraud detection: MongoDB’s document data model incorporates any kind of data—any structure, any format, any source—no matter how often it changes, enabling an organization to create a holistic picture of its customers to better predict transaction anomalies in real-time. MongoDB then enables it to process large amounts of data and analyze it in real-time to identify suspicious activity. Financial data management: MongoDB lets organizations capture and store financial and trade-related data together, respond to compliance and regulatory requests with confidence, and analyze pretrade communications to gain insights and detect errors. AI-driven interactive banking: MongoDB is designed to work seamlessly with leading AI frameworks, enabling banks to integrate and scale AI applications quickly and efficiently. MongoDB’s flexibility supports innovation by providing a scalable, developer-friendly environment that enables businesses to rapidly develop new financial services and products and scale to support millions of users. And, as a document-based database, MongoDB supports the flexible data modeling that is so crucial to the financial services industry. If you have any questions or would like to learn more about MongoDB and data modeling, feel free to check out the following resources: Intro to Data Modeling for Financial Services and Insurance Temenos Banking Cloud Scales to Record High Transactions with MongoDB Atlas and Microsoft Azure Our Solutions Library is where you can learn about different use cases for gen AI and other interesting topics that are applied to financial services and many other industries.

January 22, 2025