Automate Network Management Using Gen AI Ops with MongoDB
Imagine it’s a typical Tuesday afternoon and you’re the operations manager for a major North American telecommunications company. Suddenly, your Network Operations Center (NOC) receives an alert that web traffic in Toronto has surged to several times its usual baseline over the last hour. At nearly the same moment, a major Toronto-based client complains that their video streams have been buffering nonstop.
Just a few years ago, a scenario like this would trigger a frantic scramble: teams digging into logs, manually writing queries, and attempting to correlate thousands of lines of data in different formats to find a single root cause.
Today, there’s a more streamlined, AI-driven approach. By combining MongoDB’s developer data platform with large language models (LLMs) and a retrieval-augmented generation (RAG) architecture, you can move from reactive “firefighting” to proactive, data-informed diagnostics. Instead of juggling multiple monitoring dashboards or writing complicated queries by hand, you can simply ask for insights—and the system retrieves and analyzes the necessary data automatically.
Facing the unexpected traffic spike
Now let’s imagine the same situation, but this time with AI-assisted network management. Shortly after you spot a traffic surge in Toronto, your NOC chatbot pings you with a situation report: requests from one neighborhood are skyrocketing, and an unusually high percentage involve video streaming paths or caching servers.
Under the hood, MongoDB automatically ingests every log entry and telemetry event in real time—capturing IP addresses, geographic data, request paths, timestamps, router logs, and sensor data. Meanwhile, textual content (such as error messages, user complaints, and chat transcripts) is vectorized and stored in MongoDB for semantic search. This setup enables near-instant access to relevant information whenever a keyword like “buffering,” “video streams,” or “streaming lag” is mentioned, ensuring a fast, end-to-end diagnosis.
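To make the vectorization step concrete, here is a minimal ingestion sketch in Python, assuming a pymongo connection and the sentence-transformers library; the database, collection, field names, and embedding model are illustrative placeholders rather than a prescribed schema.

```python
# A minimal sketch, assuming an Atlas cluster and the sentence-transformers
# library; collection and field names here are illustrative assumptions.
from pymongo import MongoClient
from sentence_transformers import SentenceTransformer

client = MongoClient("mongodb+srv://<user>:<password>@<cluster>/")
logs = client["network_ops"]["text_logs"]

model = SentenceTransformer("all-MiniLM-L6-v2")

def ingest_text_event(message: str, source: str) -> None:
    """Store a textual event together with its embedding for semantic search."""
    logs.insert_one({
        "message": message,
        "source": source,
        "embedding": model.encode(message).tolist(),  # vector used by vector search
    })

ingest_text_event("Video streams keep buffering in Toronto", "user_complaint")
```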
Refer to this article to learn more about semantic search.
Zeroing in on the root cause
Instead of rummaging through separate logging tools, you pose a simple natural-language question to the system: “What might be causing the client’s video stream buffering problem in Toronto?” The LLM responds by generating a custom MongoDB Aggregation Pipeline in Python, tailored to your query.
It might look something like this: a $match stage to filter for the last twenty-four hours of data in Toronto, a $group stage to roll up metrics by streaming service, and a $sort stage to surface the largest error counts.
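As a hedged illustration of what that generated code could look like, here is a pipeline sketch; the connection string and the field names such as city, service, and error_count are assumptions about the log schema, not output the LLM is guaranteed to produce.

```python
# A sketch of the kind of pipeline the LLM might generate; the database,
# collection, and field names are assumptions about the log schema.
from datetime import datetime, timedelta, timezone
from pymongo import MongoClient

client = MongoClient("mongodb+srv://<user>:<password>@<cluster>/")
traffic_logs = client["network_ops"]["traffic_logs"]

pipeline = [
    # Keep only the last twenty-four hours of Toronto traffic.
    {"$match": {
        "city": "Toronto",
        "timestamp": {"$gte": datetime.now(timezone.utc) - timedelta(hours=24)},
    }},
    # Roll up request and error metrics per streaming service.
    {"$group": {
        "_id": "$service",
        "requests": {"$sum": 1},
        "errors": {"$sum": "$error_count"},
    }},
    # Put the services with the largest error counts first.
    {"$sort": {"errors": -1}},
]

results = list(traffic_logs.aggregate(pipeline))
```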
The code is automatically served back to you, and with a quick confirmation, you execute it on your MongoDB cluster. A moment later, the chatbot returns with a summarized explanation that points to an overloaded local CDN node, along with higher-than-expected requests from older routers known to misbehave under peak load.
Next, you ask the system to explain the core issue in simpler terms so you can share it with a business stakeholder. The LLM takes the numeric results from the Aggregation Pipeline, merges them with textual logs that mention “firmware out-of-date,” and then outputs a cohesive explanation. It even suggests that many of these older routers are still running last year’s firmware release—a known contributor to buffering issues on video streams during traffic spikes.
How retrieval-augmented generation (RAG) helps
The power behind this effortless insight is a RAG architecture, which marries semantic search with generative text responses. First, the LLM uses vector search in MongoDB to retrieve only those log entries, complaint records, and knowledge base articles that directly relate to streaming. Once it has these key data chunks, the LLM can generate—and continually refine—its analysis.
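A minimal sketch of that retrieval step might look like the following, reusing the collection and embedding model from the ingestion sketch above; the index name log_vector_index is an assumption and must match a vector search index you have actually created on the collection.

```python
# A retrieval sketch using Atlas Vector Search; the index name is an
# assumption that must match an index defined on the collection.
from pymongo import MongoClient
from sentence_transformers import SentenceTransformer

client = MongoClient("mongodb+srv://<user>:<password>@<cluster>/")
logs = client["network_ops"]["text_logs"]
model = SentenceTransformer("all-MiniLM-L6-v2")

# Embed the operator's question the same way the stored text was embedded.
query_vector = model.encode("video stream buffering in Toronto").tolist()

relevant_chunks = list(logs.aggregate([
    {"$vectorSearch": {
        "index": "log_vector_index",   # assumed Atlas Vector Search index
        "path": "embedding",
        "queryVector": query_vector,
        "numCandidates": 200,          # breadth of the approximate search
        "limit": 10,                   # top chunks handed to the LLM as context
    }},
    {"$project": {"message": 1, "source": 1, "_id": 0}},
]))
```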
Figure 1. Network chatbot architecture with MongoDB.
When the system references historical data to confirm that “similar spikes occurred during the playoffs last year” or that “users with older firmware frequently complain about buffering,” it’s not blindly guessing. Instead, it’s accessing domain-specific logs, user feedback, and diagnostic documents stored in MongoDB, and then weaving them together into a coherent explanation. This eliminates guesswork and slashes the time your team would otherwise spend on low-level data cleanup, correlation, and interpretation.
Executing automated remediation
Armed with these insights, your team can roll out a targeted fix, possibly involving an auto-update to the affected routers or load-balancing traffic to alternative CDN endpoints. MongoDB’s Change Streams can monitor for future anomalies. If a traffic spike starts to look suspiciously similar to the scenario you just solved, the system can raise a proactive alert or even initiate the fix automatically.
Refer to the official documentation to learn more about change streams.
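A minimal sketch of that watch loop follows, assuming anomaly detectors write documents into an anomalies collection; the collection name, document shape, and remediation hook are all illustrative.

```python
# A sketch of watching for fresh anomaly events with a change stream;
# the "anomalies" collection and document fields are assumptions.
from pymongo import MongoClient

client = MongoClient("mongodb+srv://<user>:<password>@<cluster>/")
anomalies = client["network_ops"]["anomalies"]

# Only react to newly inserted documents that look like traffic spikes.
watch_filter = [{"$match": {
    "operationType": "insert",
    "fullDocument.kind": "traffic_spike",
}}]

with anomalies.watch(watch_filter) as stream:
    for change in stream:
        doc = change["fullDocument"]
        print(f"Spike detected in {doc.get('city')}, checking known signatures")
        # Here you might compare against previously solved incidents and,
        # on a match, raise an alert or kick off the automated fix.
```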
Meanwhile, the cost savings add up. You no longer need engineers to piece data together by hand, and users aren’t left waiting while you figure out what’s happening. Everything from anomaly detection to root-cause analysis and recommended mitigation steps flows through a single pipeline—visible and explainable in plain language.
A future of AI-driven operations
This scenario highlights how gen AI ops and MongoDB complement each other to transform network management:
Schema flexibility: MongoDB’s document-based model effortlessly stores logs, performance metrics, and user feedback in a single, consistent environment.
Real-time performance: With horizontal scaling, you can ingest the massive volumes of data generated by network logs and user requests at any hour of the day.
Vector search integration: By embedding textual data (such as logs, user complaints, or FAQs) and storing those vectors in MongoDB, you enable instant retrieval of semantically relevant content—making it easy for an LLM to find exactly what it needs.
Aggregation + LLM: An LLM can auto-generate MongoDB Aggregation Pipelines to sift through numeric data with ease, while a second pass to the LLM composes a final summary that merges numeric and textual analysis (see the sketch after this list).
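The sketch below shows the shape of that two-pass pattern; call_llm is a hypothetical stand-in for whatever chat-completion client you use, not a real API, and the prompts are deliberately simplified.

```python
import json

def answer_network_question(question: str, db) -> str:
    """Two-pass pattern: generate a pipeline, then summarize its results."""
    # Pass 1: ask the LLM to write an aggregation pipeline for the question.
    pipeline_json = call_llm(  # hypothetical chat-completion helper
        f"Write a MongoDB aggregation pipeline (as a JSON array) answering: {question}"
    )
    results = list(db["traffic_logs"].aggregate(json.loads(pipeline_json)))

    # Pass 2: ask the LLM to turn numeric results into a plain-language summary.
    return call_llm(
        "Summarize these findings for an operations manager:\n"
        + json.dumps(results, default=str)
    )
```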
Once you see how much time and effort this end-to-end workflow saves, you can extend it across the entire organization. Whether it’s analyzing sudden traffic spikes in specific geographies, diagnosing a security event, or handling peak online shopping loads during a holiday sale, the concept remains the same: empower people to ask natural-language questions about complex data, rely on AI to craft the specialized queries behind the scenes, and store it all in a platform built to scale with that complexity.
Ready to embrace gen AI ops with MongoDB?
Network disruptions will never fully disappear, but how quickly and intelligently you respond can be a game-changer. By uniting MongoDB with LLM-based AI and a retrieval-augmented generation (RAG) strategy, you transform your network operations from a tangle of logs and dashboards into a conversational, automated, and deeply informed system.
Sign up for MongoDB Atlas to start building your own RAG-based workflows. With intelligent vector search, automated pipeline generation, and natural-language insights, you’ll be ready to tackle everything from video-stream buffering complaints to the next unexpected traffic surge—before users realize there’s a problem.
If you would like to learn more about how to build gen AI applications with MongoDB, visit the following resources:
Learn more about MongoDB capabilities for artificial intelligence on our product page.
Get started with MongoDB Vector Search by visiting our product page.
Blog: Leveraging an Operational Data Layer for Telco Success
Want to learn more about why MongoDB is the best choice for supporting modern AI applications? Check out our on-demand webinar, “Comparing PostgreSQL vs. MongoDB: Which is Better for AI Workloads?” presented by MongoDB Field CTO Rick Houlihan.