Atlas Stream Processing 现已推出公共预览版

Clark Gates-George and Joe Niemiec
February 13, 2024 | Updated: February 15, 2024

今天，我们很高兴地宣布，Atlas Stream Processing现已推出公共预览版。在Atlas平台上有兴趣尝试这项功能的开发者都享有完全的访问权限。参阅相关文档了解更多详细信息，或立即开始使用。

欢迎收听MongoDB播客，聆听流媒体产品负责人Kenny Gorman对于Atlas Stream Processing公共预览版的详细介绍。

开发者青睐文档模型的灵活性、易用性以及Query API查询方式，这使得他们能够在MongoDB Atlas中以代码方式处理数据。借助Atlas Stream Processing，我们将这些相同的基本原则应用于流处理中。Atlas Stream Processing在2023年纽约MongoDB用户大会上首次推出，它正在重塑旨在聚合和丰富快速变化的事件数据流的体验，并统一了处理流动数据和静态数据的方法。

到目前为止，开发者使用该产品的情况如何？我们从中学到了什么？

在内测阶段，我们收到了数千个开发团队关于希望获取访问权限的请求，并且从数百个参与团队中收集了有价值的反馈意见。其中一些用例包括：

某全球领先的航空公司利用复杂的聚合技术，快速处理维护和运营数据，以确保航班能够准时起飞和到达，满足成千上万名乘客的需求；
某大型能源设备制造商使用Atlas Stream Processing来连续监控关于泵设备的海量数据，以避免意外停机并提升运行效果；以及
某创新型企业“软件即服务”（SaaS）提供商充分利用Atlas Stream Processing中丰富的处理功能来及时提供包含背景信息的产品内警报，从而提升产品参与度。

这些用例仅仅是我们在各行各业中观察到的Atlas Stream Processing众多应用实例中的一小部分。除了我们观察到的众多用例外，开发者也向我们提供了丰富的见解，使我们了解到他们希望未来Atlas Stream Processing应添加哪些功能。

除了支持通过 change stream 对 Atlas 数据库中的数据进行持续处理外，开发者可使用 Atlas Stream Processing 处理由 Confluent、Amazon MSK、Azure Event Hubs 和 Redpanda 等重要合作伙伴托管的 Kafka 数据，此功能也非常强大。我们开发者数据平台功能的目标始终是为开发者所依赖的关键技术提供更好的体验。

公共预览版中有哪些新功能？

基于上述情况，我们增加了新功能。随着使用团队数量的增加，我们正在扩展功能，以便将在内测阶段收集到的呼声最高的反馈意见纳入其中。通过梳理大量的反馈意见，我们从中总结出了三个共同的主题：

提升开发者体验
扩展高级特性和功能
改善运行和增强安全性

提升开发者体验

在内测阶段，我们将开发者体验置于核心位置，这对于促使Atlas Stream Processing成为开发团队的首选解决方案至关重要。在公共预览版中，我们更注重提升开发者的体验，为此我们增加了两项增强型功能。

VS Code 集成
MongoDB VS Code插件增加了对连接流处理实例的支持。对于那些已经使用了该插件的开发者而言，随着这项新功能的引入，团队能够在熟悉的开发环境中轻松地创建和管理处理器。这意味开发者不再需要频繁切换工具，而可以将更多时间用来构建应用程序！
改进了死信队列 (DLQ) 功能
DLQ支持是实现强大流处理功能的关键要素，在公开预览版中，我们进一步扩展了DLQ功能。现在，当使用sp.process()来执行管道操作以及在运行中的处理器上运行.sample()时，DLQ消息将自动显示，这样可以简化开发工作，而无需设置目标集合来充当DLQ。

扩展高级特性和功能

Atlas Stream Processing原本就已经支持很多常用的聚合操作符，这些操作符在静态数据Query API查询中经常被开发者所使用。而且我们还增加了强大的窗口功能，以及可轻松合并数据并将其发送到Atlas数据库或Kafka主题的功能。公开预览版将提供更多功能，以满足那些依靠流处理来提供卓越客户体验的先进团队的需求，包括：

$lookup操作符
现在，开发者可以通过使用远程Atlas集群的数据，对流处理器中正在处理的文档以及目标集合中的字段进行连接，从而丰富流处理器中正在处理的文档。
变更流变更前后文档信息的存储
许多开发者正在使用Atlas Stream Processing通过变更流持续处理Atlas数据库（作为源）中的数据。我们在公共预览中增强了变更流$source变量，以支持变更前和变更后文档信息的存储。这一功能使得开发者能够处理一些常见的用例，包括计算文档中字段之间的增量或差异，以及访问已删除文档的完整内容等。
在合并和发出阶段使用动态表达式进行条件路由
通过条件路由，开发者可以使用Atlas Stream Processing正在处理的文档中的字段值，动态地将特定的消息发送到不同的Atlas集合或Kafka主题。现在$merge和$emit阶段也支持使用动态表达式。基于此，用户可以根据需求，在需要将信息分发到不同集合或主题的用例中使用Query API查询。
空闲流超时
现在，用户可以对那些因缺乏传入数据导致水印无法更新的流进行配置，在特定时间段后关闭这些数据流，并输出窗口结果。这对于处理数据流不一致的流媒体源来说至关重要。

改善运行和增强安全性

最后，在最近几个月，我们加大投入力度，改善Atlas Stream Processing的运行和安全。其中一些亮点包括：

检查点
目前，Atlas Stream Processing通过执行检查点的方式，在处理数据流过程中保存状态信息。由于流处理器处于持续运行状态，无论出现数据问题还是基础设施故障，都需要一种智能的恢复机制来确保它能够持续可靠地运行。通过采用检查点机制，用户可以轻松地从停止收集和处理数据的位置恢复流处理器的正常运行。
Terraform提供商支持
Terraform目前支持创建连接和流处理实例（SPI）。这样可以将基础架构编写为可重复部署的代码。
安全角色
Atlas Stream Processing引入了项目级角色，为用户提供了执行流处理任务所需的完整权限。流处理器能够在特定角色的环境下运行，支持最低权限的配置选项。
Kafka消费者群组支持
Atlas Stream Processing中的流处理器现在采用Kafka消费者群组来进行偏移跟踪。得益于此，用户可以轻松地调整处理器在流操作中的位置，并实时监控潜在的处理器延迟情况。

关于新功能的最后一点说明是，我们计划在Atlas Stream Processing的公开预览版中，开始按照促销价格收取费用，直至可广泛使用的版本正式发布。更多关于Atlas Stream Processing价格的详细信息，请参阅我们的相关文档。

即刻构建您的第一个流处理器

对我们而言，公共预览版的推出是重要的一步，它标志着我们的开发者数据平台得到了扩展，为更多团队提供了流处理解决方案。该解决方案旨在简化构建反应式、响应式和事件驱动型应用程序的操作复杂性，同时提升开发者的体验。

我们非常期待能够看到您构建的内容！

立即登录以开始使用，或在我们的文档、资源、教程或 MongoDB University 上的 Learning Byte 中了解有关 Atlas Stream Processing 的更多信息。

← Previous

最大化增长：在支付领域释放 AI 的力量

人工智能 ( AI ) 技术是银行业不可或缺的一部分。例如，在风险、欺诈和合规等领域，AI 的使用已普及多年，并且还在不断深化。这些举措（以及其他举措）的成功以及释放更多效益的潜力，推动了 2024 年在这一领域的进一步投资，其中生成式人工智能尤其引人关注。金融技术分析机构 Celent 受 MongoDB 和 Icon Solutions 的委托编写了一份报告，该报告深入探讨了当前 AI 在银行业的应用情况，以及在支付领域采用 AI 提高运营敏捷性、实现自动化工作流程以及提高开发人员工作效率的一些关键应用场景。下载 Celent 报告：《利用 AI 在支付领域的优势》，了解如何充分利用 AI 投资，并解锁 AI 为未来支付带来的无限可能。解锁一系列工作流程和产品增强功能如今，AI 技术用于解决各种不同的工作流程和面向客户的服务，从中后台的流程自动化和优化，到实时风险和流动性管理、现金流预测和前台服务个性化等领域。虚拟助手和机器人也已成为客户支持流程的重要组成部分。在本博客中，我们将介绍 Celent 《利用 AI 在支付领域的优势》报告中的一些重要发现，以及这些发现对银行和支付行业意味着什么。高级分析、智能自动化和 AI 技术引领 2024 年投资议程随着时间的推移，银行稳步增加了对项目的投资，以更好、更高效地使用数据。这在一定程度上是由于需要满足客户对数字服务的速度和质量不断提高的期望，同时也反映出人们对账户和交易数据真正价值的理解在不断加深。然而，最重要的是，实现了交付由 AI 和高级分析支持的用例所需的技术。数据分析和人工智能技术支持的项目在全球议程中占据重要地位，这一点不足为奇。对于 33% 的企业银行来说，高级分析和机器学习投资是技术方面的首要任务，高于机器人技术和自动化相关项目（31% 的市场重点）。人工智能和自然语言处理 (NLP) 也不甘落后，28% 的银行将其列为优先事项。许多人也在探索生成式人工智能鉴于生成式 AI 的巨大潜力，人们对生成式 AI 感到兴奋合乎情理，但在 2023 年下半年，对话变得更加微妙。考虑到将大型语言模型 ( LLM ) 应用于潜在敏感客户数据的复杂性，以及对 LLM 输出可解释性（和潜在的可审计性）更广泛的监管担忧，这些都合情合理。也就是说，生成式 AI 已经在许多领域用于支持顾问和关系经理的工作，预计此类领域将进一步创新。根据该报告，58% 的银行正在以某种容量评估或测试生成式 AI，另有 23% 的银行在其路线图中拥有使用该技术的项目。 AI 在支付领域的新兴使用案例和潜在的收入增长在支付产品创新方面，开发者能力不足是银行面临的最大挑战之一。银行认为，在过去两年中，由于资源限制而无法提供的产品增强本来可以促进支付收入增长 5.3%。考虑到这一点以及与 AI 集成带来的革命性变革，金融机构必须考虑如何释放开发者资源，以充分利用这些机会。随着支付行业的不断发展，AI 集成将重塑行业格局，提供创新解决方案，优先考虑安全性、效率和个性化用户体验。AI 在支付领域的新兴使用案例证明了其在塑造未来金融交易方面的变革潜力。利用现代技术，最大限度地采用 AI 在快速发展的 AI 领域，技术不断进步，客户需求日益多元，战略性投资势在必行。为了保持竞争力，银行和支付提供商不仅应关注当前的产品增强，还应推进支付基础设施现代化，满足未来的发展需求。在采用 AI 和 ML 等需要数据作为基础的先进技术时，组织经常会遇到将这些创新技术集成到传统系统中的难题，因为传统系统缺乏灵活性，而且难以修改。例如，增加新的支付渠道和新的客户接入点可能非常困难。利用现代数据平台建立强大的数据架构，使银行能够实时整合和分析任何格式的数据，提供更丰富的支付体验，从而为消费者提供增值服务和功能。以下建议将有助于确保金融服务组织能够大规模释放生成式 AI 的变革潜力，同时确保隐私和安全问题得到充分解决：用最准确和最新的数据训练 AI/ML 模型，从而在面对不断发展的技术时，可以满足适应性和敏捷性的迫切需求。通过统一从后端支付处理到客户交互的数据，银行可以实时洞察客户需求，构建无缝、互联和个性化的客户之旅。数据模式面向未来、灵活多变，能够适应任何数据结构、格式或来源。这种灵活性有助于金融机构实现与各种 AI/ML 平台的无缝集成，使其能够灵活应对 AI 环境的变化，而无需大规模修改基础架构。通过对所有数据进行内置安全控制，解决安全问题。 MongoDB 提供了身份验证（单点登录和多因素身份验证）、基于角色的访问控制以及全面的数据加密等功能，无论是在客户环境中管理还是通过完全托管的云服务 MongoDB Atlas 进行管理，这些功能都能确保强大的安全性。这些安全措施不仅可以有效保护敏感的财务数据，降低外部各方未经授权访问的风险，而且还能使组织放心地采用 AI 和机器学习技术。通过将第三方服务与 API 集成，启动和扩展始终在线和安全的应用程序。 MongoDB 拥有灵活的数据模型，能够处理包括结构化和非结构化数据在内的各类数据，非常适合协调开放式 API 生态系统，使银行、第三方和消费者之间的数据流成为可能。 MongoDB Atlas 开发者数据平台直接向开发者提供强大的 AI 和分析功能，并通过即时整合、接收和处理任何支付数据类型，提供丰富的支付体验。MongoDB Atlas 旨在帮助金融服务机构解决数据挑战。它具有灵活的文档数据模型和无缝的第三方集成功能，这些功能是创建组合支付系统的必要功能，可轻松扩展、始终在线、安全且符合 ACID 标准。保持领先地位 — 立即下载 Celent 报告，解锁 AI 为未来支付带来的无限可能。如果您更喜欢涵盖 Celent、Icon Solutions 和 MongoDB 三方讨论的直观探索，请注册参加我们即将举行的题为“借助 Celent、 Icon Solutions 和 MongoDB，使用 AI 解锁支付领域的新机会”的网络研讨会。如果您想了解有关使用 MongoDB 构建 AI 密集型支付应用程序的更多信息，请查看以下资源：了解金融行业如何利用生成式 AI 在适当的安全控制下打造 AI 密集型支付应用，同时达到用户期望的规模和性能水平注册加入我们的“Atlas for Industries”计划，获取我们的解决方案加速工具以推动创新

February 12, 2024

Next →

MongoDB Database Observability: Integrating with Monitoring Tools

This post is the final in a three-part series on leveraging database observability. Welcome back to our series on Leveraging Database Observability! Our previous post showcased a real-world use case highlighting how MongoDB Atlas’s observability tools effectively tackle database performance challenges. Whether you’re a developer, DBA, or DevOps engineer, our mission is to empower you to harness the full potential of your data through our observability suite . Integrating Atlas metrics with your central enterprise observability tools can simplify your operations. By seamlessly working with popular observability tools, our approach helps teams streamline workflows and enhance visibility across systems. Integrating MongoDB Atlas with third-party monitoring tools MongoDB’s developer data platform combines all essential data services for building modern applications within a unified experience. Our purpose-built observability tools for Atlas environments offer automatic monitoring and optimization, guiding diagnostics tailored specifically for MongoDB. Additionally, we extend Atlas metrics into your existing enterprise observability stack, enabling seamless integration without replacing your current tools. This creates a consolidated, single-pane view that unifies Atlas telemetry with other tech and application metrics, ensuring comprehensive visibility into both database and full-stack performance. This integration empowers you to monitor, receive alerts, and make data-driven decisions within your existing workflows, driving greater efficiency. Below is a quick guide to modifying integration settings through the Atlas UI and the popular integrations we support: Navigate to the Project Integrations page in Atlas. Choose the organization and project you want to configure from the navigation bar. On the Project Integrations page, select the third-party services you’d like to integrate. Configure the chosen services with the required API keys and regions. Critical integrations for your observability platform With Atlas’s Datadog and Prometheus integrations, you can send critical MongoDB metrics to these platforms, empowering detailed, real-time monitoring. Through Datadog , you can track database operation counts, query efficiency, and resource usage, ideal for pinpointing bottlenecks and managing resources. Similarly, Prometheus enables you to monitor essential metrics like query times, connection rates, and memory usage, supporting flexible tracking of database health and performance. Both integrations facilitate proactive detection of issues, alert configuration for resource thresholds, and a cohesive view of Atlas data when visualized in Grafana. Atlas’s integration with PagerDuty streamlines incident management by sending metrics like performance alerts, billing anomalies, and security events directly to PagerDuty. This integration records incidents automatically, notifies teams upon alerts, and supports two-way syncing, ensuring resolved alerts in Atlas are reflected in PagerDuty. It enables efficient incident response and resource allocation to maintain system stability. With Atlas integrations for Microsoft Teams and Slack, you can route key metrics—such as query latency, disk usage, and throughput—to these channels for timely updates. Teams can use these insights for real-time performance monitoring, incident response, and collaboration. Notifications through these platforms ensure your team stays informed on database performance, storage health, and user activity changes as they occur. Use case: Centralized observability with MongoDB Atlas, Datadog, and Slack Let’s walk through a hypothetical scenario for ShopSmart, an e-commerce company that leverages MongoDB Atlas to manage its product catalog and customer data. As traffic surges, the DevOps team faces challenges in monitoring application performance and database health effectively. To tackle these challenges, the team leverages MongoDB Atlas’ integration with Datadog and Slack, creating a powerful observability ecosystem. Integrating MongoDB Atlas with Datadog: The team pushes key MongoDB Atlas metrics into Datadog, such as query performance, connection counts, and Atlas Vector Search metrics. With Datadog, they can visualize these metrics and correlate overall MongoDB performance with their other applications. Out-of-the-box monitors and dedicated dashboards allow the team to track metrics like throughput, average read/write latency, and current connections. This visibility helps pinpoint bottlenecks in real time, ensuring optimal database performance and improving overall application responsiveness. Setting up alerts in Datadog: The team configures alerts for critical metrics like high query latency and increased error rates. When thresholds are breached, Datadog instantly notifies the team. This proactive approach allows the team to address potential performance issues before they impact customers. Integrating Datadog with Slack: To ensure fast communication, alerts are sent directly to the dedicated Slack channel, “ShopSmart-Alerts.” This integration fosters seamless collaboration, enabling the team to discuss and resolve issues in real-time. With these integrations, ShopSmart’s engineering team can monitor performance quickly and address issues efficiently. The unified observability approach enhances operational efficiency, improves the customer experience, and supports ShopSmart’s competitive edge in the e-commerce industry. By leveraging MongoDB Atlas, Datadog, and Slack, the team ensures scalable performance and drives continuous innovation. Conclusion MongoDB Atlas empowers developers and organizations to achieve unparalleled observability and control over their database environments. By seamlessly integrating with central enterprise observability tools, Atlas enhances your ability to monitor performance metrics and ensures you can do so within your existing workflows. This means you can focus on building modern applications confidently, knowing you have the insights and alerts necessary to maintain optimal performance. Embrace the power of MongoDB Atlas and transform your approach to database management—because your applications can thrive when your data is observable. And that wraps up our Leveraging Database Observability series! We hope you learned something new and found value in these discussions. Sign up for MongoDB Atlas , our cloud database service, to see database observability in action. To dive deeper and expand your knowledge, check out this learning byte for more insights on the MongoDB observability suite and how it can enhance your database performance.

November 14, 2024