
White Paper

Apache Spark and MongoDB – Turning Analytics into Real-Time Action

Apache Spark is one of the fastest-growing big data projects in the history of the Apache Software Foundation. With its memory-oriented architecture, flexible processing libraries and ease of use, Spark has emerged as a leading distributed computing framework for real-time analytics.

Combining the leading analytics processing engine with the fastest-growing database enables organizations to operationalize sophisticated, real-time analytics. Spark jobs can be executed directly against operational data managed by MongoDB without the time and expense of ETL processes. MongoDB can then efficiently index and serve analytics results back into live, operational processes.

This white paper discusses the analytics capabilities offered by MongoDB and Apache Spark, and provides an overview of when and how to combine them into a real-time analytics engine. The paper concludes with example use cases.


