Docs Menu
Docs Home
/
MongoDB Atlas
/ /

Sample Mflix Dataset

On this page

  • Collections
  • sample_mflix.comments
  • sample_mflix.embedded_movies
  • sample_mflix.movies
  • sample_mflix.sessions
  • sample_mflix.theaters
  • sample_mflix.users

The sample_mflix database contains data on movies and movie theaters. The database also contains collections for certain metadata, including users and comments on specific movies.

To learn how to load the sample data provided by Atlas into your cluster, see Load Sample Data.

The sample_mflix database contains the following collections:

Collection Name
Description

Contains comments associated with specific movies.

Contains details on movies in the Western, Action, and Fantasy genres from sample_mflix.movies collection with an added plot_embedding field that contains embeddings created using OpenAI's text-embedding-ada-002 embedding model that you can use with Atlas Vector Search.

Contains movie information, including release year, director, and reviews.

Metadata field. Contains users' JSON Web Tokens.

Contains locations of movie theaters.

Contains user information.

This collection contains comments associated with specific movies. Each document contains the comment text, the user who submitted it, and the movie the comment applies to.

This collection contains the following indexes:

Name
Index
Description

_id_

{ "_id": 1 }

Primary key index on the _id field.

{
"_id": {
"$oid": "5a9427648b0beebeb69579cc"
},
"name": "Andrea Le",
"email": "andrea_le@fakegmail.com",
"movie_id": {
"$oid": "573a1390f29313caabcd418c"
},
"text": "Rem officiis eaque repellendus amet eos doloribus. Porro
dolor voluptatum voluptates neque culpa molestias. Voluptate unde
nulla temporibus ullam.",
"date": {
"$date": {
"$numberLong": "1332804016000"
}
}
}

This collection contains details on movies with genres of Western, Action, or Fantasy. Each document contains a single movie, and information such as its title, release year, and cast.

In addition, documents in this collection include a plot_embedding field that contains embeddings created using OpenAI's text-embedding-ada-002 embedding model that you can use with Atlas Vector Search.

This collection contains the following indexes:

Name
Index
Description

_id_

{ "_id": 1 }

Primary key index on the _id field.

Note

The number of dimensions in the sample have been truncated for readability.

{
"_id": {
"$oid": "573a1396f29313caabce582d"
},
"plot": "A young swordsman comes to Paris and faces villains, romance, adventure and intrigue with three Musketeer friends.",
"genres": ["Action", "Adventure", "Comedy"],
"runtime": {
"$numberInt": "106"
},
"rated": "PG",
"cast": ["Oliver Reed", "Raquel Welch", "Richard Chamberlain", "Michael York"],
"num_mflix_comments": {
"$numberInt": "0"
},
"poster": "https://m.media-amazon.com/images/M/MV5BODQwNmI0MDctYzA5Yy00NmJkLWIxNGMtYzgyMDBjMTU0N2IyXkEyXkFqcGdeQXVyMjI4MjA5MzA@._V1_SY1000_SX677_AL_.jpg",
"title": "The Three Musketeers",
"lastupdated": "2015-09-16 06:21:07.210000000",
"languages": ["English"],
"released": {
"$date": {
"$numberLong": "133747200000"
}
},
"directors": ["Richard Lester"],
"writers": ["George MacDonald Fraser (screenplay)", "Alexandre Dumas père (novel)"],
"awards": {
"wins": {
"$numberInt": "4"
},
"nominations": {
"$numberInt": "7"
},
"text": "Won 1 Golden Globe. Another 3 wins & 7 nominations."
},
"year": {
"$numberInt": "1973"
},
"imdb": {
"rating": {
"$numberDouble": "7.3"
},
"votes": {
"$numberInt": "11502"
},
"id": {
"$numberInt": "72281"
}
},
"countries": ["Spain", "USA", "Panama", "UK"],
"type": "movie",
"tomatoes": {
"viewer": {
"rating": {
"$numberDouble": "3.5"
},
"numReviews": {
"$numberInt": "9600"
},
"meter": {
"$numberInt": "78"
}
},
"dvd": {
"$date": {
"$numberLong": "982022400000"
}
},
"critic": {
"rating": {
"$numberDouble": "7.1"
},
"numReviews": {
"$numberInt": "11"
},
"meter": {
"$numberInt": "82"
}
},
"lastUpdated": {
"$date": {
"$numberLong": "1441307415000"
}
},
"rotten": {
"$numberInt": "2"
},
"production": "Live Home Video",
"fresh": {
"$numberInt": "9"
}
},
"plot_embedding": [
-0.004237316,
-0.022958077,
-0.005921211,
-0.020323543,
0.010051459
]
}

This collection contains details on movies. Each document contains a single movie, and information such as its title, release year, and cast.

This collection contains the following indexes:

Name
Index
Description
Properties

_id_

{ "_id": 1 }

Primary key index on the _id field.

cast_text_fullplot_text_genres_text_title_text

{ "_fts": "text", "_ftsx": 1 }

Text index on the cast, fullplot, genres, and title fields.

{
"_id": {
"$oid": "573a1390f29313caabcd413b"
},
"title": "The Arrival of a Train",
"year": {
"$numberInt": "1896"
},
"runtime": {
"$numberInt": "1"
},
"released": {
"$date": {
"$numberLong": "-2335219200000"
}
},
"poster": "http://ia.media-imdb.com/images/M/MV5BMjEyNDk5MDYzOV5BMl5BanBnXkFtZTgwNjIxMTEwMzE@._V1_SX300.jpg",
"plot": "A group of people are standing in a straight line along the
platform of a railway station, waiting for a train, which is seen
coming at some distance. When the train stops at the platform, ...",
"fullplot": "A group of people are standing in a straight line along
the platform of a railway station, waiting for a train, which is
seen coming at some distance. When the train stops at the platform,
the line dissolves. The doors of the railway-cars open, and people
on the platform help passengers to get off.",
"lastupdated": "2015-08-15 00:02:53.443000000",
"type": "movie",
"directors": [
"Auguste Lumière",
"Louis Lumière"
],
"imdb": {
"rating": {
"$numberDouble": "7.3"
},
"votes": {
"$numberInt": "5043"
},
"id": {
"$numberInt": "12"
}
},
"cast": [
"Madeleine Koehler"
],
"countries": [
"France"
],
"genres": [
"Documentary",
"Short"
],
"tomatoes": {
"viewer": {
"rating": {
"$numberDouble": "3.7"
},
"numReviews": {
"$numberInt": "59"
}
},
"lastUpdated": {
"$date": {
"$numberLong": "1441993589000"
}
}
},
"num_mflix_comments": {
"$numberInt": "1"
}
}

This collection contains metadata about users. Each document contains a user and their corresponding JSON Web Token

This collection contains the following indexes:

Name
Index
Description
Properties

_id_

{ "_id": 1 }

Primary key index on the _id field.

user_id_1

{ "user_id" : 1}

Ascending index on the user_id field.

{
"_id": {
"$oid": "5a98348755593fdf68350932"
},
"user_id": "bfb9vc1zz@xhasq.5h9",
"jwt": "eyJ0eXAiOiJKV1QiLCJhbGciOiJIUzI1NiJ9..."
}

This collection contains movie theater locations. Each document contains a single movie theater and its location in both string and GeoJSON forms.

Name
Index
Description
Properties

_id_

{ "_id": 1 }

Primary key index on the _id field.

geo index

{ "location.geo": "2dsphere" }

Geospatial index on the location.geo field.

{
"_id": {
"$oid": "59a47286cfa9a3a73e51e72c"
},
"theaterId": {
"$numberInt": "1000"
},
"location": {
"address": {
"street1": "340 W Market",
"city": "Bloomington",
"state": "MN",
"zipcode": "55425"
},
"geo": {
"type": "Point",
"coordinates": [
{
"$numberDouble": "-93.24565"
},
{
"$numberDouble": "44.85466"
}
]
}
}
}

This collection contains information on mflix users. Each document contains a single user, and their name, email, and password.

Name
Index
Description
Properties

_id_

{ "_id": 1 }

Primary key index on the _id field.

email_1

{ "email: 1 }

Unique, ascending index on the email field.

{
"_id": {
"$oid": "59b99db4cfa9a34dcd7885b6"
},
"name": "Ned Stark",
"email": "sean_bean@gameofthron.es",
"password": "$2b$12$UREFwsRUoyF0CRqGNK0LzO0HM/jLhgUCNNIJ9RJAqMUQ74crlJ1Vu"
}

Back

Guides