GridFS
On this page
Overview
In this guide, you can learn how to store and retrieve large files in MongoDB using the GridFS specification. GridFS splits large files into chunks and stores each chunk as a separate document. When you query GridFS for a file, the driver assembles the chunks as necessary. The driver implementation of GridFS is an abstraction that manages the operations and organization of the file storage.
Use GridFS if the size of your files exceeds the BSON document size limit of 16 MB. GridFS also helps you access files without loading the entire file into memory. For more detailed information on whether GridFS is suitable for your use case, see the GridFS server manual page.
How GridFS Works
GridFS organizes files in a bucket, a group of MongoDB collections that contain the chunks of files and information describing them. The bucket contains the following collections:
The
chunks
collection, which stores the binary file chunks.The
files
collection, which stores the file metadata.
When you create a new GridFS bucket, the driver creates the preceding
collections. The default bucket name fs
prefixes the collection names,
unless you specify a different bucket name. The driver creates the new GridFS
bucket during the first write operation.
The driver also creates an index on each collection to ensure efficient retrieval of the files and related metadata. The driver creates indexes if they do not already exist and when the bucket is empty. For more information on GridFS indexes, see the server manual page on GridFS Indexes.
When storing files with GridFS, the driver splits the files into smaller
chunks, each represented by a separate document in the chunks
collection.
It also creates a document in the files
collection that contains
a file ID, file name, and other file metadata. The following diagram shows
how GridFS splits the uploaded files:
When retrieving files, GridFS fetches the metadata from the files
collection in the specified bucket, then uses that information to reconstruct
the file from documents in the chunks
collection. You can read the file
into memory or output it to a stream.
Use GridFS
To learn about GridFS operations and how to perform them, navigate to the following sections:
Create a GridFS Bucket
To store or retrieve files from GridFS, create a bucket or get a reference to
an existing bucket on a MongoDB database. To create a GridFSBucket
instance,
call the NewBucket()
method with a database parameter:
db := client.Database("db") bucket, err := gridfs.NewBucket(db) if err != nil { panic(err) }
Note
If a GridFS bucket already exists, the NewBucket()
method returns a
reference to the bucket rather than instantiating a new one.
By default, the new bucket is named fs
. To instantiate a bucket with a
custom name, call the SetName()
method on a BucketOptions
instance as
follows:
db := client.Database("db") opts := options.GridFSBucket().SetName("custom name") bucket, err := gridfs.NewBucket(db, opts) if err != nil { panic(err) }
Upload Files
You can upload a file into a GridFS bucket in one of the following ways:
Use the
UploadFromStream()
method, which reads from an input stream.Use the
OpenUploadStream()
method, which writes to an output stream.
For either upload process, you can specify configuration information on an instance
of UploadOptions
. For a full list of UploadOptions
fields, visit the
API documentation.
Upload with an Input Stream
To upload a file with an input stream, use the UploadFromStream()
method
with the following parameters:
Your file name
An
io.Reader
, with your opened file as a parameterAn optional
opts
parameter to modify the behavior ofUploadFromStream()
The following code example reads from a file called file.txt
and uploads the
content to a GridFS bucket. It uses an opts
parameter to set file metadata:
file, err := os.Open("path/to/file.txt") uploadOpts := options.GridFSUpload().SetMetadata(bson.D{{"metadata tag", "first"}}) objectID, err := bucket.UploadFromStream("file.txt", io.Reader(file), uploadOpts) if err != nil { panic(err) } fmt.Printf("New file uploaded with ID %s", objectID)
New file uploaded with ID 62e00...
Upload with an Output Stream
To upload a file with an output stream, use the OpenUploadStream()
method
with the following parameters:
Your file name
An optional
opts
parameter to modify the behavior ofOpenUploadStream()
The following code example opens an upload stream on a GridFS bucket and sets
the number of bytes in each chunk with an opts
parameter. Then, it calls
the Write()
method on the content of file.txt
to write its content to
the stream:
file, err := os.Open("path/to/file.txt") if err != nil { panic(err) } // Defines options that specify configuration information for files // uploaded to the bucket uploadOpts := options.GridFSUpload().SetChunkSizeBytes(200000) // Writes a file to an output stream uploadStream, err := bucket.OpenUploadStream("file.txt", uploadOpts) if err != nil { panic(err) } fileContent, err := io.ReadAll(file) if err != nil { panic(err) } var bytes int if bytes, err = uploadStream.Write(fileContent); err != nil { panic(err) } fmt.Printf("New file uploaded with %d bytes written", bytes) // Calls the Close() method to write file metadata if err := uploadStream.Close(); err != nil { panic(err) }
Retrieve File Information
You can retrieve file metadata stored in the files
collection of the GridFS
bucket. Each document in the files
collection contains the following
information:
The file ID
The file length
The maximum chunk size
The upload date and time
The file name
A
metadata
document in which you can store any other information
To retrieve file data, call the Find()
method on a GridFSBucket
instance. You can pass a query filter as an argument to Find()
to match
only certain file documents.
Note
The Find()
method requires a query filter as a parameter. To match all
documents in the files
collection, pass an empty query filter to Find()
.
The following example retrieves the file name and length of documents in the
files
collection with length
values greater than 1500
:
filter := bson.D{{"length", bson.D{{"$gt", 1500}}}} cursor, err := bucket.Find(filter) if err != nil { panic(err) } type gridfsFile struct { Name string `bson:"filename"` Length int64 `bson:"length"` } var foundFiles []gridfsFile if err = cursor.All(context.TODO(), &foundFiles); err != nil { panic(err) } for _, file := range foundFiles { fmt.Printf("filename: %s, length: %d\n", file.Name, file.Length) }
Download Files
You can download a GridFS file in one of the following ways:
Use the
DownloadToStream()
method to download a file to an output stream.Use the
OpenDownloadStream()
method to open an input stream.
Download a File to an Output Stream
You can download a file in a GridFS bucket directly to an output stream using the
DownloadToStream()
method. DownloadToStream()
takes a file ID and an
io.Writer
as parameters. The method downloads the file with the specified
file ID and writes to the io.Writer
.
The following example downloads a file and writes to a file buffer:
id, err := primitive.ObjectIDFromHex("62f7bd54a6e4452da13b3e88") fileBuffer := bytes.NewBuffer(nil) if _, err := bucket.DownloadToStream(id, fileBuffer); err != nil { panic(err) }
Download a File to an Input Stream
You can download a file in a GridFS bucket to memory with an input stream using
the OpenDownloadStream()
method. OpenDownloadStream()
takes a file ID as
a parameter and returns an input stream from which you can read the file.
The following example downloads a file into memory and reads its contents:
id, err := primitive.ObjectIDFromHex("62f7bd54a6e4452da13b3e88") downloadStream, err := bucket.OpenDownloadStream(id) if err != nil { panic(err) } fileBytes := make([]byte, 1024) if _, err := downloadStream.Read(fileBytes); err != nil { panic(err) }
Rename Files
You can update the name of a GridFS file in your bucket by using the Rename()
method. Pass a file ID value and a new filename
value as arguments to
Rename()
.
The following example renames a file to "mongodbTutorial.zip"
:
id, err := primitive.ObjectIDFromHex("62f7bd54a6e4452da13b3e88") if err := bucket.Rename(id, "mongodbTutorial.zip"); err != nil { panic(err) }
Delete Files
You can remove a file from your GridFS bucket by using the Delete()
method.
Pass a file ID value as an argument to Delete()
.
The following example deletes a file:
id, err := primitive.ObjectIDFromHex("62f7bd54a6e4452da13b3e88") if err := bucket.Delete(id); err != nil { panic(err) }
Delete a GridFS Bucket
You can delete a GridFS bucket by using the Drop()
method.
The following code example deletes a GridFS bucket:
if err := bucket.Drop(); err != nil { panic(err) }
Additional Resources
To learn more about GridFS and its operations, visit the GridFS manual page.
API Documentation
To learn more about the methods or types discussed in this guide, see the following API Documentation: