All Collections
Atlas Frequently Asked Questions
How to Generate Synthetic Data with the Airbnb Dataset
How to Generate Synthetic Data with the Airbnb Dataset
Kabir Ghai avatar
Written by Kabir Ghai
Updated over a week ago
  1. Connect to the MongoDB Shell

    The MongoDB Shell is an interactive JavaScript interface to MongoDB. You can use the MongoDB Shell to quickly and easily create, query, and update data.

  2. Install a random data generator package

    To generate large volumes of synthetic data for development and testing, install a random data generator on the command line. In this example we use Falso.

    $ npm install @ngneat/falso

  3. Run this script in the shell to generate your data

    This script creates a dataset with 1,000 documents following the document pattern provided:

    const falso = require('@ngneat/falso');
    const data = [];
    for (let i = 0; i < 1000; i++) {
    data.push({
    listing_url: falso.randUrl(),
    summary: falso.randParagraph(),
    property_type: falso.randAbbreviation(),
    pricing: {
    weekly_price: falso.randNumber({min:100,max: 1000}),
    max_weeks: falso.randNumber({min:0,max: 52}),
    extra_people_fee: falso.randNumber({min:0,max: 500})
    }
    });
    }
    use('airbnb');
    db.listings.insertMany(data);

    Edit the JSON document to modify the data you generate.
    โ€‹
    Change airbnb and listings to customize the name of your dataset.

Did this answer your question?