@brookskindle
Created May 7, 2018 00:29
Generate fake census data with the `faker` module
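from faker import Faker
import pandas as pd

# Seed the shared random generator so every run produces the same data.
# Recent Faker releases reject seed() on an instance; use the class method.
user_factory = Faker()
Faker.seed(1000)
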
def get_fake_user():
    """Build one fake census record with a name, state, and age."""
    # Choose a gender at random, then draw a matching first name.
    is_male = user_factory.boolean()
    if is_male:
        name = user_factory.first_name_male()
    else:
        name = user_factory.first_name_female()
    user = {
        "name": name,
        "state": user_factory.state(),
        "age": user_factory.random_int(min=25, max=75),
    }
    return user


# Generate 200 records and write them out without the DataFrame index.
users = pd.DataFrame([get_fake_user() for _ in range(200)])
users.to_csv("census.csv", index=False)
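
A quick way to sanity-check the generated file is to read it back and summarize it. The sketch below is not part of the original gist: `summarize_census` is a hypothetical helper, the sample rows are invented, and it uses only the standard library. It assumes the same `name`, `state`, `age` columns that the script writes.

```python
import csv
import io
from collections import Counter


def summarize_census(csv_file):
    """Return (row count, per-state counts, (min age, max age)) for a census CSV."""
    rows = list(csv.DictReader(csv_file))
    ages = [int(row["age"]) for row in rows]
    states = Counter(row["state"] for row in rows)
    return len(rows), states, (min(ages), max(ages))


# Invented sample rows in the same layout as census.csv (illustration only).
sample = io.StringIO(
    "name,state,age\n"
    "Alice,Ohio,34\n"
    "Bob,Texas,61\n"
    "Carol,Ohio,25\n"
)
count, states, age_range = summarize_census(sample)
print(count, states.most_common(1), age_range)
```

To check the real output, pass an open file instead, for example `with open("census.csv", newline="") as f: summarize_census(f)`. The row count should be 200 and the ages should fall between 25 and 75.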