source venv/bin/activate # Windows: venv\Scripts\activatepython run_pipeline.pyOutput: data/processed/master_panel_final.csv ⭐
python scripts/perform_eda.pyFinal Dataset: data/processed/final/master_panel_final.csv
- 171 countries, 2010-2022
- Variables: nutrients, food groups, obesity, population
- Ready for analysis!
- README.md - Full project overview
- PROJECT_GUIDE.md - How the pipeline works
- CLEANUP_SUMMARY.md - What was cleaned up
# Run complete pipeline
python run_pipeline.py
# Run individual step
python scripts/preprocessing/preprocess_fao_data.py
# Run EDA
python scripts/analysis/perform_eda.py
# Create visualizations
python scripts/analysis/interactive_plot.py
# Clean up intermediate files (optional)
python cleanup_intermediate_files.py
# Open notebook
jupyter notebook notebooks/02_eda_visualization.ipynbrun_pipeline.py- Main pipeline scriptmaster_panel_final.csv- Final dataset for analysisscripts/perform_eda.py- Analysis scriptnotebooks/00_raw_data_exploration.ipynb- Start here! Explore raw datanotebooks/01_eda_visualization.ipynb- Main analysis notebooknotebooks/README.md- Notebook guide
That's it! You're ready to go. 🎉