Section 27
New Project Template
Use this structure to start any new data system.
Folder Structure
bash
/project
/data
/input
/output
/sql
bronze.sql
silver.sql
gold.sql
main.py
requirements.txtrequirements.txt
txt
duckdb
pandas
openpyxlmain.py (Base Template)
python
import duckdb
DB_PATH = "project.duckdb"
def run_sql_file(con, path):
with open(path, "r") as file:
con.execute(file.read())
def main():
con = duckdb.connect(DB_PATH)
run_sql_file(con, "sql/bronze.sql")
run_sql_file(con, "sql/silver.sql")
run_sql_file(con, "sql/gold.sql")
print("Pipeline completed")
if __name__ == "__main__":
main()How to Use
- Copy this template.
- Add your input data to
/data/input. - Update SQL files for your dataset.
- Run:
bash
pip install -r requirements.txt
python main.py