this is intended to provide you with orientation and assistance during the project.
- Define cleare goals and responsibilities for individual milestones and subtasks.
- Define a fixed meeting at least every two weeks to discuss progress together
- if you have any questions, contact your mentor
The following tools should make it easier for you to work together
- Trello: Use it to define goals and tasks and responsibilities (https://trello.com/home)
- Git resp. Github: in consultation with your mentor and the Techlabs team, an official Techlabs project repo where you can work together on your code and version it with git - just like real software development. (https://github.com/)
We recommend to use the following
- R Studio for R: https://www.rstudio.com/
- Pycharm (Community) for Python: https://www.jetbrains.com/pycharm/
- Or more general Visual Studio Code: https://code.visualstudio.com/
- Python Basics: Pandas, SciPy, Numpy, Matplotlib
- Python Machine Learning: scikit-learn, Keras, Pytorch, Tensorflow
- Python: Flask, Django
- JavaScript: React, Vue, Angular
βββ LICENSE
βββ Makefile <- Makefile with commands like `make data` or `make train`
βββ README.md <- The top-level README for developers using this project
βββ data
βΒ Β βββ external <- Data from third party sources.
βΒ Β βββ interim <- Intermediate data that has been transformed
βΒ Β βββ processed <- The final, canonical data sets for modeling
βΒ Β βββ raw <- The original, immutable data dump
β
βββ models <- Trained and serialized models, model predictions, or model summaries
β
βββ notebooks <- Jupyter notebooks
β
βββ references <- Data dictionaries, manuals, and all other explanatory materials
β
βββ reports <- Generated analysis as HTML, PDF, LaTeX, etc
βΒ Β βββ figures <- Generated graphics and figures to be used in reporting
β
βββ requirements.txt <- The requirements file for reproducing the analysis environment,
β e.g. generated with `pip freeze > requirements.txt`
β
βββ setup.py <- makes project pip installable (pip install -e .) so src can be imported
βββ src <- Source code for use in this project.
Β Β βββ __init__.py <- Makes src a Python module
β
Β Β βββ data <- Scripts to download or generate data
Β Β βΒ Β βββ make_dataset.py
β
Β Β βββ features <- Scripts to turn raw data into features for modeling
Β Β βΒ Β βββ build_features.py
β
Β Β βββ models <- Scripts to train models and then use trained models to make
β β predictions
Β Β βΒ Β βββ predict_model.py
Β Β βΒ Β βββ train_model.py
β
Β Β βββ visualization <- Scripts to create exploratory and results oriented visualizations
Β Β βββ visualize.py
Take some time to search for for smaller examples that are already publicly available on Github, so you can get a quick first overview of how different problems could be solved.
- DigitalOcean: https://www.digitalocean.com/
- Heroku: https://www.heroku.com/
Happy Coding :-)