Data science toolbox
A list of tools used in previous projects. Of course, every project is unique with its specific needs, and new innovative tools appear rapidly. Please feel free to suggest your own tools.
Activity |
Tool |
Data & image processing |
Python Pandas, Numpy, Scipy, PIL, OpenCV, Albumentations, R, Matlab |
Machine learning |
Scikit-learn logistic regression, random forest, gradient boosting; time-series forecasting using ARIMA, LSTM, Prophet |
Computer vision |
Keras with backend TensorFlow, PyTorch, Image classification and regression: CNN -- basic architecture to more complex such as EfficientNet, SwinTransformer. Object detection: Yolo, R-CNN, |
Deployment and dashboards |
Flask, Curl, FastAPI, Gradio, Streamlit, Dash |
Visualization |
Plotly, Matplotlib, Bokeh, GGplot, Seaborn, Grafana |
Big data, databases and ETL |
Spark, Dask, H2O, PostgreSQL |
Cloud |
Amazon Webservices, Sagemaker (AWS), Google Cloud Platform, Hidora |
OS & development platforms |
Linux, Docker, GitHub |
Communication |
Slack, Google Meet, Jupyter notebook, Spyder, R Studio |