Jill Augustine
Data Science Toolbox
A list of my current go-to tools
Data Manipulation
pandas, NumPy
Apache Spark
tidyverse
Microsoft Excel (for team members accustomed to pivot tables)
(Real-Time) Data Collection/Extraction
SQL (Hive)
Apache Kafka
Data Storage
Parquet files
Hadoop Distributed File System (HDFS)
SQL databases
Data Visualisation
ggplot2
plotly (Python API)
seaborn
Machine Learning
caret
scikit-learn
Documentation & Project Work
R Markdown
Jupyter Notebooks
Confluence
Jira
Git
© 2021 Jillian Augustine