Technics Publications

Julia for Data Science


Julia for Data Science, by Dr. Zacharias Voulgaris

Master how to use the Julia language to solve business critical data science challenges. 

After covering the importance of Julia to the data science community and several essential data science principles, we start with the basics including how to install Julia and its powerful libraries. Many examples are provided as we illustrate how to leverage each Julia command, dataset, and function.

Specialized script packages are introduced and described. Hands-on problems representative of those commonly encountered throughout the data science pipeline are provided, and we guide you in the use of Julia in solving them using published datasets. Many of these scenarios make use of existing packages and built-in functions, as we cover:

  1. An overview of the data science pipeline along with an example illustrating the key points, implemented in Julia
  2. Options for Julia IDEs
  3. Programming structures and functions
  4. Engineering tasks, such as importing, cleaning, formatting and storing data, as well as performing data preprocessing
  5. Data visualization and some simple yet powerful statistics for data exploration purposes
  6. Dimensionality reduction and feature evaluation
  7. Machine learning methods, ranging from unsupervised (different types of clustering) to supervised ones (decision trees, random forests, basic neural networks, regression trees, and Extreme Learning Machines)
  8. Graph analysis including pinpointing the connections among the various entities and how they can be mined for useful insights.


Each chapter concludes with a series of questions and exercises to reinforce what you learned. The last chapter of the book will guide you in creating a data science application from scratch using Julia.


About Zack

Dr. Zacharias Voulgaris was born in Athens, Greece. He studied Production Engineering and Management at the Technical University of Crete, shifted to Computer Science through a Masters in Information Systems & Technology, and then to Data Science through a PhD in Machine Learning. He has worked at Georgia Tech as a Research Fellow, at an e-marketing startup in Cyprus as an SEO manager, and as a Data Scientist in both Elavon (GA) and G2 Web Services (WA). He also was a Program Manager at Microsoft on a data analytics pipeline for Bing. Zacharias has authored several books on Data Science, mentors aspiring data scientists, and maintains a Data Science and AI blog. Currently, he works as a consultant at GLG.


Faculty may request complimentary digital desk copies

Please complete all fields.