Software Engineer’s Guide for ramping up on Deep Learning

ChatGPT, DALL-E and Stable Diffusion have been the buzz in 2022. As a software engineer, with no formal education in the field of machine learning or artificial intelligence, I have been watching the latest progress and buzz from the sidelines, and with awe. 

This holiday break I decided to develop a basic understanding of this field and demystify some of these concepts for myself, esp. around the sub-field of deep learning and neural nets. 

I didn’t find something readily available which fit my needs – i.e. a ‘classical’ software engineer with experience in building products and platforms, including data analytics platforms, looking to ramp up in the field of deep learning and software 2.0. Resources on the web were either too high level aimed at a cursory understanding of the field for non-technical folks, or were too low level meant for students or professionals who have had several years of ML and AI experience. 

So here it is, my set of resources assembled for the purpose. Hope you find them useful too!

University courses (Free!)

Important Papers post Transformer era: Here’s my attempt at ‘seminal papers in the field of machine learning and deep learning since 2017’

Podcasts with experts in the field: Won’t teach core concepts but helpful in gaining general understanding of the trends and concepts

Notable Blog Posts: I will just list a few which I think capture the key developments in the field

Please share this post if you found it useful. Or suggest corrections if I missed something. Please also suggest resources that I can add to make this more useful.

I will try to post a broader set of followup papers, data sources, and more hands-on resources in the following blog post. Stay tuned!

Happy Learning,

Abhi Khune

Abhiram Ganesh Khune

One thought on “Software Engineer’s Guide for ramping up on Deep Learning

Leave a reply to richardsprague.eth (@sprague) Cancel reply