In this post, I’ll discuss the subject areas and common questions types that you should know for AWS Certified Solutions-Associate exam. I’ve tried to curate a list of the most useful resources from my preparation of the test. I’ll also try to provide key points to remember for some of...
[Read More]
Uploading Kaggle data to AWS S3 bucket
Machine learning in Production: Anti-patterns
Machine learning is quite a popular choice to build complex systems and is often marketed as a quick win solution. Unfortunately, building production grade systems with integration of Machine learning is quite complicated. What makes deployment of an ML system can be broken down due to the following reasons:
[Read More]
Monte Calro simulation using low level TensorFlow (Part I)
Monte Carlo methods are computational algorithms relying on repeated random sampling to solve a variety of optimization, integration and sampling problems. More often than not, one stumbles across an intractable integral only solvable through numerical integration. The field of Physics and Mathematics also rely heavily on Monte Carlo Simulations. Moreover,...
[Read More]
Updates on the blog
My job switched gears a year back as I joined ADDO AI as Chief Data Scientist and Senior Partner. In addition to my responsibilities as a Data Scientist, I get to write responses to RFPs, train data science and data architecture teams, manage team and external partnerships. Unfortunately, I’ve been...
[Read More]
Data Science, the hard way
Internet is replete with posts on ‘how to become a data scientist’. Unfortunately, I’m also adding a post on this topic. The breadth of knowledge expected of a practicing data scientist can be quite daunting. However, one can limit the depth in some of these skills. Here, I’d put together...
[Read More]
Clearing the GCP Professional Data Engineer
This might seem to you as a digression from pure data science. But one of the essential requirements for real world data science and end to end machine learning is setting up of data engineering pipelines. At time, you might just need to pull data in from a database, split...
[Read More]
Air pollution in Peshawar
A friend of mine (Max) was discussing about his presentation on Smart Cities. While going through it, he mentioned a twitter account that posts about air pollution data about Peshawar: PeshawarAir. The tweets are machine generated and are posted every hour. It’s a great step towards open data community and...
[Read More]
Air pollution in Karachi
A friend of mine (Max) was discussing about his presentation on Smart Cities. While going through it, he mentioned a twitter account that posts about air pollution data about Karachi: KarachiAir. The tweets are machine generated and are posted every hour. It’s a great step towards open data community and...
[Read More]
Workshops on Data Science
Lately, it seems that every new project requires some aspects of Data Science. Whether it’s an extract, transform layer (ETL); a machine learning module; work on streams of data or a big data project. Finding trained engineers to meet the market demand has been challenging. Moreover, many of the engineering...
[Read More]