gravatar

faiazrmn

Faiaz Khan

Recently Published

Routes in Austin Bike Share Data
Plot and find the most common routed in Austin bike sharing data from kaggle !
Practice SQL With R Dataframes
purrr, map, map2
Explore different ways to write same map and map2 functions
Simulate with purrr package
Avoid for loop by using purrr package and map2 function to simulate normally distributed random variable for hypothesis testing
100s of Regression Models in R
10 lines of R code to fit a regression model to each country of the world to estimate yearly population growth rate for each country.
Predict Penguin Gender With Repeated CV Logistic Regression
We fit a Logistic Regression model with repeated CV and achieve 95% accuracy in predicting the gender of the penguins ! Also tune the model further for more accuracy and precision !
Austin Bike Share Data EDA
Visualise Consistency of Sample Estimate with Animated Chart
In this example, we take sample from a binomial distribution with P = 0.50. We take 100 samples continuously, and at each sample we estimate P from the sample using sample observations. We see as the sample size reaches 100, the estimate converges very close to 0.50.
Visualise Multicollinearity
Multicollinearity occurs when there is significant correlation in predictor (x variable) variable in regression model. This causes serious problems, such as high variance and non-significance in coefficients (beta). We can visualize this using a simple 3d chart as given below. Press Right Arrow (->) on your keyboard to change slides. Use your mouse to move the 3D charts.
EDA of Global Warming
Team Performance Dashboard in Rmarkdown
This is a Team Performance Tracking Dashboard. This is a big team. The team is called Engineering and Analytics team at a company. Engineering team has two subteams > IT and Infrastructure. Analytics Team has three subteams > Data Analyst, Data Scientist, Senior Data Scientist. This dashboard shows Team, Subteam and individual member's performance. The performance is evaluated by respective managers of the subteams. And all the subteams are ranked by HR manager (Theodor). The beauty of this dashboard is that every month once the data is ready, the dashboard takes just a click to make for the new data.