A binomial distribution is a probability of SUCCESS or FAILURE in an experiment that is repeated many times. It is call Binominal because we will have only two outcomes. Binomial is a discrete because the values are fixed For Eg: I want to roll a dice and expecting six, I will either get six (SUCCESS)Continue reading “Binomial distribution”
Category Archives: Machine Learning
How to calculate Gain Ratio
As we discussed in one of our article about How and when does the Decision tree stop splitting? Gain Ratio is modification of information gain that reduces its bias. Gain ratio overcomes the problem with information gain by taking into account the number of branches that would result before making the split.It corrects information gainContinue reading “How to calculate Gain Ratio”
How to calculate area using left and right Z table
We need to find the left area of the Z score 0.70 ,see the below.screenshot for more idea First we will find area by using below left Z table. Step 1: If we have any second level decimal values like 0.71 then we need to split in to 0.7 and 0.01, see the above tableContinue reading “How to calculate area using left and right Z table”
Z table transformation
We will use below table to calculate our probability. Fig 1: Lets solve some of the below problems to get more idea on transformation. What is the probability of P(z<=1.5) We need to find out the below highlighted area. To find the answer using the Z-table(see the Fig 1:), find whereContinue reading “Z table transformation”
Types of regression
When I ask what are the types of regression for the beginners, the expected answer will be linear regression and Logistic regression, because these are the two algorithms that all beginners will start with. Now coming to the point we have five types of regression. Linear regression Logistic regression Polynomial regression Ridge Regression Lasso RegressionContinue reading “Types of regression”
Classification Accuracy,Confusion matrix, AUC and ROC
we already discussed how to calculate accuracy for linear regression with the help of R-Square, Adjusted R-Square, MSE etc..Can we use the same mechanism to calculate the accuracy for classification problem? Simple answer is NO, we have different mechanism to calculate accuracy for classification problems. Classification Accuracy: It is very simple to calculate, number ofContinue reading “Classification Accuracy,Confusion matrix, AUC and ROC”
Range, Interquartile Range and Percentiles
Range(measures of variability): The range of a set of data is the difference between the largest and smallest values. Range might affect with Outliers like Mean. Interquartile Range(measures of variability) Interquartile range gives another measure of variability like Variance, Standard deviation. It is a better measure of dispersion than range because it leaves out the extremeContinue reading “Range, Interquartile Range and Percentiles”
How to calculate Simple Liner Regression
Lets calculate linear regression for the below dataset. We have age of the person which we will denote as X and sugar level of the person which we will denote as Y Step 1) To know how to calculate mean refer link. Step 2) To know how to calculate standard deviation refer link Step 3)Continue reading “How to calculate Simple Liner Regression”
Cost/Loss function
Let’s say we started a mobile manufacturing company last month and I want a new cost study to improve my next month’s budget forecasts.I pay 500$ for rent and 100$ for electricity bill and for manufacturing each mobile we need 50$ and my budget is 4500$ The equation of cost function is C(x)= FC +Continue reading “Cost/Loss function”
How and when does the Decision tree stop splitting?
By default Splitting will stop when the tree reaches 100% purity, means when the child/subset node has homogeneous/single class or we can also say when child/subset node is pure(means all classes will be either Yes or No), this will lead to overfitting problem. In simple when my algorithm learned everything from my training data, It willContinue reading “How and when does the Decision tree stop splitting?”