The Journey Begins

I have started on a journey to learn more about data science. There is a lot of science and statistics behind using big data tools, and this blog is where I make a few notes along the way.

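One of the things I learnt about today was the t-test. There are two common variants: Student's t-test, which assumes the two samples have equal variances, and Welch's t-test, which does not make that assumption. As used here, both are hypothesis tests on two samples. The t-test can be used, for example, to determine whether the means of two sets of data are significantly different from each other.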

Below is a piece of Python code that runs the Welch t-test using the SciPy library.
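A minimal sketch of such a test, assuming SciPy's `scipy.stats.ttest_ind` called with `equal_var=False` (which selects the Welch variant). The two samples here are synthetic stand-ins generated with NumPy, not the baseball data, and the helper name `welch_t_test`, the group names and the 0.05 significance level are my own assumptions for illustration.

```python
import numpy as np
from scipy import stats


def welch_t_test(sample_a, sample_b, alpha=0.05):
    """Run Welch's t-test on two independent samples.

    Returns the t statistic, the two-sided p-value, and whether the
    difference in means is significant at the given alpha.
    (Hypothetical helper; alpha=0.05 is an assumed threshold.)
    """
    # equal_var=False tells SciPy not to assume equal variances,
    # which is what makes this Welch's test rather than Student's.
    t_stat, p_value = stats.ttest_ind(sample_a, sample_b, equal_var=False)
    return float(t_stat), float(p_value), bool(p_value < alpha)


# Synthetic example data (assumption: not real measurements).
# The two groups differ in mean, spread and size.
rng = np.random.default_rng(0)
group_a = rng.normal(loc=0.270, scale=0.030, size=200)
group_b = rng.normal(loc=0.250, scale=0.020, size=150)

t_stat, p_value, significant = welch_t_test(group_a, group_b)
print("t statistic:", t_stat)
print("p-value:", p_value)
print("significant at 5% level:", significant)
```

A small p-value (below the chosen alpha) means the null hypothesis of equal means is rejected. Real data would be passed in place of `group_a` and `group_b` as two array-like sequences of numbers.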


The baseball data is from the Lahman database.
