Skip to content

Conduct an inference statistical analysis that cover hypothesis testing, correlation, regression and chi-square test based on dataset

Notifications You must be signed in to change notification settings

Nurunnajwa12/Korean-Income-and-Welfare

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

16 Commits
 
 
 
 
 
 

Repository files navigation

Korean Income and Welfare

seoul-aesthetic-sky-oirqhpbhjvu3vvim

This project are require to conduct an inference statistical analysis that cover the following analysis based on dataset https://www.kaggle.com/datasets/hongsean/korea-income-and-welfare.

Data

Original data : https://www.kaggle.com/datasets/hongsean/korea-income-and-welfare.

Clean data : Download through repository file (Korea-Income-Clean.xlsx)

Hypothesis Testing (Two Sample)

gend

Variables: Income and Gender.

We will test whether the mean of income of male is different than the mean of income of female at 95% confidence level, assuming unequal variances.

Analysis : Since t0= 0.347 is less than 1.96, we fail to reject the H0 at significant level, 0.05. Conclusion : There is no sufficient evidence that mean income of male is different from mean income of female.

Correlation Test

baru

Variables: Income and Family Members

We observe the strength of relationships between the two variables above by using Pearson product moment correlation.

Conclusion: r=0.267 ->relatively weak positive linear relationship between family members and income.

Regression Test

incomey

Independent value= Year born Dependent value = income

We want to predict the value of income based on the year born.

Conclusion:

  • No year born has 0 year, b0 = -88674.14 is the portion of income not explained by year born.
  • b1 = 45.73 tell us that average income increases by 45.73 won, for each additional one year born.
  • R squared = 0.078 (between 0 and 1)
  • Therefore, weaker linear relationship between income and year born.
  • Some but not all of the value of income explained by yea born.
  • 7.8% of the variation in income explained by variation in year born.

Chi Square Independent Test

edumar

Variables: Marriage and Educational Level

We want to test whether the educational level and marriage is independent or not.

Analysis: Since p-value <0.05, we reject H0 at significant level of 0.05. Conclusion: There is sufficient evidence that educational level not independent to marriage

Language

R Programming

Lessons Learned

For two-sample hypothesis tests, we fail to reject the null hypothesis. Thus, the mean of male income is equal to the standard of female income. Even though South Korea has faced gender inequality since a long time ago, the government is trying to close the gender gap, from government support for paternity leave to the role of the private sector in boosting women's careers.

For correlation, the number of family members has a weaker positive relationship with income. In addition, we found that as each year is born, the income also increases by regression. This especially happens to people in their 30's to 50's. While, for the goodness of fit tests, we know that the proportion of doctoral degrees is the lowest. Besides, the Chi-Square Independent Test depicts that education and marriage are not independent.

The most exciting finding from our results is the result of two sample hypothesis testing indicate that 6 basic pillars of South Korea economy in term on gender equality where male and femake can hold same position without discrimination. In addition, income is not dependent on family members. Besides, the person that was born between 1960 until 1990 has made a strong base towards a well developed South Korea as they have become one of Four Asian Dragons with rapid industrialization and maintained exceptionally high growth rates.

Contributing

Contributions are always welcome!

See contributing.md for ways to get started.

Please adhere to this project's code of conduct.

Feedback

If you have any feedback, please reach out to us at [email protected]

About

Conduct an inference statistical analysis that cover hypothesis testing, correlation, regression and chi-square test based on dataset

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages