For this project, I will be working to understand the results of an A/B test run by an e-commerce website if they should implement the new page, keep the old page, or perhaps run the experiment longer to make their decision.
This project includes the following contents:
- Introduction
- Part I: Probability
- Part II: A/B test
- Part III: Regression
- Conclusion
Before Part I: Probability, I will perform data cleaning such as checking missing data, discrepancies between the columns etc.
This project is using Python3. The packages are used in this project including Numpy, Panda, random, matplotlib.pyplot, scipy.stats, and statsmodels.api.
Variable Name | Metadata |
---|---|
user_id | 6-digit numbers |
timestamp | string |
group | string: control, treatment |
landing_page | string: old_page, new_page |
converted | numeric: 0:No, 1:Yes |
Variable Name | Metadata |
---|---|
user_id | 6-digit numbers |
country | string: US, CA, UK |
- A/B Test
- Two-Proportion z Test
- Logistic Regression
The reports are generated to three formats, .ipynb, .pdf and .html: