Skip to content

Commit

Permalink
Update breadth-or-depth.md
Browse files Browse the repository at this point in the history
  • Loading branch information
lawrennd authored Nov 3, 2023
1 parent 9a16ac3 commit dcf271c
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion _data-science/includes/breadth-or-depth.md
Original file line number Diff line number Diff line change
Expand Up @@ -49,7 +49,7 @@
* *\dataDim* roughly equal to *\numData*?
* Stratification of populations: batch effects etc.
}
\notes{Classical approaches to data analysis made use of many subjects to achieve statistical power. Traditionally, we measure a few things about many people. For example cardiac disease risks can be based on a limited number of factors inmany patients (such as whether the patient smokes, blood pressure, cholesterol levels etc). Because, traditionally, data matrices are stored with individuals in rows and features in columns[^depth-measurement], we refer to this as *depth* of measurement. In statistics this is sometimes known as the *large $p$, small $n$* domain because traditionally $p$ is used to denote the number of features we know about an individual and $n$ is used to denote the number of individuals.
\notes{Classical approaches to data analysis made use of many subjects to achieve statistical power. Traditionally, we measure a few things about many people. For example cardiac disease risks can be based on a limited number of factors in many patients (such as whether the patient smokes, blood pressure, cholesterol levels etc). Because, traditionally, data matrices are stored with individuals in rows and features in columns[^depth-measurement], we refer to this as *depth* of measurement. In statistics this is sometimes known as the *large $p$, small $n$* domain because traditionally $p$ is used to denote the number of features we know about an individual and $n$ is used to denote the number of individuals.

[^depth-measurement]: In statistics this is known as a *design matrix*, representing the design of a study. But in databases, one might think of each patient being in a row, or record of the database.}

Expand Down

0 comments on commit dcf271c

Please sign in to comment.