“Statistical Notes § Power Analysis for Racial Admixture Studies of Continuous Variables”, 2014-07-17 (; similar):
Miscellaneous statistical stuff
I consider power analysis of a genomic racial admixture study for detecting genetic group differences affecting a continuous trait such as IQ in US African-Americans, where ancestry is directly measured by genome sequencing and the comparisons are all within-family to eliminate confounding by population structure or racism/colorism/discrimination. The necessary sample size for well-powered studies is closely related to the average size of differences in ancestry percentage between siblings, as the upper bound on IQ effect per percentage is small, requiring large differences in ancestry to detect easily. A within-family comparison of siblings, due to the relatively small differences in ancestry between siblings estimated from IBD measurements of siblings, might require n > 50,000 pairs of siblings to detect possible effects on IQ, an infeasible sample size. An alternative design focuses on increasing the available ancestry differences within a family unit by comparing adoptees with siblings; the larger within-population standard deviation of ancestry creates larger & more easily-detected IQ differences. A random-effects meta-analysis of past admixture & ancestry studies suggests the SD in heterogeneous samples may range from 2% to 20% with a mean of 11% (95% predictive interval), yielding sample sizes of n > 20,000, n = 1100924ya, and n = 500. Hence, an adoption study is probably in the feasible range, with required sample sizes comparable to annual adoption rates among US African-Americans.
- Critiques
- “Someone Should Do Something”: Wishlist of Miscellaneous Project Ideas
- Estimating Censored Test Scores
- The Traveling Gerontologist Problem
- Bayes Nets
- Genome Sequencing Costs
- Proposal: Hand-Counting Mobile App for More Fluid Group Discussions
- Air Conditioner Replacement
- Some Ways of Dealing With Measurement Error
- Value of Information: Clinical Prediction Instruments for Suicide
- Bayesian Model Averaging
- Dealing With All-Or-Nothing Unreliability of Data
- Dysgenics Power Analysis
- Power Analysis for Racial Admixture Studies of Continuous Variables
- Operating on an Aneurysm
- The Power of Twins: Revisiting Student’s Scottish Milk Experiment Example
- RNN Metadata For Mimicking Individual Author Style
- MCTS
- Candy Japan A/B Test
- DeFries-Fulker Power Analysis
- Inferring Mean IQs From SMPY/TIP Elite Samples
- Genius Revisited: On the Value of High IQ Elementary Schools
- Great Scott! Personal Name Collisions and the Birthday Paradox
- Detecting Fake (human) Markov Chain Bots
- Optimal Existential Risk Reduction Investment
- Model Criticism via Machine Learning
- Proportion of Important Thinkers by Global Region Over Time in Charles Murray’s Human Accomplishment
- Program for Non-Spaced-Repetition Review of past Written Materials for Serendipity & Rediscovery: Archive Revisiter
- On the Value of New Statistical Methods
- Bayesian Power Analysis: Probability of Exact Replication
- Expectations Are Not Expected Deviations and Large Number of Variables Are Not Large Samples
- Oh Deer: Could Deer Evolve to Avoid Car Accidents?
- Evolution As Backstop for Reinforcement Learning
- Acne: a Good Quantified Self Topic
- Fermi Calculations
- Selective Emigration and Personality Trait Change
- The Most Abandoned Books on GoodReads
- Best Student Ever!
- Little’s Law in the Wild
- Tail Collapse