Project Overview
The PRS Calculator is an open-source tool designed to help individuals understand their genetic predisposition to various health conditions. By analyzing DNA data from consumer genetic testing services (such as 23andMe or AncestryDNA), we calculate polygenic risk scores for over 50 diseases and conditions.
Our mission is to bridge the gap between cutting-edge genetic research and individuals seeking to understand their health risks. We believe that access to genetic information should be democratized while maintaining the highest standards of scientific accuracy and responsible communication of results.
This project leverages validated polygenic scores from the PGS Catalog, ensuring that all calculations are based on peer-reviewed research and meet rigorous quality standards.
Key Features
50+ Disease Scores
Comprehensive coverage of cardiovascular, oncological, metabolic, neurological, psychiatric, and autoimmune conditions using validated PGS Catalog scores.
Privacy-First Design
Your genetic data never leaves your device. All calculations are performed locally in your browser, ensuring complete data privacy.
Multiple Ancestries
Ancestry-specific population normalization for European, African, East Asian, South Asian, and Latino/Admixed American populations.
Multiple Formats
Support for 23andMe, AncestryDNA, and VCF file formats. Simply upload your raw data file to get started.
Scientific Transparency
Full documentation of our methodology, data sources, and limitations. Every score links back to its original publication.
Educational Resources
Detailed information about each disease, including heritability estimates, key genes, risk factors, and prevention strategies.
Data Privacy Statement
Your Privacy is Our Priority
We take data privacy extremely seriously. Here is exactly what happens with your genetic data:
- No Data Storage: Your genetic data is processed entirely in memory and is never stored on our servers or any external database.
- Client-Side Processing: All PRS calculations are performed locally in your web browser. Your raw genetic data never leaves your device.
- No Tracking: We do not track individual users or their results. No personal information is collected.
- Open Source: Our code is open source and can be audited by anyone to verify our privacy claims.
- Temporary Files: Any temporary files created during processing are automatically deleted when you close the browser tab.
We believe that individuals should have control over their own genetic information. By processing all data locally, we eliminate the risks associated with centralized data storage while still providing valuable genetic insights.
Data Sources & Attribution
PGS Catalog
This tool uses polygenic scores from the PGS Catalog, an open database of published polygenic scores developed and maintained by the European Bioinformatics Institute (EMBL-EBI) and the University of Cambridge.
The PGS Catalog provides standardized, harmonized scoring files that meet rigorous quality standards. Each score is linked to its original publication, ensuring full scientific transparency.
Citation: Lambert SA, Gil L, Jupp S, et al. The Polygenic Score Catalog as an open database for reproducibility and systematic evaluation. Nature Genetics (2021). doi:10.1038/s41588-021-00783-5
Additional Data Sources
- UK Biobank: Population statistics and validation cohorts
- 1000 Genomes Project: Reference population allele frequencies
- dbSNP: Variant identifiers and reference information
- Published GWAS: Effect sizes and variant associations from peer-reviewed genome-wide association studies
Medical Disclaimer
Important: Read Before Use
This tool provides genetic risk estimates for educational and informational purposes only. It is NOT a medical device and should NOT be used for:
- Diagnosing any disease or medical condition
- Making medical treatment decisions
- Replacing professional medical advice
- Determining the need for medical testing
- Making decisions about medication or lifestyle changes without consulting a healthcare provider
Genetic risk is only one factor in disease development. Many conditions are heavily influenced by environmental factors, lifestyle choices, and other non-genetic factors that this tool does not assess.
Always consult with qualified healthcare professionals for personalized medical advice. If you have concerns about your health or genetic risk, please speak with a physician or certified genetic counselor.
Understanding Your Results
When interpreting your polygenic risk scores, please keep in mind:
- A high-risk score does not mean you will definitely develop a condition. It indicates you may have a higher probability compared to others.
- A low-risk score does not guarantee you will not develop a condition. Many people with low genetic risk still develop diseases.
- Risk scores are relative to a reference population and may be less accurate for individuals of non-European ancestry.
- Scores reflect lifetime risk and do not indicate when (or if) a condition might develop.
Researchers
Luis Sanchez
Computer Science @ UC Berkeley
Berkeley CS student (3.9 GPA) and Chancellor's Scholar. Software engineer at Robolabs building memory aid apps for dementia patients. Previously at Duolingo, Adobe, and P&G. Won YC's Dedalus Hackathon and 7+ hackathons total. Contributed to the Dolphin Emulator used by millions.
Shubhankar Tripathy
AI Researcher | Stanford, MIT, Berkeley
AI Researcher specializing in Intelligence, RL, and Agentic Systems. Triple major (CS, Math, Data Science) from UMass Amherst. Google CSRMP Fellow. Co-founder of CADSA (40+ university network). Won CalHacks 1st Prize. Ex-Data Science at Dell Technologies.