Predicted Trait | |
Reported Trait | Breast cancer |
Mapped Trait(s) | breast carcinoma (EFO_0000305) |
Additional Trait Information | UK Biobank codes: cancer codes: 1002; ICD9: 174, 1749; ICD10: C50, C500-C506, C508, C509; field ID 40001: C50, C500-C506, C508, C509; field ID 40002: C50, C500-C506, C508, C509; field ID 40006: C50, C500-C506, C508, C509; field ID 40013: 174, 1749 |
Score Construction | |
PGS Name | bc_2 |
Development Method | |
Name | LASSO |
Parameters | To reduce the number of candidate SNPs to a computationally feasible set, a GWAS was performed on the raw phenotype and the 50,000 SNPs with the smallest p-values were retained. The raw phenotype was then regressed on age and the top 20 UK Biobank principal components (PCs) to build a residual (adjusted) phenotype. The LASSO model was trained on these 50,000 SNPs and the adjusted phenotype. |
Variants | |
Original Genome Build | GRCh37 |
Number of Variants | 717 |
Effect Weight Type | beta |
PGS Source | |
PGS Catalog Publication (PGP) ID | PGP000520 |
Citation (link to publication) | Raben TG et al. Sci Rep (2023) |
Ancestry Distribution | |
Score Development/Training | European: 200,000 individuals (100%) |
PGS Evaluation | European: 1 sample set (100%) |
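
The Parameters field above describes a three-step recipe: keep the SNPs with the smallest GWAS p-values, adjust the phenotype for covariates, and fit a LASSO model on the retained SNPs. The following is a minimal pure-Python sketch of the selection and LASSO steps, plus applying beta weights to obtain a score. It is not the authors' implementation: the function names, the objective scaling `(1/2n)||y - Xb||^2 + lam * ||b||_1`, and the cyclic coordinate-descent solver are all assumptions made for illustration, and the phenotype passed in is assumed to have already been residualized on age and the top 20 PCs.

```python
def soft_threshold(z, lam):
    """Shrink z toward zero by lam (the LASSO proximal step)."""
    if z > lam:
        return z - lam
    if z < -lam:
        return z + lam
    return 0.0


def top_k_by_pvalue(pvalues, k):
    """Return the indices of the k SNPs with the smallest p-values."""
    return sorted(range(len(pvalues)), key=lambda j: pvalues[j])[:k]


def lasso_coordinate_descent(X, y, lam, n_iter=100):
    """Fit LASSO weights by cyclic coordinate descent (illustrative only).

    X: list of rows (samples x SNPs), e.g. genotype dosages.
    y: adjusted phenotype, assumed already residualized on covariates.
    """
    n, p = len(X), len(X[0])
    beta = [0.0] * p
    residual = list(y)  # equals y - X @ beta while beta is all zeros
    col_sq = [sum(X[i][j] ** 2 for i in range(n)) / n for j in range(p)]
    for _ in range(n_iter):
        for j in range(p):
            if col_sq[j] == 0.0:
                continue  # monomorphic column carries no signal
            # Covariance of column j with the residual that excludes SNP j.
            rho = sum(
                X[i][j] * (residual[i] + X[i][j] * beta[j]) for i in range(n)
            ) / n
            new_beta = soft_threshold(rho, lam) / col_sq[j]
            delta = new_beta - beta[j]
            if delta != 0.0:
                for i in range(n):
                    residual[i] -= X[i][j] * delta
                beta[j] = new_beta
    return beta


def polygenic_score(dosages, beta):
    """Score one individual as the beta-weighted sum of allele dosages."""
    return sum(d * b for d, b in zip(dosages, beta))
```

SNPs whose fitted weight is exactly zero drop out of the score, which is how a LASSO fit on 50,000 candidates can yield a much smaller final variant set. A production analysis would use an optimized solver and choose `lam` by validation rather than fixing it by hand.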
Study Identifiers | Sample Numbers | Sample Ancestry | Cohort(s) | Phenotype Definitions & Methods | Age of Study Participants | Participant Follow-up Time | Additional Ancestry Description | Additional Sample/Cohort Information |
---|---|---|---|---|---|---|---|---|
— | 0.0 % Male samples | European | NR | UK Biobank codes: cancer codes: 1002; ICD9: 174, 1749; ICD10: C50, C500-C506, C508, C509; field ID 40001: C50, C500-C506, C508, C509; field ID 40002: C50, C500-C506, C508, C509; field ID 40006: C50, C500-C506, C508, C509; field ID 40013: 174, 1749 | — | Used the UK Biobank self-reported 'white' category, then an adjusted phenotype that includes a regression on the top 20 PCs | — |
PGS Performance Metric ID (PPM) | PGS Sample Set ID (PSS) | Performance Source | Trait | PGS Effect Sizes (per SD change) | Classification Metrics | Other Metrics | Covariates Included in the Model | PGS Performance: Other Relevant Information |
---|---|---|---|---|---|---|---|---|
PPM020100 | PSS011296 (European Ancestry, 45,334 individuals) | PGP000520 (Raben TG et al. Sci Rep (2023)) | Reported Trait: Breast cancer | — | AUROC: 0.64213 | — | year of birth | — |
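
The classification metric reported above is an AUROC: the probability that a randomly chosen case has a higher score than a randomly chosen control. As a rough illustration of how such a value can be computed from scores and case/control labels, here is a small pure-Python sketch based on pairwise comparisons (the Mann-Whitney formulation). The function name and the toy inputs are hypothetical, and this is not necessarily how the cited study computed its metric.

```python
def auroc(scores, labels):
    """AUROC as the fraction of case/control pairs ranked correctly.

    scores: polygenic scores, one per individual.
    labels: 1 for cases, 0 for controls. Tied pairs count as half.
    """
    cases = [s for s, y in zip(scores, labels) if y == 1]
    controls = [s for s, y in zip(scores, labels) if y == 0]
    if not cases or not controls:
        raise ValueError("need at least one case and one control")
    wins = 0.0
    for c in cases:
        for k in controls:
            if c > k:
                wins += 1.0
            elif c == k:
                wins += 0.5
    return wins / (len(cases) * len(controls))


# Toy example: three of the four case/control pairs are ordered correctly.
print(auroc([0.9, 0.8, 0.3, 0.1], [1, 0, 1, 0]))  # 0.75
```

The pairwise loop is quadratic in sample size, so for a cohort of tens of thousands a rank-based implementation would be the more practical choice.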
PGS Sample Set ID (PSS) | Phenotype Definitions and Methods | Participant Follow-up Time | Sample Numbers | Age of Study Participants | Sample Ancestry | Additional Ancestry Description | Cohort(s) | Additional Sample/Cohort Information |
---|---|---|---|---|---|---|---|---|
PSS011296 | 22,667 sibling pairs | — | 45,334 individuals | — | European | — | UKB | — |