overview of the dataset in the paper :

  • 15,789 de novo variants

    • 12166 in 6430 affected individuals.
      • 5951 SNVs in our autosomal coding windows.
    • 3623 in 2179 unaffected individuals.

Annotations and RR:

  • protein-coding autosomal syn + Mis + PTV (‘splice_donor_variant’, ‘splice_acceptor_variant’, ‘stop_gained’, ‘frameshift_variant’)

In our TADA-A model, we used the 5951 DNMs. (n_sample = 6430)

paper_obs: the observed number of mutations in the paper.

our_obs: the observed number of mutations in our model.

In the paper, the amount of mutations in these catagories is 7131, while only 5743 of them are of SNV located in our autosomal coding windows.

backgrd: the expected number of background mutations after using the synonymous mutations to calibrate the rates.

burden: $burden = \frac{our_obs}{backgrd}$

calculated_RR: RR calculated by the formula gamma = 1 + (lambda - 1)/pi

est_RR: RR separately estimated by TADA-A model.

1
df
A data.frame: 8 × 12
annota (VEP)paper_obspaper_percentpaper_logRRpaper_RRour_obsour_percentbackgrdburdencalculated_RRlog_est_RRest_RR
<fct><fct><fct><dbl><dbl><dbl><dbl><dbl><dbl><dbl><dbl><dbl>
PTV_Highest(pLI=0.995-1) 366 5.13 3.923169050.560417 123 2.14 333.727355.5463.47606300032.332179
PTV_Middle(pLI=0.5-0.995)164 2.30 1.9223090 6.836726 49 0.85 291.689714.7942.293976000 9.914279
PTV_Lowest(pLI=0-0.5) 442 6.20 0.9713427 2.641489 145 2.52 1001.450010.0001.687930000 5.408274
Missense_Highest(MPC≥2) 354 4.96 3.097837022.149989 278 4.84 1811.535911.7182.250359000 9.491143
Missense_Middle(MPC=1-2) 894 12.54 1.4303100 4.179995 69412.08 5981.1605 4.2101.379533539 3.973048
Missense_Lowest(MPC<1) 315544.24 NA NA287049.9723331.2302 5.6041.171609570 3.227183
Synonymous 175624.62 0.1190180 1.126390158427.5815841.0000 1.0000.009975216 1.010025
Total 7131100.00 NA NA5743 NA4858 NA NA NA NA

burden of DeepSEA PTM

1
new.df
A data.frame: 2 × 8
RBPobsbackgrdburdencalculated_RRlog_calculated_RRlog_estimated_RRestimated_RR
<fct><dbl><dbl><dbl><dbl><dbl><dbl><dbl>
ago_adult_brain.BA4.hg19 1221011.2079215.1584161.6406300.017845331.018006
ago_adult_brain.Cingulate.gyrus.hg191331021.3039227.0784311.9570520.024560461.024865
1
df
A data.frame: 7 × 5
annotapro_VEPsibling_VEPburdenpaper_burden
<fct><dbl><dbl><dbl><dbl>
syn 1537 6780.76822851.0068929
MPC2 278 531.77752282.0683381
MPC12 698 2500.94615371.1652267
MPC01 290913540.72806690.9835945
PTV>0.995 123 94.63136343.5437192
PTV-0.5-0.995 51 121.44024111.3555210
PTV<0.5 164 760.73126790.9985671