Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

clean and update scratch notebook directory #69

Merged
merged 2 commits into from
Dec 13, 2024
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
1,311 changes: 1,311 additions & 0 deletions non-pipeline_analyses/Additional_Notebooks/17C7_EscapeValidations.ipynb

Large diffs are not rendered by default.

765 changes: 765 additions & 0 deletions non-pipeline_analyses/Additional_Notebooks/231014_NeutCurves.ipynb

Large diffs are not rendered by default.

762 changes: 762 additions & 0 deletions non-pipeline_analyses/Additional_Notebooks/231027_NeutCurves.ipynb

Large diffs are not rendered by default.

2,202 changes: 2,202 additions & 0 deletions non-pipeline_analyses/Additional_Notebooks/240726_aa-distributions.eps

Large diffs are not rendered by default.

2,276 changes: 2,276 additions & 0 deletions non-pipeline_analyses/Additional_Notebooks/240810_LuciferaseValidations.ipynb

Large diffs are not rendered by default.

1,390 changes: 1,390 additions & 0 deletions non-pipeline_analyses/Additional_Notebooks/240825_CircStrain-AGN_Neuts.ipynb

Large diffs are not rendered by default.

1,423 changes: 1,423 additions & 0 deletions non-pipeline_analyses/Additional_Notebooks/241003_CircStrain-QEJ_Neuts.ipynb

Large diffs are not rendered by default.

Large diffs are not rendered by default.

528 changes: 528 additions & 0 deletions non-pipeline_analyses/Additional_Notebooks/241126_GetContacts.ipynb

Large diffs are not rendered by default.

Large diffs are not rendered by default.

1,081 changes: 1,081 additions & 0 deletions non-pipeline_analyses/Additional_Notebooks/AGN-mutants_Neuts_FigS6B.ipynb

Large diffs are not rendered by default.

40 changes: 40 additions & 0 deletions non-pipeline_analyses/Additional_Notebooks/README.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,40 @@
# Scratch notebooks
### Arjun Aditham, Bloom Lab (2024)

This directory contains notebooks that run scripts for custom analyses. They do not run as part of the pipeline. Some rely on data that is produced by the pipeline. Others were scripted in order to produce pre-requisite information for the pipeline. These notebooks have been edited slightly to increase readability and to update any directories that rely on additional data files.

## Notebooks that produce figures for the manuscript

Some notebooks were used to generate figures for the manuscript. Below are the notebooks and the raw figures they produced (or the analysis used to generate other figures)

'240810_LuciferaseValidations.ipynb' -- Analyzes data and produces the Figure 1D.

'AA_sub_distribution_plots_FigsS1CD.ipynb' -- produces the raw numbers for Fig S1C and the distribution plots for S1D.

'241126_GetContacts.ipynb' -- produces list of sites within 4 Angstroms of any atom in antibody for structures of complexes are available. These sites are used to generate heatmaps of escape at specific sites.

'17C7_EscapeValidations.ipynb' -- This notebook produces Fig 3E and Fig S5A. Analyzes neutralization data for 17C7 validation assays.

'231014_NeutCurves.ipynb' -- Produces the neut curves seen in Figure S5B.

'231027_NeutCurves.ipynb' -- Produces the neut curves seen in Figure S5B.

'241119_Heatmap_AntibodyConctact.ipynb' -- produces the heatmaps used in Figure 5. This notebook was hard-coded with sites identified using the distance cut-off rules as defined in its partner notebook 241126_GetContacts.ipynb.

'natural_sequence_antibody_escape-PaperFigures.ipynb' -- Reformats the antibody escape versus frequency plots for Figure 6A. Produces top escape mutations listed in Figure 6C.

'241003_CircStrain-QEJ_Neuts.ipynb' -- Produces neutralization assay data for Figure 6D for the NY-2011-1202O rabies G pseudovirus. Note that we use 'QEJ' as a convenient shorthand notation for the strain.

'240825_CircStrain-AGN_Neuts.ipynb' -- Produces neutralization assay data for Figure 6D for the A12_2718 rabies G pseudovirus. Note that we use 'AGN' as a convenient shorthand notation for the strain.

'AGN-mutants_Neuts_FigS6B.ipynb' -- Produces Figure S6B and analyzes neutralization assay data for mutations in the paper. Note that we use 'AGN' as a convenient shorthand for the A12_2718 full strain name.


## Notebooks

glycoprotein_positions.ipynb -- produces various csv files for pipeline, including site_numbering_map.csv, PacBio_runs.csv, and designed_mutations.csv. Note that these files must reside within the data directory for the pipeline to work as-is.

color_pdb_functionaleffects.ipynb -- produces pdb file with b-factors updated to contain mean functional effects on cellular entry for visualizing effects on PyMOL. This notebook requires paths to a pdb file of the structure of interest and to the csv file with calculated functional effects.

update_barcode_runs.ipynb -- produces the updated barcode_runs.csv document that is input into the dms-vep-pipeline-3 workflow for analyzing functional and antibody selections. Briefly, this notebook takes in the prior barcode_runs.csv file, archives the prior set of selections into a separate user-defined document, and appends additional runs and updates the file.

Original file line number Diff line number Diff line change
@@ -0,0 +1,129 @@
,virus,serum,fraction infectivity,concentration,replicate
16,PV,RVC58,0.0,1000.0,3
17,PV,RVC58,0.0,200.0,3
18,PV,RVC58,2.214113033237988e-05,40.0,3
19,PV,RVC58,-3.321169549856982e-05,8.0,3
20,PV,RVC58,0.07914347037309188,1.6,3
21,PV,RVC58,0.5180692380821906,0.32,3
22,PV,RVC58,0.720107052365157,0.064,3
23,PV,RVC58,0.7974903028768248,0.0128,3
24,PV,RVC58,0.00014391734716046923,1000.0,4
25,PV,RVC58,2.214113033237988e-05,200.0,4
26,PV,RVC58,7.749395616332958e-05,40.0,4
27,PV,RVC58,4.428226066475976e-05,8.0,4
28,PV,RVC58,0.10780516358835764,1.6,4
29,PV,RVC58,0.6119808423869799,0.32,4
30,PV,RVC58,0.7627840810808193,0.064,4
31,PV,RVC58,0.9444852671534948,0.0128,4
32,PV,CTB012,0.17127271368612457,120.0,5
33,PV,CTB012,0.7682307991425847,24.0,5
34,PV,CTB012,0.9150818460720943,4.8,5
35,PV,CTB012,0.8527877758819435,0.96,5
36,PV,CTB012,1.0356845829925676,0.192,5
37,PV,CTB012,0.9463119104059161,0.0384,5
38,PV,CTB012,0.8442080878781463,0.00768,5
39,PV,CTB012,1.1439768514482376,0.001536,5
40,PV,CTB012,0.2292492634614613,120.0,6
41,PV,CTB012,0.6630050772379493,24.0,6
42,PV,CTB012,0.76655914380249,4.8,6
43,PV,CTB012,0.8440530999658197,0.96,6
44,PV,CTB012,0.7801870095220699,0.192,6
45,PV,CTB012,0.8222551571535917,0.0384,6
46,PV,CTB012,0.9067457105019533,0.00768,6
47,PV,CTB012,0.8640022583952939,0.001536,6
48,PV,RVA122,2.214113033237988e-05,500.0,7
49,PV,RVA122,0.0,100.0,7
50,PV,RVA122,4.428226066475976e-05,20.0,7
51,PV,RVA122,0.006908032663702523,4.0,7
52,PV,RVA122,0.37338802192525433,0.8,7
53,PV,RVA122,0.6548350001453012,0.16,7
54,PV,RVA122,0.8426471381897135,0.032,7
55,PV,RVA122,0.9054061721168443,0.0064,7
56,PV,RVA122,6.642339099713964e-05,500.0,8
57,PV,RVA122,1.107056516618994e-05,100.0,8
58,PV,RVA122,3.321169549856982e-05,20.0,8
59,PV,RVA122,0.03380950601754408,4.0,8
60,PV,RVA122,0.3212235188621673,0.8,8
61,PV,RVA122,0.6480819553939253,0.16,8
62,PV,RVA122,0.8317204903706841,0.032,8
63,PV,RVA122,0.9986092602509974,0.0064,8
64,PV,CR57,2.214113033237988e-05,1000.0,9
65,PV,CR57,5.53528258309497e-05,200.0,9
66,PV,CR57,5.53528258309497e-05,40.0,9
67,PV,CR57,0.007616548834338679,8.0,9
68,PV,CR57,0.18283038371962687,1.6,9
69,PV,CR57,0.6935709076617997,0.32,9
70,PV,CR57,0.8740875432616929,0.064,9
71,PV,CR57,0.8634044478763196,0.0128,9
72,PV,CR57,-5.53528258309497e-05,1000.0,10
73,PV,CR57,0.0,200.0,10
74,PV,CR57,0.0001107056516618994,40.0,10
75,PV,CR57,0.015974825534812085,8.0,10
76,PV,CR57,0.1933695617578397,1.6,10
77,PV,CR57,0.6895855042019714,0.32,10
78,PV,CR57,0.8972582361545285,0.064,10
79,PV,CR57,1.0691398309247935,0.0128,10
16,PV,17C7,0.1072006220714105,50.0,3
17,PV,17C7,0.6935772905136535,10.0,3
18,PV,17C7,0.7892288529265081,2.0,3
19,PV,17C7,0.7043374446412217,0.4,3
20,PV,17C7,1.0431917695167225,0.08,3
21,PV,17C7,0.7306558504700696,0.016,3
22,PV,17C7,0.8453575415246408,0.0032,3
23,PV,17C7,0.783253863495094,0.00064,3
24,PV,17C7,0.17112783583651872,50.0,4
25,PV,17C7,0.7031088212732903,10.0,4
26,PV,17C7,0.8866392866871383,2.0,4
27,PV,17C7,0.9078750716360171,0.4,4
28,PV,17C7,0.9180532462735126,0.08,4
29,PV,17C7,0.6093713247388973,0.016,4
30,PV,17C7,0.740743494964665,0.0032,4
31,PV,17C7,0.5996458008159029,0.00064,4
32,PV,RVC68,0.011768918577027814,1000.0,5
33,PV,RVC68,0.13852405151488453,200.0,5
34,PV,RVC68,0.4553666187880762,40.0,5
35,PV,RVC68,0.6622797268252938,8.0,5
36,PV,RVC68,0.8254926415968223,1.6,5
37,PV,RVC68,0.701440480068415,0.32,5
38,PV,RVC68,0.8907778075054338,0.064,5
39,PV,RVC68,0.8878679100550698,0.0128,5
40,PV,RVC68,0.013476058414574707,1000.0,6
41,PV,RVC68,0.11772798440294967,200.0,6
42,PV,RVC68,0.34912302965589653,40.0,6
43,PV,RVC68,0.5361195062550671,8.0,6
44,PV,RVC68,0.715020001503447,1.6,6
45,PV,RVC68,0.5780220295403089,0.32,6
46,PV,RVC68,0.7823226963109775,0.064,6
47,PV,RVC68,0.6778250456490162,0.0128,6
48,PV,RVC20,0.45439665297128823,100.0,7
49,PV,RVC20,0.8739521338035512,20.0,7
50,PV,RVC20,0.8848028180740197,4.0,7
51,PV,RVC20,0.7760890493284199,0.8,7
52,PV,RVC20,0.8948516639359435,0.16,7
53,PV,RVC20,0.6694833396246394,0.032,7
54,PV,RVC20,0.7447397541298315,0.0064,7
55,PV,RVC20,0.735531545309124,0.00128,7
56,PV,RVC20,0.472580278816674,100.0,8
57,PV,RVC20,0.790858395498712,20.0,8
58,PV,RVC20,0.919424131294573,4.0,8
59,PV,RVC20,0.697793408597292,0.8,8
60,PV,RVC20,0.8513195980784977,0.16,8
61,PV,RVC20,0.6995652128226247,0.032,8
62,PV,RVC20,0.6770232072404715,0.0064,8
63,PV,RVC20,0.6858434297344638,0.00128,8
64,PV,CR4098,6.466438778586711e-05,200.0,9
65,PV,CR4098,0.02192122745940895,40.0,9
66,PV,CR4098,0.3052417761044071,8.0,9
67,PV,CR4098,0.716365020769393,1.6,9
68,PV,CR4098,0.9900635085118542,0.32,9
69,PV,CR4098,0.6062027697373898,0.064,9
70,PV,CR4098,0.7254051021818573,0.0128,9
71,PV,CR4098,0.7445328280889167,0.00256,9
72,PV,CR4098,-6.466438778586711e-05,200.0,10
73,PV,CR4098,0.010436832188638952,40.0,10
74,PV,CR4098,0.32883134476869147,8.0,10
75,PV,CR4098,0.7767356932062786,1.6,10
76,PV,CR4098,0.8698524116179273,0.32,10
77,PV,CR4098,0.8315193625384653,0.064,10
78,PV,CR4098,1.0945223605411438,0.0128,10
79,PV,CR4098,0.7479471077640105,0.00256,10
Original file line number Diff line number Diff line change
@@ -0,0 +1,65 @@
,virus,serum,fraction infectivity,concentration,replicate
16,PV,17C7,6.910865954540324e-05,500.0,3
17,PV,17C7,-6.910865954540324e-05,100.0,3
18,PV,17C7,0.00013821731909080648,20.0,3
19,PV,17C7,0.05445762372177775,4.0,3
20,PV,17C7,0.40225847099394374,0.8,3
21,PV,17C7,0.7633972895583726,0.16,3
22,PV,17C7,0.7868481613641127,0.032,3
23,PV,17C7,0.995233806113352,0.0064,3
24,PV,17C7,9.214487939387099e-05,500.0,4
25,PV,17C7,-2.3036219848467747e-05,100.0,4
26,PV,17C7,0.0,20.0,4
27,PV,17C7,0.04217931854254444,4.0,4
28,PV,17C7,0.298549409236142,0.8,4
29,PV,17C7,0.6909483781349416,0.16,4
30,PV,17C7,0.8149753857990919,0.032,4
31,PV,17C7,0.7824712795929039,0.0064,4
32,PV,CTB012,0.0023727306443921776,600.0,5
33,PV,CTB012,0.019073990034531293,120.0,5
34,PV,CTB012,0.04669441763284412,24.0,5
35,PV,CTB012,0.16530791363260453,4.8,5
36,PV,CTB012,0.6631206245579925,0.96,5
37,PV,CTB012,0.6831851720460079,0.192,5
38,PV,CTB012,0.7001398298544802,0.0384,5
39,PV,CTB012,0.8683503071879917,0.00768,5
40,PV,CTB012,0.008016604507266776,600.0,6
41,PV,CTB012,0.03932282728133444,120.0,6
42,PV,CTB012,0.06883222490722163,24.0,6
43,PV,CTB012,0.16703563012123962,4.8,6
44,PV,CTB012,0.34713279689656046,0.96,6
45,PV,CTB012,0.6381493622422535,0.192,6
46,PV,CTB012,0.8132707055303052,0.0384,6
47,PV,CTB012,0.6874929451576713,0.00768,6
48,PV,RVC20,-0.00013821731909080648,1000.0,7
49,PV,RVC20,0.0,200.0,7
50,PV,RVC20,0.011287747725749196,40.0,7
51,PV,RVC20,0.04213324610284751,8.0,7
52,PV,RVC20,0.3935968523309199,1.6,7
53,PV,RVC20,0.6491837115496695,0.32,7
54,PV,RVC20,0.6688105708605641,0.064,7
55,PV,RVC20,0.794887802091228,0.0128,7
56,PV,RVC20,-2.3036219848467747e-05,1000.0,8
57,PV,RVC20,-9.214487939387099e-05,200.0,8
58,PV,RVC20,6.910865954540324e-05,40.0,8
59,PV,RVC20,0.03317215658179355,8.0,8
60,PV,RVC20,0.29357358574887293,1.6,8
61,PV,RVC20,0.5918235241269848,0.32,8
62,PV,RVC20,0.7885758778527479,0.064,8
63,PV,RVC20,0.6432864392684617,0.0128,8
64,PV,RVC58,-4.607243969693549e-05,1000.0,9
65,PV,RVC58,-4.607243969693549e-05,200.0,9
66,PV,RVC58,4.607243969693549e-05,40.0,9
67,PV,RVC58,-6.910865954540324e-05,8.0,9
68,PV,RVC58,0.13888536946641203,1.6,9
69,PV,RVC58,0.49467978502599635,0.32,9
70,PV,RVC58,0.6442078880624005,0.064,9
71,PV,RVC58,0.7578685967947404,0.0128,9
72,PV,RVC58,-6.910865954540324e-05,1000.0,10
73,PV,RVC58,0.0,200.0,10
74,PV,RVC58,-6.910865954540324e-05,40.0,10
75,PV,RVC58,2.3036219848467747e-05,8.0,10
76,PV,RVC58,0.048237844362691455,1.6,10
77,PV,RVC58,0.5383794940785397,0.32,10
78,PV,RVC58,0.6779329139205573,0.064,10
79,PV,RVC58,0.7191216750096175,0.0128,10
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Loading
Loading