Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Ps merging annotating genecounts #15

Open
wants to merge 84 commits into
base: master
Choose a base branch
from

Commits on Jun 16, 2022

  1. Configuration menu
    Copy the full SHA
    68e96d6 View commit details
    Browse the repository at this point in the history

Commits on Jun 28, 2022

  1. Configuration menu
    Copy the full SHA
    c5df8a7 View commit details
    Browse the repository at this point in the history

Commits on Jun 30, 2022

  1. Configuration menu
    Copy the full SHA
    d6d45e7 View commit details
    Browse the repository at this point in the history
  2. full exon matches

    preetisi committed Jun 30, 2022
    Configuration menu
    Copy the full SHA
    45551b7 View commit details
    Browse the repository at this point in the history

Commits on Jul 6, 2022

  1. sparse matrix initial code

    preetisi committed Jul 6, 2022
    Configuration menu
    Copy the full SHA
    a8f752c View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    c523742 View commit details
    Browse the repository at this point in the history
  3. get all unique junctions of all samples and add them to a single data…

    …frame
    Preeti Singh committed Jul 6, 2022
    Configuration menu
    Copy the full SHA
    5c84d59 View commit details
    Browse the repository at this point in the history

Commits on Jul 7, 2022

  1. calculate total junction counts by parsing sample files

    Preeti Singh committed Jul 7, 2022
    Configuration menu
    Copy the full SHA
    95ec410 View commit details
    Browse the repository at this point in the history

Commits on Jul 8, 2022

  1. divide the input sample files in chunk and multiprocess them in chunk…

    …s of 50
    Preeti Singh committed Jul 8, 2022
    Configuration menu
    Copy the full SHA
    d814dd2 View commit details
    Browse the repository at this point in the history

Commits on Jul 13, 2022

  1. Merge branch 'master' of https://github.com/SalomonisLab/altanalyze3

    …into ps-merging_annotating_genecounts
    Preeti Singh committed Jul 13, 2022
    Configuration menu
    Copy the full SHA
    a497275 View commit details
    Browse the repository at this point in the history

Commits on Jul 14, 2022

  1. intermittent code for sparse matrix

    Preeti Singh committed Jul 14, 2022
    Configuration menu
    Copy the full SHA
    f7b2d13 View commit details
    Browse the repository at this point in the history
  2. working sparse matrix from two samples

    Preeti Singh committed Jul 14, 2022
    Configuration menu
    Copy the full SHA
    679ede6 View commit details
    Browse the repository at this point in the history
  3. sparse matrix function

    Preeti Singh committed Jul 14, 2022
    Configuration menu
    Copy the full SHA
    723cdd4 View commit details
    Browse the repository at this point in the history

Commits on Jul 15, 2022

  1. sparse matrix using dictionary of keys

    Preeti Singh committed Jul 15, 2022
    Configuration menu
    Copy the full SHA
    b38af08 View commit details
    Browse the repository at this point in the history

Commits on Jul 19, 2022

  1. dok_matrix to csr matrix

    Preeti Singh committed Jul 19, 2022
    Configuration menu
    Copy the full SHA
    eb934b2 View commit details
    Browse the repository at this point in the history

Commits on Jul 21, 2022

  1. read multiple files from a directory

    Preeti Singh committed Jul 21, 2022
    Configuration menu
    Copy the full SHA
    c2e77d7 View commit details
    Browse the repository at this point in the history
  2. working version of multiprocessing

    Preeti Singh committed Jul 21, 2022
    Configuration menu
    Copy the full SHA
    05fde08 View commit details
    Browse the repository at this point in the history
  3. new main file

    Preeti Singh committed Jul 21, 2022
    Configuration menu
    Copy the full SHA
    cf70761 View commit details
    Browse the repository at this point in the history

Commits on Jul 26, 2022

  1. parse each file and make a sparse matrix

    Preeti Singh committed Jul 26, 2022
    Configuration menu
    Copy the full SHA
    8ad3a23 View commit details
    Browse the repository at this point in the history
  2. remove the chromosome guard

    Preeti Singh committed Jul 26, 2022
    Configuration menu
    Copy the full SHA
    9652ef4 View commit details
    Browse the repository at this point in the history
  3. ignore the test data folder

    Preeti Singh committed Jul 26, 2022
    Configuration menu
    Copy the full SHA
    2ed9347 View commit details
    Browse the repository at this point in the history

Commits on Jul 28, 2022

  1. read files using pandaframes

    Preeti Singh committed Jul 28, 2022
    Configuration menu
    Copy the full SHA
    221e469 View commit details
    Browse the repository at this point in the history
  2. created a giant dictionary where key is junction coordinate and it's …

    …value is an object defining it's sample id and number of spliceevents
    Preeti Singh committed Jul 28, 2022
    Configuration menu
    Copy the full SHA
    1ca86e2 View commit details
    Browse the repository at this point in the history
  3. check if the junction already exists in the dictionary

    Preeti Singh committed Jul 28, 2022
    Configuration menu
    Copy the full SHA
    e76c477 View commit details
    Browse the repository at this point in the history
  4. get junction annotations from gene model

    Preeti Singh committed Jul 28, 2022
    Configuration menu
    Copy the full SHA
    bf33f53 View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    a8ce4b5 View commit details
    Browse the repository at this point in the history
  6. imported the junctions from bam derived text file

    Preeti Singh committed Jul 28, 2022
    Configuration menu
    Copy the full SHA
    77d4b6f View commit details
    Browse the repository at this point in the history
  7. get junction annotations

    Preeti Singh committed Jul 28, 2022
    Configuration menu
    Copy the full SHA
    40face6 View commit details
    Browse the repository at this point in the history

Commits on Aug 1, 2022

  1. Configuration menu
    Copy the full SHA
    d89f06a View commit details
    Browse the repository at this point in the history

Commits on Aug 2, 2022

  1. add log statement

    Preeti Singh committed Aug 2, 2022
    Configuration menu
    Copy the full SHA
    f2ea773 View commit details
    Browse the repository at this point in the history
  2. sparse matrix from giant dictionary

    Preeti Singh committed Aug 2, 2022
    Configuration menu
    Copy the full SHA
    4ede6a7 View commit details
    Browse the repository at this point in the history
  3. working version of making dok_matrix

    Preeti Singh committed Aug 2, 2022
    Configuration menu
    Copy the full SHA
    b3c8f8c View commit details
    Browse the repository at this point in the history
  4. indexing of dok_matrix fixed

    Preeti Singh committed Aug 2, 2022
    Configuration menu
    Copy the full SHA
    ae7e743 View commit details
    Browse the repository at this point in the history

Commits on Aug 4, 2022

  1. sparse matrix printed as dense matrix for testing

    Preeti Singh committed Aug 4, 2022
    Configuration menu
    Copy the full SHA
    2edcd72 View commit details
    Browse the repository at this point in the history
  2. old main

    Preeti Singh committed Aug 4, 2022
    Configuration menu
    Copy the full SHA
    467b3ea View commit details
    Browse the repository at this point in the history

Commits on Aug 5, 2022

  1. fix the way we are storing exon annotations, fix the if statements in…

    …side splice site annotations function
    Preeti Singh committed Aug 5, 2022
    Configuration menu
    Copy the full SHA
    57485ef View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    96e2e8a View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    383096e View commit details
    Browse the repository at this point in the history
  4. ignore the tsv files

    Preeti Singh committed Aug 5, 2022
    Configuration menu
    Copy the full SHA
    f434217 View commit details
    Browse the repository at this point in the history
  5. make a separate class for collecting the data

    Preeti Singh committed Aug 5, 2022
    Configuration menu
    Copy the full SHA
    6a0087b View commit details
    Browse the repository at this point in the history
  6. remove the dead code

    Preeti Singh committed Aug 5, 2022
    Configuration menu
    Copy the full SHA
    0bb5f7f View commit details
    Browse the repository at this point in the history

Commits on Aug 9, 2022

  1. added classes

    Preeti Singh committed Aug 9, 2022
    Configuration menu
    Copy the full SHA
    cc600dd View commit details
    Browse the repository at this point in the history

Commits on Aug 10, 2022

  1. memory efficent way for reading the junctions from gene model and cre…

    …ating annotations for junctions which exist in sample bed files
    Preeti Singh committed Aug 10, 2022
    Configuration menu
    Copy the full SHA
    50f8978 View commit details
    Browse the repository at this point in the history
  2. remove dead code

    Preeti Singh committed Aug 10, 2022
    Configuration menu
    Copy the full SHA
    bef70bc View commit details
    Browse the repository at this point in the history

Commits on Aug 11, 2022

  1. add print statements

    Preeti Singh committed Aug 11, 2022
    Configuration menu
    Copy the full SHA
    8ccacc7 View commit details
    Browse the repository at this point in the history

Commits on Aug 12, 2022

  1. initial commits for splice annotation

    Preeti Singh committed Aug 12, 2022
    Configuration menu
    Copy the full SHA
    4e710a7 View commit details
    Browse the repository at this point in the history

Commits on Aug 14, 2022

  1. check if start and stop annotations are not None and then find the sp…

    …lice annotations resepectively
    Preeti Singh committed Aug 14, 2022
    Configuration menu
    Copy the full SHA
    ce75956 View commit details
    Browse the repository at this point in the history
  2. get all the exons for a given gene id

    Preeti Singh committed Aug 14, 2022
    Configuration menu
    Copy the full SHA
    b490881 View commit details
    Browse the repository at this point in the history

Commits on Aug 15, 2022

  1. got the exons for candidate gene

    Preeti Singh committed Aug 15, 2022
    Configuration menu
    Copy the full SHA
    a09a494 View commit details
    Browse the repository at this point in the history
  2. working version of getting the start and stop annotations

    Preeti Singh committed Aug 15, 2022
    Configuration menu
    Copy the full SHA
    41c3623 View commit details
    Browse the repository at this point in the history

Commits on Aug 16, 2022

  1. splice annotations working

    Preeti Singh committed Aug 16, 2022
    Configuration menu
    Copy the full SHA
    e26ab14 View commit details
    Browse the repository at this point in the history

Commits on Aug 17, 2022

  1. working version of splice site annotations

    Preeti Singh committed Aug 17, 2022
    Configuration menu
    Copy the full SHA
    76f8030 View commit details
    Browse the repository at this point in the history
  2. annotations writing to a txt file

    Preeti Singh committed Aug 17, 2022
    Configuration menu
    Copy the full SHA
    c20e286 View commit details
    Browse the repository at this point in the history

Commits on Aug 18, 2022

  1. correct annotations when both start and stop coordinates of a junctio…

    …n exists in gene model
    Preeti Singh committed Aug 18, 2022
    Configuration menu
    Copy the full SHA
    8b1e22b View commit details
    Browse the repository at this point in the history
  2. working if else statements for checking if junction exists in the gen…

    …e model
    Preeti Singh committed Aug 18, 2022
    Configuration menu
    Copy the full SHA
    87198fc View commit details
    Browse the repository at this point in the history
  3. fix if statement

    Preeti Singh committed Aug 18, 2022
    Configuration menu
    Copy the full SHA
    e90af2a View commit details
    Browse the repository at this point in the history
  4. short circuiting in if else fixed

    Preeti Singh committed Aug 18, 2022
    Configuration menu
    Copy the full SHA
    9b60784 View commit details
    Browse the repository at this point in the history
  5. start annotation getting annotated correctly when stop target tuple d…

    …oesn't exist in gene model
    Preeti Singh committed Aug 18, 2022
    Configuration menu
    Copy the full SHA
    8a57f45 View commit details
    Browse the repository at this point in the history
  6. working annotations

    Preeti Singh committed Aug 18, 2022
    Configuration menu
    Copy the full SHA
    f325a40 View commit details
    Browse the repository at this point in the history
  7. negative and positive both strand working

    Preeti Singh committed Aug 18, 2022
    Configuration menu
    Copy the full SHA
    fad419d View commit details
    Browse the repository at this point in the history
  8. Configuration menu
    Copy the full SHA
    194d7a5 View commit details
    Browse the repository at this point in the history

Commits on Aug 22, 2022

  1. U_ annotations coorected

    Preeti Singh committed Aug 22, 2022
    Configuration menu
    Copy the full SHA
    439d19f View commit details
    Browse the repository at this point in the history

Commits on Aug 23, 2022

  1. Configuration menu
    Copy the full SHA
    ad5757d View commit details
    Browse the repository at this point in the history

Commits on Aug 31, 2022

  1. all annotations working correctly

    Preeti Singh committed Aug 31, 2022
    Configuration menu
    Copy the full SHA
    7dce938 View commit details
    Browse the repository at this point in the history
  2. remove unnecessary print statements

    Preeti Singh committed Aug 31, 2022
    Configuration menu
    Copy the full SHA
    da3feee View commit details
    Browse the repository at this point in the history
  3. refactoring #1 - removing variables which are not needed

    Preeti Singh committed Aug 31, 2022
    Configuration menu
    Copy the full SHA
    1758f19 View commit details
    Browse the repository at this point in the history
  4. bug -fix - fixed the infinite loop while finding the annotation, refa…

    …ctoring#2'
    Preeti Singh committed Aug 31, 2022
    Configuration menu
    Copy the full SHA
    7a04082 View commit details
    Browse the repository at this point in the history

Commits on Sep 1, 2022

  1. benchmark with timeit without indexing junction files"

    Preeti Singh committed Sep 1, 2022
    Configuration menu
    Copy the full SHA
    f8ce038 View commit details
    Browse the repository at this point in the history
  2. refactoring #3 - add logging statements, cleaner way of string concat…

    …enation,separate function for writing to annotation file to add async multiprocessing in future
    Preeti Singh committed Sep 1, 2022
    Configuration menu
    Copy the full SHA
    71b568f View commit details
    Browse the repository at this point in the history

Commits on Sep 2, 2022

  1. benchmarking and adding multiprocessing

    Preeti Singh committed Sep 2, 2022
    Configuration menu
    Copy the full SHA
    34e10ee View commit details
    Browse the repository at this point in the history

Commits on Sep 8, 2022

  1. optimised gene model file reading - removed pandas, instead using pyt…

    …hon file system
    Preeti Singh committed Sep 8, 2022
    Configuration menu
    Copy the full SHA
    0eb7d1c View commit details
    Browse the repository at this point in the history

Commits on Sep 9, 2022

  1. normalize data types of junction file and gene model

    Preeti Singh committed Sep 9, 2022
    Configuration menu
    Copy the full SHA
    fa1ccf4 View commit details
    Browse the repository at this point in the history
  2. fix data type issues

    Preeti Singh committed Sep 9, 2022
    Configuration menu
    Copy the full SHA
    5583806 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    1b8bcad View commit details
    Browse the repository at this point in the history
  4. refactoring #5 - reduced the number of if else statements

    Preeti Singh committed Sep 9, 2022
    Configuration menu
    Copy the full SHA
    28ebec5 View commit details
    Browse the repository at this point in the history

Commits on Sep 14, 2022

  1. refatored splice site annotation function

    Preeti Singh committed Sep 14, 2022
    Configuration menu
    Copy the full SHA
    756d5ad View commit details
    Browse the repository at this point in the history

Commits on Sep 15, 2022

  1. reading junctions using pandas - for some reason it was faster than p…

    …ython file reading system
    Preeti Singh committed Sep 15, 2022
    Configuration menu
    Copy the full SHA
    91e47cc View commit details
    Browse the repository at this point in the history

Commits on Oct 28, 2022

  1. refactored the annotation function

    Preeti Singh committed Oct 28, 2022
    Configuration menu
    Copy the full SHA
    7549dd9 View commit details
    Browse the repository at this point in the history
  2. sorting function

    Preeti Singh committed Oct 28, 2022
    Configuration menu
    Copy the full SHA
    4b39404 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    6752b37 View commit details
    Browse the repository at this point in the history

Commits on Nov 2, 2022

  1. resolve conflict

    Preeti Singh committed Nov 2, 2022
    Configuration menu
    Copy the full SHA
    00af13b View commit details
    Browse the repository at this point in the history
  2. preliminary attempt to sort junctions

    Preeti Singh committed Nov 2, 2022
    Configuration menu
    Copy the full SHA
    2227664 View commit details
    Browse the repository at this point in the history
  3. remove dead code

    Preeti Singh committed Nov 2, 2022
    Configuration menu
    Copy the full SHA
    7df7ce2 View commit details
    Browse the repository at this point in the history
  4. close the pool processes

    Preeti Singh committed Nov 2, 2022
    Configuration menu
    Copy the full SHA
    909c0b4 View commit details
    Browse the repository at this point in the history