A bioinformatic pipeline for identifying structural variation in repetitive DNA using NGS data.
Stars
1
Forks
1
Watchers
1
Open Issues
0
Overall repository health assessment
No package.json found
This might not be a Node.js project
44
commits
Ensured workers close after parallel processing\n Fixed an issue where the worker created during parallel processing in AlignmentConverter failed to close until the pipeline finishes running on every dataset. Further, allowed AlignmentPipeline to use AlignmentConverter in parallel.
d4bae80View on GitHubMerged the updated AlignmentConverter.py to master. \nThe update 1) removes conflicts with newer versions of numpy and 2) permits running the function in parallel.
7f84b94View on GitHubResolved a deprecation issue with later versions of numpy. \n When updating the MD string, the array of nucleotides reflecting the span of the consensus to which the read mapped was one element longer than it should have been. This is sliced by an array of bools defined by the CIGAR string, and this boolean array is the correct length. In early versions of Numpy, when an array is sliced by a shorter bool array, the extra elements in the sliced array are simply excluded, meaning the extra nucleotide has no impact on behavior of the function. However,in later versions of Numpy (some version >1.10.4) this a mismatch in the length between an array and the bool array used to slice it raises an array. \n I removed the extra nucleotide from the array. I also replaced several functions with numpy.fromstring to avoid a different potential deprecation.
4ffac2aView on GitHubUpdated manual to describe how to summarize copy number for repeat families.
7e8c7aaView on GitHub