Add capability to parse auxiliary target file #469

rob-p · 2019-12-31T17:09:16Z

This issue continues the discussion started in COMBINE-lab/pufferfish#8, implementing the feature discussed there. I'm tagging @mdshw5 to move conversation over to this issue. As of commit 7a37e8b, we have an --auxTargetFile option that can be passed to the quant command. The format of this file is one target per line. Every target listed in this file that is part of the index will be marked as auxiliary. This means that, while these targets will be quantified, they will not have the auxiliary models applied to them; specifically, sequence-specific, fragment-GC and position-specific bias models will not be applied to such targets. However, unlike decoy targets, they will be quantified and will appear in the quant.sf output.

The text was updated successfully, but these errors were encountered:

rob-p · 2019-12-31T17:46:43Z

In addition, the ids (index in the ordered list of transcript names) that are treated as auxiliary targets in a given sample are written to the file aux_info/aux_target_ids.json. This allows tracking, in a given sample, which targets were treated as auxiliary.

mdshw5 · 2019-12-31T17:54:10Z

Writing the targets to JSON is probably a good idea. My only remaining concern would be how to make it as easy as possible to compare these files against each other:

Does target order matter? If so, JSON parsers will almost always make an unordered object during deserialization.
Will a user just be able to diff or hash these files against each other? If so I’d say that’s the best solution.

rob-p · 2019-12-31T17:57:25Z

Great questions:

The order of the ids in the aux_target_ids.json file will always be in the order the targets appear in the original index. That means that, if the samples were quantified with the same index (which could be checked by the SeqHash values already in meta_info.json, then the ids in aux_target_ids.json will have the same interpretation and always be directly comparable. Then, one could diff the ids to ensure that precisely the same set of targets were treated as auxiliary.

rob-p added enhancement feature request labels Dec 31, 2019

rob-p self-assigned this Dec 31, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add capability to parse auxiliary target file #469

Add capability to parse auxiliary target file #469

rob-p commented Dec 31, 2019

rob-p commented Dec 31, 2019

mdshw5 commented Dec 31, 2019

rob-p commented Dec 31, 2019

Add capability to parse auxiliary target file #469

Add capability to parse auxiliary target file #469

Comments

rob-p commented Dec 31, 2019

rob-p commented Dec 31, 2019

mdshw5 commented Dec 31, 2019

rob-p commented Dec 31, 2019