You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
As of now (3.18.0) the documentation didnt change much.
Also, I couldn't run the pipeline following these suggestions recently. I was using data for E. coli strain BW25113 (4522 genes, 4419 CDS, 121 exon) of the NCBI RefSeq annotation from https://www.ncbi.nlm.nih.gov/datasets/genome/GCF_000750555.1/.
The following seems important:
only "exon" are measured, therefore all features of interest need to have an "exon"
there may not be any special signs or maybe even not space in column number 2 (and often is)
gene_id are neccessary for each "exon" entry
The simplest solution I found for now was using the annotation in gff format and convert & simplify with gffread. Additionally gene_id have to be added then. This can be achieved as below.
Description of feature
The documentation of the rnaseq pipeline that refers to running the pipeline on prokaryotic samples is unfortunately completely outdated. It specifies the required settings for
featureCounts
, even though that tool has been superseded bysalmon
for transcript quantification about 10 pipeline releases ago!On Slack, Marine Cambon has already kindly written up what is needed to run the more recent versions of the pipeline successfully with prokaryotic RNA-seq. However, somebody needs to update the pipeline documentation accordingly. Since this requires only Markdown edits, I think it is a suitable task for the Hackathon?
For some general recommendations on how to write good technical documentation, see the website of the Diátaxis framework.
The text was updated successfully, but these errors were encountered: