.. documentation master file, created by sphinx-quickstart You can adapt this file completely to your liking, but it should at least contain the root `toctree` directive. rnaends R package to study exact RNA ends at nucleotide resolution ================================================================== .. raw:: html .. role:: red The rnaends R package focuses on exact 5'-end and/or 3'-end quantification of RNAs *via* targeted RNA-Seq. Features -------- * `Pre-processing raw reads FASTQ file `_ * Reads parsing and validation * Reads feature extraction * Demultiplexing into separate fastq files * Quantification of 5'-end or 3'-end of mapped reads * Unique Molecule Identifiers (UMI) use for PCR duplicates elimination * Mapping: `map_fastq `_ wrapper for Bowtie or Subread with appropriate default parameters * Quantification: `quantify_5prime `_ or `quantify_3prime `_ functions * Downstream analysis * `Transcription start sites (TSS) identification `_ * `Signal periodicity analysis `_ * `Post-transccriptional modifications quantification `_ * `Differential proportion statistical analysis `_ Applications ------------ * `TSS identification `_ * `Endoribonucleolytic cleavage sites identification `_ * `Co-translational degradation `_ * `Post-transcriptional modification & differential proportion analysis `_ .. image:: img/rnaends.package.overview.png :alt: Overview of the features provided in the rnaends package grouped by phases of an RNA-end sequencing project. **Figure 1: Overview of the features provided in the rnaends package grouped by phases of an RNA-end sequencing project.** **A)** Overview of read structure: 1 random nucleotide, then the 4 nucleotides of barcode (here either CAAG, GTAT, TACA or TCGG) identifying 4 multiplexed samples, then the recognition sequence (CGGCACCAACCGAGG), then a UMI (7 random A, C or G nucleotides), then 3 nucleotides of control sequence (CGC), and eventually the actual 5' end of the RNA (20 nucleotides) which can be mapped onto the reference sequence; **B)** `read_features` table obtained with the functions `init_read_features` and `add_read_feature` corresponding to the read structure in panel A; **C)** for FASTQ reads pre-processing and validation based on a read_features table describing the structure of sequencing reads; **D)** Distribution of read features validity described in panel A and represented by an upset plot generated after the parsing, validation and demultiplexing of a raw read FASTQ file. Each combination of valid features is displayed in the lower upset plot while the upper bar plot shows the corresponding number of reads; **E)** for aligning reads on reference sequences and compute abundances at the single nucleotide resolution to obtain **F)** a count table representing the abundances (*count*) of RNA 3' or 5' ends mapped to a reference sequence (*rname*) at a certain location (*position*) on a strand. In this example, the reads contain UMI extracted in the pre-processing step that are used at this step to remove PCR amplification duplicates introduced in the library preparation; **G)** `rnaends` provides functions for downstream analysis of the count table for TSS identification, analysis of RNA-ends species proportions (*e.g.* 5'P/5’PPP in WT *vs.* mutant strain), periodic signal analysis such as ribosome translation speed profiling, or the exploration of 3' end modifications; **H)** rnaends count table can also be analyzed further by other libraries: we illustrate this by a differential expression analysis to identify endoribonucleolytic cleavage sites by comparing abundances between WT and endoRNase deletion mutant strains. .. toctree:: :maxdepth: 3 :caption: Installation INSTALL.md perf/performances.laptop16GB.md .. toctree:: :maxdepth: 3 :caption: Vignettes vignettes/preprocessing.md vignettes/TSS_identification.md vignettes/Endoribonucleolytic_sites_prediction.md vignettes/Degradation_profiling.md vignettes/Post-transcriptional_modifications.md .. toctree:: :maxdepth: 1 :caption: Reference manual functions/reference_manual.md .. toctree:: :maxdepth: 1 :caption: About ABOUT.md