Oral Presentation 45th Lorne Genome Conference 2024

Identifying DNA mutations directly from RNA-seq datasets (#26)

Li Yang 1 , Zhi-Can Fu 2 , Bao-Qing Gao 2 , Fang Nan 1 , Xu-Kai Ma 1
  1. Fudan University, 上海市, 中国, China
  2. Institutes of Biomedical Sciences, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, China

We develop a stepwise computational framework, called DEMINING, to directly detect expressed DNA and RNA mutations in RNA deep sequencing data. DEMINING incorporates a deep learning model named DeepDDR, which facilitates the separation of expressed DNA mutations from RNA mutations after RNA-seq read mapping and pileup. When applied in RNA-seq of acute myeloid leukemia patients, DEMINING uncovered previously-underappreciated DNA and RNA mutations, some associated with the upregulated expression of host genes or the production of neoantigens. Finally, we demonstrate that DEMINING could precisely classify DNA and RNA mutations in RNA-seq data from non-primate species through the utilization of transfer learning.