Mass spectrometry-based proteomics focuses on identifying the peptide that generates a tandem mass spectrum. Traditional methods rely on protein databases but are often limited or inapplicable in certain contexts. De novo peptide sequencing, which assigns peptide sequences to spectra without prior information, is valuable for diverse biological applications; however, owing to a lack of accuracy, it remains challenging to apply. Here we introduce InstaNovo, a transformer model that translates fragment ion peaks into peptide sequences. We demonstrate that InstaNovo outperforms state-of-the-art methods and showcase its utility in several applications. We also introduce InstaNovo+, a diffusion model that improves performance through iterative refinement of predicted sequences. Using these models, we achieve improved therapeutic sequencing coverage, discover novel peptides and detect unreported organisms in diverse datasets, thereby expanding the scope and detection rate of proteomics searches. Our models unlock opportunities across domains such as direct protein sequencing, immunopeptidomics and exploration of the dark proteome.
InstaNovo enables diffusion-powered de novo peptide sequencing in large-scale proteomics experiments.
阅读:2
作者:Eloff Kevin, Kalogeropoulos Konstantinos, Mabona Amandla, Morell Oliver, Catzel Rachel, Rivera-de-Torre Esperanza, Jespersen Jakob Berg, Williams Wesley, van Beljouw Sam P B, Skwark Marcin J, Laustsen Andreas Hougaard, Brouns Stan J J, Ljungars Anne, Schoof Erwin M, Van Goey Jeroen, Keller Ulrich Auf dem, Beguir Karim, Carranza Nicolas Lopez, Jenkins Timothy P
| 期刊: | Nature Machine Intelligence | 影响因子: | 23.900 |
| 时间: | 2025 | 起止号: | 2025 Mar 31; 7(4):565-579 |
| doi: | 10.1038/s42256-025-01019-5 | ||
特别声明
1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。
2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。
3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。
4、投稿及合作请联系:info@biocloudy.com。
