JOURNAL ARTICLE

End-to-end deep learning pipeline for real-time Bragg peak segmentation: from training to large-scale deployment

Abstract

X-ray crystallography reconstruction, which transforms discrete X-ray diffraction patterns into three-dimensional molecular structures, relies critically on accurate Bragg peak finding for structure determination. As X-ray free electron laser (XFEL) facilities advance toward MHz data rates (1 million images per second), traditional peak finding algorithms that require manual parameter tuning or exhaustive grid searches across multiple experiments become increasingly impractical. While deep learning approaches offer promising solutions, their deployment in high-throughput environments presents significant challenges in automated dataset labeling, model scalability, edge deployment efficiency, and distributed inference capabilities. We present an end-to-end deep learning pipeline with three key components: (1) a data engine that combines traditional algorithms with our peak matching algorithm to generate high-quality training data at scale, (2) a modular architecture that scales from a few million to hundreds of million parameters, enabling us to train large expert-level models offline while deploying smaller, distilled models at the edge, and (3) a decoupled producer-consumer architecture that separates specialized data source layer from model inference, enabling flexible deployment across diverse computing environments. Using this integrated approach, our pipeline achieves accuracy comparable to traditional methods tuned by human experts while eliminating the need for experiment-specific parameter tuning. Although current throughput requires optimization for MHz facilities, our system's scalable architecture and demonstrated model compression capabilities provide a foundation for future high-throughput XFEL deployments.

Keywords:
Software deployment Pipeline (software) Training (meteorology) End-to-end principle Deep learning Computer science Scale (ratio) Segmentation Artificial intelligence Real-time computing Cartography Geography Software engineering Operating system

Metrics

4
Cited By
6.34
FWCI (Field Weighted Citation Impact)
37
Refs
0.89
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Machine Learning in Materials Science
Physical Sciences →  Materials Science →  Materials Chemistry
Advanced X-ray Imaging Techniques
Physical Sciences →  Physics and Astronomy →  Radiation
Enzyme Structure and Function
Physical Sciences →  Materials Science →  Materials Chemistry
© 2026 ScienceGate Book Chapters — All rights reserved.