Centre for Bioinformatics and Intelligent Medicine-News
The Bioinformatics and Intelligent Medicine Team published one research paper on Nucleic Acids Research.
November 06,2024   Editor:Centre for Bioinformatics and Intelligent Medicine
Recently, the Bioinformatics & Intelligent Medicine team has published “Pairpot: a database with real-time lasso-based analysis tailored for paired single-cell and spatial transcriptomics” in the journal Nucleic Acids Research.
The rise of single-cell and spatial transcriptomics sequencing technologies has provided a brand-new perspective for exploring more cancer treatment targets. Single-cell technology can reveal the gene expression profiles of individual cells, while spatial transcriptomics technology captures the spatial distribution of gene expression. Paired single-cell and spatially resolved transcriptomics (SRT) data supplement each other, providing in-depth insights into biological processes and disease mechanisms. Previous SRT databases have limitations in curating sufficient single-cell and SRT pairs (SC–SP pairs) and providing real-time heuristic analysis, which hinder the effort to uncover potential biological insights.
Here, we developed Pairpot (http://pairpot.bioxai.cn), a database tailored for paired single-cell and SRT data with real-time heuristic analysis. Pairpot curates 99 high-quality pairs including 1,425,656 spots from 299 datasets, and creates the association networks. It constructs the curated pairs by integrating multiple slices and establishing potential associations between single-cell and SRT data. On this basis, Pairpot adopts semi-supervised learning that enables real-time heuristic analysis for SC–SP pairs where Lasso-View refines the user-selected SRT domains within milliseconds, Pair-View infers cell proportions of spots based on user-selected cell types in real-time and Layer-View displays SRT slices using a 3D hierarchical layout. Experiments demonstrated Pairpot’s efficiency in identifying heterogeneous domains and cell proportions.
Furthermore, the research team also developed a new spatial multi-modal data format (SMD) with extensible interfaces, which are compatible with multiple SRT platforms, greatly enhancing its versatility and practicality. To visually present the cell type-specific domains, the research team also developed the 3D Landscape, enabling researchers to gain a more intuitive understanding of the distribution and interactions of cell type-specific domains in space.
Pairpot is valuable to biologists not only because it offers easy-to-access paired datasets but also because its lasso-based modules provide a new perspective for developing efficient online computational platforms based on pre-analyzed information and user behavior. Pairpot is also useful for researchers to seek potential biological insights because it provides both overall observations for multiple slices in a 3D layout and precise real-time lass-based analysis for cells or domains of user’s interest.
Pairpot’s intuitive interface and heuristic analysis modules empower users to tailor their workflows according to their specific needs. For researchers who prefer to download the data and conduct analyses offline, Pairpot generates code for online heuristic analysis and alternative bioinformatics tools that allow users to replicate the analysis results offline. If researchers would like to analyze their own paired single-cell and SRT data, they can integrate the interfaces provided by Pairpot into their pipelines, including preprocessing, data integration, neighbor graph construction, and UCell scores evaluation. For researchers without programming capabilities, Pairpot provides valuable resources and powerful tools online to streamline the SRT integrative analysis and enhance the understanding of biological processes and disease mechanisms.