US-based DataXight Launches protoXell, Extracting Mechanistic Insights from Large-Scale Perturbation Data to Accelerate Target Discovery and Drug Repositioning
2026-05-21 17:10
Favorite

en.Wedoany.com Reported - DataXight, a biomedical data science company based in Mountain View, California, officially launched its new scientific software, protoXell, on May 19 at the 2026 Bio-IT World Conference in Boston. The product is designed to help researchers bypass complex bioinformatics pipelines and obtain interpretable mechanistic insights directly from chemical and genetic perturbation data. A free trial channel has been opened simultaneously, and a cloud-native version is planned for launch on the DNAnexus platform in June 2026.

Current single-cell perturbation technologies can already generate ultra-large-scale datasets spanning hundreds of millions of cells and covering thousands of perturbations. However, translating this data into actionable biological insights remains a critical bottleneck—existing solutions rely on complex pipelines, multi-tool integration, and highly specialized expertise, significantly slowing the pace of exploration. protoXell's product form directly addresses this gap: it provides an integrated, code-free environment where researchers can compare the effects of different perturbation experiments, dissect pathway-level responses within a point-and-click interface, and generate target discovery and drug repositioning hypotheses ready for wet-lab validation within seconds.

protoXell features a built-in AI interpretation engine called inXighter. Upon completing each analysis, this engine automatically maps molecular signals to known pathways and scientific literature, outputting mechanistic-level biological interpretations in text form. This capability compresses the interpretation work, which previously required bioinformaticians to manually consult literature and pathway databases, into instant, automated output. DataXight has long provided data engineering and bioinformatics services to biomedical institutions, and the knowledge graph that inXighter relies on is built upon this accumulated engineering experience.

Another structural capability of the product comes from its pre-loaded curated perturbation data catalog. This catalog covers over 150 million cells, spanning multiple cell lines, tissue types, compounds, and CRISPRi perturbations. Researchers can directly load analysis-ready datasets for cross-experimental comparisons without needing to build pipelines from raw data. For pharmaceutical companies with proprietary perturbation data, protoXell also offers enterprise-level deployment options, supporting the integration of private data with public reference datasets in cloud or on-premise environments.

In drug repositioning scenarios, protoXell's capabilities have received preliminary validation. A case disclosed by DataXight showed that the system, during a cross-compound perturbation comparison, identified a shared transcriptional response signal between the HIV protease inhibitor saquinavir and beta-adrenergic receptor agonists, revealing that these two pharmacologically distinct classes of compounds trigger partially overlapping biological responses at the cellular level. Such associations are typically difficult to discover using traditional single-target analysis methods and hold direct reference value for mechanistic research and finding new uses for existing drugs.

DataXight CEO Tuan Nguyen stated in the launch announcement that the team has spent years helping scientists understand the complexity of biological data. The goal of protoXell is not just to simplify analysis, but to enable researchers to accomplish analytical tasks that were previously out of reach—moving from perturbation data to mechanistic understanding, and ultimately translating that into better therapeutic decisions. Nguyen previously served as Vice President and General Manager at DNAnexus, leading the construction of multiple large-scale biomedical data platforms such as precisionFDA and St. Jude Cloud. DataXight's leadership team is similarly composed of seasoned professionals in bioinformatics and data engineering from companies like DNAnexus, QIAGEN, and BioTuring.

Single-cell perturbation data is entering a phase of explosive accumulation. As the data scale from combination drug screening, gene editing perturbations, and complex disease models continues to grow, how to transform molecular snapshots of hundreds of millions of cells into verifiable biological hypotheses has become a core efficiency bottleneck in the drug discovery process. The release of protoXell addresses the long-standing engineering gap between "processing data" and "understanding mechanisms." With the launch of the DNAnexus cloud version in June, the accessibility of this tool will expand from existing DataXight customers to the broader biopharmaceutical research community.

This article is compiled by Wedoany. All AI citations must indicate the source as "Wedoany". If there is any infringement or other issues, please notify us promptly, and we will modify or delete it accordingly. Email: news@wedoany.com