Closing the Context Gap with Seqera: Benchling Integration, Data Lineage, GxP-Enablement, and More!
Context Makes Scientific Analysis Relevant
Knowing where a sample came from, how it was analyzed, and who was involved along the way is what turns raw output into something teams can act on and valuable historical context. If the journey from sample to insight isn't traceable end-to-end, the result loses its value.
This is exactly the challenge Nextflow was built to address: making scientific analysis accessible and reproducible. We've extended this philosophy across the entire Seqera ecosystem, preserving the full chain of context from lab to results. At Nextflow Summit Boston 2026, we unveiled several product updates that put this into practice. These updates span two key themes: end-to-end data context and enterprise compliance. See here for updates on Intelligent Compute and Co-Scientist.
End-to-End Data Context
Science doesn't happen in isolated steps. It flows from sample collection to analysis to insight. We’ve invested deeply to keep the thread intact across that entire journey, so context is not only preserved, but consolidated, enabling tools, teams, and organizations.
Benchling Integration via MCP
Seqera’s Integration with Benchling
Bioinformatics is a critical piece of the R&D knowledge graph, but it has historically lived in a silo, disconnected from the lab data that gives it meaning. Through Benchling’s AI Connectors, bench scientists can now integrate Seqera into Benchling via Seqera MCP. Users can launch and track Nextflow pipelines, investigate results, and spin up interactive analysis environments, all without leaving Benchling. Importantly, pipeline results connect back to the samples and people that generated them. No silos, just data at your fingertips.
“Benchling and Seqera are partnering to deliver shared AI toolkits that help life sciences teams build, scale, and accelerate research and development.”
Lauren DeVos, Head of Corporate Development,
Benchling

Data Lineage
Read the documentation
Once a pipeline has run, the next question is: what exactly happened? Every file has a story: which pipeline produced it, which parameters, which inputs, and what happened to it downstream. Available since Nextflow 25.04, data lineage tracks this full chain, assigning every run, task, and output file a unique lineage ID with a complete provenance JSON record.
This lineage is now surfaced directly in Seqera, attached to both pipeline runs and objects in Data Explorer, so provenance lives where teams are already working. Teams can trace every file's provenance end-to-end, bringing provenance investigations out of the Nextflow CLI and into the centralized interface teams already use. This enables faster debugging, easier audits, and broader team access to run history. For customers in regulated industries like pharma and clinical genomics, it directly supports compliance and data integrity requirements.
The run details page now includes full lineage data for workflows, tasks, and output files, so you can see the complete picture without leaving the run view. Every lineage ID and lineage label is a navigable link within your organization, so tracing the path between related runs, tasks, and files is as simple as clicking through. In Data Explorer, lineage metadata appears directly on objects and is fully searchable — find what you need without hunting through logs.
Data Explorer: IGV and Molstar frameworks
Read the blog post
When an analysis wraps up, it is often critical to spot-check results. What follows is downloading large files, launching separate tools, configuring paths, and potentially racking up cloud egress costs. This is unsustainable as pipelines, projects, and teams scale.
To address this, Data Explorer now embeds IGV and Mol* directly in the browser, so teams can visualize genomic and structural data in-place — BAMs, CRAMs, VCFs, PDBs, and more — without downloads, local setup, or cloud egress costs. Teams can go from pipeline output to visual quality control in seconds, without leaving the platform. No data pre-processing is required, it works natively with cloud object storage, and it reduces the need for external tools — making Seqera a more end-to-end environment for genomics and structural biology.
Studios SSH
SSH into Your Studio Session
Data Explorer handles quick inspection, but when an analysis needs deeper work (e.g. custom script, annotation or writing an update to a result access)we want a rich, interactive environment that feels local. When your preferred tools, extensions, and configurations live on your laptops, while the data and compute live in the cloud, it forces a choice between moving data or moving tools and infrastructure. Now you don’t have to.
Studios, the interactive analysis workbench of Seqera, now supports direct SSH access to running interactive sessions. Connect using any standard SSH client, or use VS Code's Remote-SSH extension to work inside a cloud-based Studio as if it were your local machine. Your code runs in the cloud, your data stays in the cloud, but your experience stays the same. SSH access also enables CI/CD integration, secure terminal access with end-to-end encryption, and reproducible environments that new team members can onboard instantly — all while IT maintains a single, standardized infrastructure underneath.
“Studios allows us to easily connect to our data sources and conduct our interactive analyses. The team at Seqera have been wonderful partners in helping us set up the system and onboard our bioinformaticians. We look forward to continuing to work with Seqera in the future.”
Andrew Foster, Bioinformatician
Mirador Therapeutics

Enterprise Compliance
The same functionality that makes analysis reproducible and traceable also makes your compliance headaches melt away. As bioinformatics moves deeper into clinical and manufacturing workflows, we need to meet the standards those environments demand, which is why we’re excited to announce new compliance controls, identity management, and authentication capabilities that keep teams audit-ready without slowing them down.
Full GxP Offering
GxP-Enabled Bioinformatics with Seqera
Pharma and biotech teams running regulated bioinformatics workflows face a persistent gap between what generic platforms provide and what validated environments require. Audit trails lack the before-and-after detail regulators expect under 21 CFR Part 11, validation is slow and largely DIY, and teams struggle to enforce consistent methods across sites while keeping internal IT and quality aligned on a path to production.
Today, we are announcing our full GxP offerings, enabling organizations to run validated bioinformatics pipelines in regulated environments. Our approach combines enterprise architecture with proven bioinformatics expertise to deliver secure, traceable, and scalable analysis aligned with FDA and EMA expectations. Teams can maintain velocity while meeting compliance requirements with full traceability from research through clinical and manufacturing environments.
- →GxP-Ready Design: Purpose-built to support cGLP, cGCP, and cGMP environments with GAMP5-aligned documentation, comprehensive controls, and validation-ready architecture.
- →Accelerated Validation: Pre-built validation templates, test protocols, generic test evidence, and risk-based planning aligned with GAMP5 2nd Edition; with flexibility to align with your preferred validation frameworks and partners.
- →Your Infrastructure: Deploy workflows directly in your environment without moving data. Self-hosted architecture integrates seamlessly with existing IAM, storage, and networking while capturing complete data and compute provenance.
- →Full Traceability & Reproducibility: Complete workflow provenance captures inputs, steps, environments, and outputs. Every analysis is reproducible and audit-ready with full run history from research through clinical and manufacturing environments.
- →Audit-Ready Operations: Human-readable audit logs with comprehensive, 21 CFR § 11- and Annex 11-aligned, and inspection-ready change tracking.
"Seqera Platform now serves as a validated orchestration layer for regulated bioinformatics pipeline execution that is a part of the manufacturing process, supporting ongoing clinical trials. The deployment is designed to scale seamlessly from clinical development through commercial manufacturing, positioning it for use in GMP operations post-BLA approval."
Albert Park,
Head of Digital for Research and Tech Dev

Identity Provider (IdP)
Enterprise teams manage users and groups centrally in identity providers like Okta or Entra ID, but role assignment requires manual replication of that hierarchy. Now, IdP group memberships map directly to Seqera Teams, with memberships updating automatically at login via SSO. Once Teams are configured with the appropriate workspaces and roles, your IdP becomes the single source of truth for access control: no manual re-sync, no role drift, and fully traceable provisioning for regulated environments.
Cloud Authentication (SSO)
Seqera Cloud Pro customers in regulated industries need SSO to meet security policies but have had no way to connect their identity provider. Now, org admins can easily configure OIDC connections directly from Organization settings. Users authenticate via corporate SSO, eliminating the need for separate passwords and credential sprawl. This gives teams the authentication and authorization controls needed to meet internal security policies and external audit requirements.
What's Next
Every update we've shared here serves the same goal: preserving context across the entire journey from sample to insight. As bioinformatics becomes more central to drug development, the platforms that support it need to keep pace, not just in capability, but in traceability, reproducibility, and compliance. That's what we're building toward, and we're excited for you to put these updates to work
