Clinical trials and high-throughput flow cytometry produce large amounts of data, leading to an analysis bottleneck unaddressed by commercially available cytometry analysis software. As the throughput, applications, and content of flow cytometry have increased exponentially over the past several years, cytometrists are beginning to embrace cloud-based data-management solutions. The newest version of Tree Star’s FlowJo visualization and analysis software for flow cytometry advances research in flow cytometry by providing robust tools for analysis automation and a bridge to repository/cloud-based data.
We recently analyzed a large dataset from a clinical trial of anti-IgE (omalizumab), which examined whether combination therapy with omalizumab and allergen immunotherapy for ragweed reduced seasonal allergic rhinitis compared with immunotherapy alone. The four-arm, randomized, double-blind, placebo-controlled study had nine time points, eight panels examining immune cell activation and one negative control panel—producing 10,991 flow cytometry standard (FCS) data files.
Manual organization, analysis, and report generation with these FCS files would produce a large bottleneck, slowing the progress of trial reporting and publication. However, with three key tools described in this article—group organization, templates, and command line—FlowJo provided an automation solution, addressing the bottleneck with a scalable solution that can:
Groups Organize Samples and Enable Batch Analysis
An intuitive user interface facilitates rapid setup of an analysis strategy, including statistics, hierarchical cellular subsetting through “gating” (boundaries which isolate events), and batch application to related samples using a patented groups structure. Data organization and management is a critical component in any data-analysis strategy, and groups organize related samples in a database-like structure, add metadata, apply analysis, and provide the underlying organization for batched reports.
Templates Execute Consistent Analysis for Repeated SOP
FlowJo has the ability to generate workspace templates, which saves sample organization, analysis strategy, and report output options without the flow cytometry data (FCS) files. Thus, when data is placed into a template, samples are organized into groups, analyses executed, reports generated, and analysis workspaces created and saved for review.
Command Line Reduces Time from Collection to Reports
The final piece of the automation puzzle comes through the ability to launch FlowJo without a user interface or human intervention. With a few lines of text, command line on Mac, PC, or LINUX is used to point FlowJo to the files to be analyzed, perform auto-compensation (signal processing), calculate gates and statistics, and produce output files such as batch spreadsheet and graph reports (Figure 1).
Case Study 2: HT Analysis Combining Manual Gating and Clustering
|Click Image To Enlarge +
|Figure 2. Gating strategy and clustering algorithms: Gating on singlet CD3+ T cells using a hierarchal gating strategy in FlowJo. This data is exported for clustering in GenePattern and R, and clustering results brought back into FlowJo for analysis and visualization.
By creating a preprocessing template, some simple and very common actions to the data can be applied. In this case T cells of interest were examined by setting up a hierarchy of gates (in order) on singlet cells, a simple clean-up gate to remove the cells on the axis, and a “magnetic” CD3+ gate.
This gating analysis was applied to all samples stained with the T-cell cytokine reagent panel (Figure 2), generating reports for QA that flagged outliers, and gated T-cell data (or further subsetting to analyze CD4+ and CD8+ T cells), and exported this pre-processed data for clustering analysis. Leveraging this template and command line, preprocessing analysis can be rapidly and consistently applied to all samples in this study.
Clustering Using GenePattern and R
GenePattern and the statistical program R offer freely available algorithms that perform clustering of flow cytometry data. In this case, we used two nonparametric clustering methods, SAMspectralclusterFCS in GenePattern and flowMeans in R, which both export .csv spreadsheet files representing the clustering results. This data can be drag-and-dropped or imported with a command line argument into FlowJo for association with the matching sample.
Currently, we are working to integrate these methods into FlowJo to enable those unfamiliar with the R environment to leverage the power of combining traditional gating analysis with automated clustering algorithms in a user-friendly environment.
Visualizing Clustering Results
Now, the clustering results can be analyzed and visualized like any other FlowJo-gated populations. When the cluster number file is dragged into FlowJo, a parameter for cluster number is created, and gates are made on each cluster number that separate the cells assigned to that cluster by SAMspectral (or any other algorithm) from all other cells.
|Click Image To Enlarge +
|Figure 3. FlowJo multigraph overlay of clustering results.
We can create a multigraph overlay of clusters meeting our threshold criteria (e.g., clusters that have more than 1% of the cells) as shown inFigure 3, allowing us to rapidly visualize cytokine-producing T cells.
Comparing Clustering and Manual Analysis
Finally, a new platform within FlowJo, FlowDx, can be used compare manual analysis (from experts and technicians) and clustering results from different algorithms. Using three statistical measures (match ratio, F-measure, and Davies-Bouldin), FlowDx evaluates consistency, accuracy, expert gating, and consensus analysis numerically to determine which classification methods agree and which do not. Critically, this allows for the assessment and comparison of manual and automated gating methods in high-throughput analysis.
The throughput and content of flow cytometry assays is a data-management inflection point driven by the exponential increase in the number of fluorophores conjugated to antibodies used for cytometric interrogation of biological samples, an increase in the number of parameters that cytometers can detect, and an explosive growth in applications.
Cytometry is in a nascent stage of cloud/LIMS adoption; FlowJo Version 10 was built from the ground-up for cloud integration and provides a bridge to analyzing cloud-hosted data and automating analyses.
Critically, data analysis, report generation, and the entire FlowJo workflow are no different from analyzing files and saving reports locally. Thus, FlowJo can integrate with a data-management solution and provide seamless integration from the current paradigm of local file storage to accessing cloud/LIMS-based flow data for analysis and report generation.