King lab utilizes Texas Advanced Computing Center’s resources for data analysis

Benjamin King, an Assistant Professor of Bioinformatics in the Department of Molecular and Biomedical Sciences at the University of Maine, has used the University of Maine’s Advanced Research Computing (ARC) to access resources provided by the Texas Advanced Computing Center (TACC).
Kayla Barton, one of King’s graduate students, was the first in King’s lab to use Stampede2 last year and get analysis pipelines up and running. “When we were starting out, we were helped by one of ARC’s team members, Kevin Wentworth,” King says. “ARC didn’t just give us account information. ARC has staff that researchers can talk with to get the training needed to get things up and running. Having expertise within ARC is equally important to having access to these resources.”
King’s lab has been using Stampede2, which is the flagship supercomputer of the Extreme Science and Engineering Discovery Environment (XSEDE), a single virtual system that scientists can use to interactively share computing resources, data, and expertise.
“My graduate student, Steven Allers, has been re-analyzing published data sets in order to build models for communities of bacterial species across space and time,” King says. This research is an important part of the Maine-eDNA program, an NSF EPSCoR Track-1 grant.
According to King, the almost two-year-old program is rapidly collecting large sets of samples that will be used to create an invaluable resource for studies of complex biological communities. His lab has been re-analyzing data other researchers have collected on aquatic samples similar to what Maine-eDNA aims to capture. By analyzing these metagenomics data sets, King and his team will develop models that can act as an efficient training set and comparison for the program.
“Stampede2 has really worked well for us because of TACC’s support of Docker containers. Without containers, it’s difficult to install and use the analysis software on a Linux cluster,” King explains. “Installing analysis software, like QIIME2, is not like installing Word on your laptop with one installation file. Instead, it’s like a house of cards with all of these dependencies that you need to be aware of, not to mention the architecture of the Linux cluster.”
King’s lab also uses Stampede2 to study patterns of gene expression by analyzing high-throughput RNA sequence data sets. His lab’s ongoing studies seek to understand how non-coding RNAs, including microRNAs and long non-coding RNAs, regulate the function of the innate immune system. A major focus is the role neutrophils play in the hyperinflammatory response to influenza A virus infection, studied using a zebrafish model developed at the University of Maine.
“Current versions of many of the commonly used high-throughput sequencing analysis tools are already installed on Stampede2,” King says. “If a tool is not already installed, ARC and TACC are there to help.”
“Run times have been very short. The TACC help desk has been very responsive and helpful, and the extensive Stampede2 user guide is a great resource,” King says. “Overall, our experience with ARC and TACC has been great.”
