Deliverables:
Work products and documents related to coordinating and analyzing large genomic databases in epidemiological studies; - Ad-Hoc
Work products and documents related to providing support and expertise in the Unix operating system, databases (including Oracle) and programming languages; provide support and expertise with the public bioinformatics and genomic databases and tools. - Ad-Hoc
Work products and documents related to developing robust pipelines to integrate publicly available data into the downstream analysis of NHGRI/DIR-generated research data; develop computational pipelines for processing and analysis of a variety of next- generation sequencing data. - Ad-Hoc
Work products and documents related to implementing Web-based portals for the dissemination and visualization of research data, including the application of visualization programs intended to facilitate the display of large-scale sequencing data. - Ad-Hoc
Work Details:
Support and analyze large-scale genetic and genomic data from epidemiological studies. This includes but are not limited to data manipulation, algorithmic implementation, statistical programming, and integrated genomic analyses 1
Develop computational pipelines for processing and analysis of a variety of next-generation sequencing data. 2
Develop robust pipelines to integrate publicly available data into the downstream analysis of NCI-generated data. 3
Provide support and expertise in the Unix operating system and programming languages (including Python, R) 4
Install, troubleshoot and run open-source and commercial scientific software on platforms 5
Provides programming and troubleshooting support to the Federal Government in the dissemination of research data.
Generate and optimize programs and scripts for the analysis of data; create programs and algorithms and develop computational infrastructure resources for organizing and parsing data from large and complex data.
Serve as bioinformatics expert and coordinate with teams of biologists to conduct experimental queries and/or perform portions of studies using complex procedures and techniques common to modern bioinformatics.
Coordinate building bioinformatics infrastructure to ensure easy and meaningful scientific analysis and interpretation of data.
Provide broad-based programming and analytic support for a wide variety of bioinformatic and research projects.
Performs computations on research data analysis.
Perform computational analysis of, and interpret results.
Provide reports based on analysis of scientific data.
Perform sequencing and alignment of raw data, and interpret new data using larger public access datasets.
Provide interpretive analyses of data derived from different experimental platforms to generate biological meaning.
Write custom programs and algorithms to support data analyses and discovery.
Works with staff on scientific programming and experimental design.
Collaborate with scientists to design, analyze, manage and interpret all types of data.
Work with staff on planning of experiments, and data analysis for internal and collaborative projects; use bioinformatics expertise to advise and help bench scientists on experimental design and trouble-shooting.
Work with staff to develop specifications for new analysis; design, test and implement solutions.
Make recommendations to investigators about the correct computational tools for testing scientific hypotheses and reaching valid conclusions.
Records observations and report results at weekly laboratory meetings.
Maintain proper and detailed documentation of the analysis performed and report results at lab meetings.
Attend scientific and programming meetings; take and compile comprehensive notes; organize and edit content of meeting reports.
Prepare scientific reports and progress reports; assemble data to prepare tables, graphs and slides; conduct scientific and program related information searches and report results.
Contribute to manuscript development, including manuscript drafting and critical review of manuscript content
Provides statistical support / analysis on research data.
Devise Client methods of statistical analysis for collected data.
Utilize and adapt existing bioinformatics techniques to check for trends and patterns in the data.
Perform data processing and data analysis with existing computational and statistical methods.
Assist in evaluating and interpreting results for validity and scientific meaning.
Provide support and expertise in the Unix operating system and programming languages (including Python, R)
Provides research / service goals in the context of the laboratory's overall mission.
Create Client programs and algorithms that facilitate discovery of knowledge in investigating large and complex data.
Develop and optimize programs and scripts that facilitate organization, integration and data-mining of large data sets; integrate these models into a framework of best practices.
Work with staff on the development and maintenance of bioinformatics tools, scripts and pipe-lines for data.
Participate in research design with investigators for determining best practices pertaining to the bioinformatics analysis in new and ongoing projects.
Evaluates new types of experimental approaches to protocols based on knowledge of scientific literature, available facilities and research needs.
Research and review literature to retrieve targeted clinical or scientific information, including Client statistical methods, from publicly available resources.
Collaborate with staff to review current and historical procedures for the acquisition, quality control and management of data.
Analyze and evaluate data cleaning and harmonization needs in the using a variety of descriptive statistics and analytic methods.
Identify new tools and resources for reaching biologically meaningful conclusions.
Collaborate with experimentalists and computational biologists to develop new computational tools to answer research questions of interest.
Independently coordinates the training of personnel in the use of scientific software applications, statistical software applications and programmatic software applications.
Provide training in and technical support (including product updates and version control) for programs, algorithms, archives, and pipelines generated during the course of this work.
Instruct staff in computational analysis of data.
Provide ad hoc trainings and hands-on workshops on the use of bioinformatics tools.
Provide training of students, new investigators, and other laboratory personnel in the use of techniques, procedures and equipment to complete the objectives of the laboratory.
Install, troubleshoot, and run open-source and commercial scientific software and bioinformatics techniques used in data analysis
Initiates interdisciplinary collaborations with other research centers.
Work with an interdisciplinary team to apply computational data analysis approaches to make biological discoveries.
Collaborate with group members in experiments associated with data collection.
Work with staff, collaborate with outside researchers, and contribute to positive overall teamwork; teach
Bioinformatics principles and methodologies.
Collaborate with biologists, statisticians and/or other bioinformaticians in the design of models summarizing/explaining experimental data.
Deliver at least one presentation per year to audiences outside the Government.
Attend group meetings; present findings; author publications resulting from projects.
Present analysis results at research conferences and meetings.
Present new research data in group settings, at meetings or seminars.
Implement Web-based portals for the dissemination and visualization of research data, including the application of visualization programs intended to facilitate the display of large-scale sequencing data within platforms such as IGV, JBrowse, the Exome Aggregation Consortium (ExAC) browser and gnomAD browser.
Develop computational pipelines for processing and analysis of a variety of next generation sequencing data.
Provide training to scientific staff on bioinformatics techniques used in data analysis.
Coordinate and analyze large genomic databases in epidemiological studies. This includes but are not limited to data manipulation, algorithmic implementation, statistical programming, and integrated genomic analyses.
Provide support and expertise in the Unix operating system, databases (including Oracle) and programming languages (including Perl, Python, BioPerl, Java, XML, SQL, JavaScript, JavaServer Page, C/C++, servlets and R), object-oriented programming and design, visualization and dissemination of genetic and genomic data.
Install, troubleshoot and run open-source and commercial scientific software on the Unix and Linux platforms.
Provide support and expertise with the public bioinformatics and genomic databases and tools (such as GATK, SAMtools, BCFtools, PLINK, cBioPortal, gnomAD, GTEx, GDC).
Develop robust pipelines to integrate publicly available data into the downstream analysis of NHGRI/DIR-generated research data.
1, 2, 3, 4, 5 represents priority rankings, where 1 is highest priority and 5 is lowest priority of those ranked
Minimum Education
Masters
Certifications & Licenses
Master's degree in a related discipline - PhD
Field of Study
Miscellaneous Biology
Software
MS Office Suite
Python
MATLAB
Skills
Strong verbal and written communication skills.
Experience with data mining and visualization for large-scale next-generation sequencing data (such as WGS data)
Excellent analytical, organizational and time management skills.