RPubs

by RStudio

rmurdoch

Robert W Murdoch

Recently Published

mec_proteome_scatterplots

Scatterplot ggplot2 code for complex proteomes with metadata

almost 5 years ago

mec cassette relative abundances and synteny

This document details methods used to calculate relative abundances of mec cassettes and the custom R scripting used to construct mec cassette synteny maps.

about 5 years ago

mec cassette: phylogenetic trees

This document details the pipeline used to harvest protein homologs, cluster, align and build ML-approximated phylogenetic trees

about 5 years ago

mec cassette: transcriptomics

This document describes the pipeline used for detection of mec genes in contaminated site metatranscriptomes

about 5 years ago

geosmithia.assembly.and.gene.prediction

Overview of assembly, use of MAKER for gene prediction, and preparation for protein annotation

over 5 years ago

geosmithia.comp.resources

resources for geosmithia comp project

over 5 years ago

Metatranscriptomics of Understudied Dehalogenases

over 5 years ago

RAG charts

Simple method for making Red/Amber/Green charts with grouped assays

almost 6 years ago

G. morbida hierarchical annotations for GenBank

Development of a semi-automated, reproducible system for generating a GenBank/NCBI-compatible hierarchical Eukaryotic genome annotation by making a gff file

almost 6 years ago

LC.MinION.assembly.and.annotation

about 6 years ago

Saline Isolates 1908

Primarily a purity check of strain AR

over 6 years ago

G.morbida CodingQuarry Pathogen Mode

CodingQuarry Pathogen Mode was applied to the G. morbida reference genome to search for protein-coding genes which were missed initially. Resulting proteins are subjected to positive selection testing and functionally annotated

almost 7 years ago

Initial.GWAS.with.PLINK2

converting a multi-vcf file into PLINK2 binary format, conducting -glm (linear regression?) for variants against a quantitative phenotype, and characterizing variant positions in regard to proximity to coding regions

almost 7 years ago

UTK BRC Qiime2 Intro Cookbook

almost 7 years ago

G.morbida COG and KEGG PSgene summaries

The goal of this script is to present KEGG and COG functional category analyses for the entire Geosmithia genome vs the positively selcted (PS) genes.

almost 7 years ago

Clean COG ID to COG function mapping file

The goal of this script is to make a clean COG ID to mapping table that can be used for generating your own COG category summary tables and figures.

almost 7 years ago

Arhodomonas Mauve whole genome alignment visualization

Simple script for turning the Mauve backbone file, resulting from an iterative contig reordering, into a more graceful, customizable vector graphic

almost 7 years ago

Affymetrix: Nested Interactions Decisions

There is no standard procedure regarding how to filter DE gene lists. This document describes and executes three standard methods and examines the consequences to interpretation

almost 7 years ago

Affymetrix v2.0: all analyses

This second analysis structuring of the bladder Affymetrix set aims towards characterizing treatment effects, strain effects, and then ultimately functions that are differentially expressed in treated GFR versus treated GF

almost 7 years ago

GFR Affymetrix analysis

This script performs basic overviews of the GFR Affymetrix data by running MDS ordinations and plotting probe intensity.

almost 7 years ago

Affymetrix: tGFR vs. tGF

Examining the differential expression in treated mutant strain vs. treated wild-type

almost 7 years ago

Affymetrix: tGF vs. cGF

Testing the effect of treatment on the control strain

almost 7 years ago

Affymetrix: tGFR vs. cGFR

Pairwise comparison 2, examining the effect of treatment on the mutant strain

almost 7 years ago

Affymetrix: cGFR vs. cGF

one of four pairwise test scripts for differential expression based on Affymetrix chip results processed primarily via the "limma" package

almost 7 years ago

2.Arhodomonas.oxygenases.and.hydroxylases

A survey of oxyenases and hydroxylases in three Arhodomonas genomes

almost 7 years ago

1.Arhodomonas KEGG modules

Characterizing and comparing KEGG modules in three Arhodomonas strains

almost 7 years ago

KEGG search parsing

A simple script for parsing results of a KEGG orthology name search

almost 7 years ago

qPCR validation

Some general metrics for comparing two qPCR standard curve runs

almost 7 years ago

10.gene.diversity.summary

The goal of this pipeline is to take a variety of calculations performed on genes across the 22 strains resequenced and place them into a single database. To this point, gene diversity across the population has been characterized/analyzed by several metrics 1. Gene and protein allele counting 2. Rough protein vs. gene tree ratio calculation 1, dN/dS (w, omega, Ka/Ks, etc.) is being calculated across every gene using the “one-ratio model”, i.e., M0 2. Likelihood ratio test (LRT) for positive selection for each gene; compares the likelihood of a neutral evolution model (M7) with a positive selection model (M8): In most cases, the neutral model will be about as likely as the positive selection model (LR ~ 0). Where positive selection better explains the alignement pattern of a given gene, M8 explains the data better than M7. The LRT applies appropriate statistical testing and provides a p-value based on this difference in likelihoods.

about 7 years ago

Sign In

rmurdoch

Robert W Murdoch

Recently Published