"Structural/Functional Analysis of Proteins" 

November 20th, 2003 2-7pm
South San Francisco Conference Center

click here to view posters

 

 
Title Abstract

I.  Trends in Proteomics:

 "Accommodating Differential Display Data in Systems Biology"

Luke Schneider, Target Discovery, luke_Schneider@targetdiscovery.com

 

Unlike the measurements conducted in other scientific disciplines, those common to biology are often ratiometric (differential display) rather than absolute methods.  This is particularly true of mass spectrometric methods (e.g, ICAT, IDBEST), but is also true of 1-D and 2-D gels.  The result is the proliferation of ratiometric data for protein, DNA and metabolite expression, rather than absolute concentrations and rates.  Statistical data mining methods are often invalid in biology because the number of features measured typically far exceeds the number of samples, restricting researchers to model-based analyses.  However, it is very difficult to apply traditional mathematical modeling strategies to such differential display data.  We introduce a new mathematical paradigm for handling the differential display data type directly in systems biology modeling.

"A New Trend in Proteomics – Getting Useful Information"

Kelvin Lee, Cornell University, KHL9@cornell.edu

 

 

This presentation will cover some of the state of the art methods that are used for proteomics with an emphasis on approaches that are accessible to many laboratories. Although there is a wealth of data being collected on
various problems, a key issue is the use of this data to understand the biological system and ultimately to control that system. We will cover one way in which proteomic information can be used by integrating this data with DNA sequence information and mRNA expression profiles into a deeper understanding of protein synthesis and discuss the successful application of proteomics to achieving enhanced protein secretion in Escherichia coli.

II.  Bioinformatics

"Discovering Known and Unknown Protein Modifications Using MS/MS Database Searching"

Wilfred Tang, Applied Biosystems, tangwh@appliedbiosystems.com

 

 


                      

We present an MS/MS database search algorithm with the following novel features: (1) zone modification searching, which enables the discovery of protein modifications of known (i.e., user-specified) and UNKNOWN delta masses, and (2) pre-indexing of protein databases both by the masses of the peptides obtained from digestion of the proteins using trypsin (or some other digest agent) as well as by the theoretically expected MS/MS fragment masses of each of the peptides.  In zone modification searching, the mass range covered by each MS/MS spectrum is divided into six equally-sized zones.  For each of the six zones, it is hypothesized that a modification occurs in that zone, and the search against the protein database isperformed accordingly.  Pre-indexing of the protein database results in significant efficiency gains and is particularly advantageous in a large zone modification search.  All of these features are implemented in Interrogator, the search engine which run behinds Pro ID and Pro ICAT. The ability to search for a large variety of known and unknown modifications allows a significantly greater percentage of MS/MS scans to be identified.  Moreover, we provide several examples where the ability to
search for UNKNOWN modifications allows the scientist to discover: (1) unexpected modifications which have biological meaning, (2) non-tryptic peptides in a sample which has nominally been digested using trypsin, and (3) other unintended consequences of sample handling procedures.

III.  Sample Preparation

"Immunoaffinity Depletion Of High-Abundant Proteins From Human Serum For Proteomic Sample Preparation"

Chris Miller, Agilent Technologies, Christine_miller@agilent.com

  

 

 

 

Human serum, the most complex sample of the human proteome, is valuable in the discovery of new biomarkers.  Difficulties in the analysis of serum arise from the fact that proteins range in concentration over 12 orders of magnitude. A few well-characterized high-abundant proteins represent up to 90% of the total protein mass and mask identification and characterization of the lower abundant proteins by limiting the dynamic range of mass spectral and gel electrophoretic analysis.

We have developed a single-device multiple affinity removal system for simultaneously selecting and removing six high-abundant proteins from human serum. This uses an immunoaffinity-based column and optimized buffers for efficient reduction of non-specifically bound proteins and effective column regeneration. To compare the performance of the immunoaffinity column to dye-affinity chromatography (Cibacron Blue), bound fractions were resolved electrophoretically, then protein bands were analyzed by LC/MS/MS. We found that the immunoaffinity system exhibits superior specificity of depletion in comparison to Cibacron Blue.

Optimized use conditions permit the column to be used multiple times and column reproducibility was assessed by analysis of the flow-through fractions from more than 200 consecutive runs. The column provides consistent, robust depletion of high-abundant proteins as determined by 1D gel patterns, MALDI-TOF and LC/MS analysis. ELISA results indicate that the multiple affinity removal column depletes 98-99% of targeted high-abundant proteins in a single pass through the column.

The column expands the dynamic range of current LC/MS, 1DGE and 2DGE analytical methods, enabling increased protein identification in human serum.

"THE PERFECT 2D GEL – Fact or Fiction?"

Barbara Cottrell, UCSD, bcottrell@ucsd.edu

 

 Meticulous sample preparation is the key to clean, reproducible comparative 2D gels and there is no universal protocol. Protein extraction procedures will depend on the source of the protein: cell culture, tissue, organelles or intact organisms (C. elegans). The elimination of salts is critical and precipitation or dialysis may be required, especially if protein concentrations are low. While there are many kits offering a variety of extraction procedures and detergents, finding the correct combination for a given system is a matter of trial and error. 2D gel patterns can be detergent dependant. Immunoprecipitation with Sepharose requires careful controls to account for non-specific binding and the use of antibodies bound to magnetic beads can be beneficial. Additional concerns are the relevant abundance of the protein of interest and pre-fractionation of organelles may be necessary to enrich for a specific group of proteins. Sequential extraction can be helpful in reducing the complexity of a mixture but there may be carryover in subsequent steps, an important issue when quantifying proteins. Serum proteins can affect the protein profiles in cell culture extracts as well as tissue. The reduction and alkylation of protein disulfides prior to isoelectric focusing can prevent vertical streaking Human keratin may be a problem and can be eliminated with a few simple precautions. While the number of variables may seem over-whelming, a perfect 2D gel will be the reward for careful planning and attention to detail.

IV. Biological Applications

"Host-Pathogen Interactions: Proteomic and Genomic Analysis of Host Response"

Sandra Maloney, Lawrence Livermore National Labs, smaloney@llnl.gov

 

 

Yersinia pestis, the causative agent of plague, is a gram negative, highly communicable bacterium, known both as a natural pathogen and as a biothreat agent. Y. pestis functions via the activation of the Type III secretion mechanism that enables the bacterium to adhere to and deliver virulence factors inside the host cell. This secretion mechanism is common to several human pathogens including Y. psuedotuberculosis and Y. enterocolitica, near neighbors of Y. pestis with very similar genomic content yet dramatically different clinical manifestations – Y. pestis has a high mortality rate while Y. psuedotuberculosis and Y. enterocolitica cause intestinal distress. We have applied genomic and proteomic technologies to characterize host response to these three closely related pathogens as a means to define presymptomatic biomarkers for plague. Specifically we have used targeted gene arrays and 2D DIGE to characterize the genomic immune response and the global proteomic response. Total RNA and protein fractions were isolated from macrophages after pathogen exposures. Our results reveal shared host expression changes indicative of a common immune response; however, distinct pathogen-specific expression differences were also observed. Of 268 immune response genes analyzed, 180 were differentially expressed after pathogen exposure and 18 were specific for Y. pestis exposure. By 2D DIGE, we detected 1000’s of proteins of which ~250 were differentially expressed in response to the pathogen exposures and ~25 were specific for Y. pestis. These pathogen-specific genomic and proteomic changes clearly distinguish the three different pathogen exposures and represent novel biomarkers for plague. This combined proteomic and genomic approach promises to identify diverse biomarkers for a variety of diseases.

This work was performed under the auspices of the U.S. Department of Energy by University of California Lawrence Livermore National Laboratory under contract No. W-7405-ENG-48 with support from the Department of Homeland Security (Biological Warning and Incident Characterization Program).

 

"Gene and Protein Expression Profiling Reveal Unique Cellular Targets and Signaling Pathway of Tumor Suppressor Gene FHIT in Lung Cancer"

 

Jing Zhu, Ciphergen Biosystems, jzhu@ciphergen.com

 

 

Tumor suppressor gene (TSG) plays an important role in human cancer development. Its activity modulates a biological network involving many genes and signaling pathways that control various cellular processes. High-throughput systems that can provide a global view of cellular function are necessary to gain a comprehensive understanding of such a complex biological system. In this study, we used a complementary approach with DNA microarray (Affymetrix GeneChips) and ProteinChip (Ciphergen) technologies to quantitatively monitor the FHIT TSG-induced changes at both gene and protein levels in non-small cell lung carcinoma (NSCLC) cells. We performed gene expression analysis on Ad-FHIT-transduced cells, compared with the PBS-treated mock and empty vector or Ad-LacZ-treated negative controls. The data were analyzed by a perfect-match (PM) model-based analysis. About 200 genes differentially and specifically expressed in Ad-FHIT-transduced cells were identified and these genomic data were further compared and validated with the public databases. A protein expression analysis using ProteinChip technology was also simultaneously performed using the same samples. About 40 cellular targets that are either differentially up-or down-regulated by FHIT were identified by a serial analysis of fractionation, profiling, and identification. More than 70% of these differentially expressed proteins identified are also found their corresponding gene targets pulled out on DNA microarray. A comparative analysis of the gene and protein expression profiling revealed several unique cellular targets and signaling pathways involved in FHIT tumor suppressing activity. Some of these targets were further confirmed by immunoblotting and biologically validated by gene-specifc RNAi. These findings clearly linked FHIT tumor suppressing function to the regulation of cell proliferation and apoptosis. Our data demonstrated that the complementary gene and protein expression technology is a powerful tool for generation of hypothesis and for identification of specific cellular targets and signaling pathways mediated by a specific gene product in a complex biological network.

TUTORIAL:  "Analyzing Peptides and Proteins with Mass Spectrometry and Database Searching" by Jimmy Eng, Institute for Systems Biology  jeng@systemsbiology.org

  

Methods of searching public sequence databases with mass spectrometry data for peptide and protein identification will be presented.  Specifically, the mechanisms of mass spectrometry database searching (aka peptide mass fingerprinting), and tandem mass spectrometry database searching will be discussed, along with pointers to access public search engines and publicly available sequence databases.
"Advanced Mass Spectrometric Approaches for Rapid and Quantitative Proteomics" by 
Ljiljana Pasa-Tolic, Senior Research Scientist, Pacific Northwest National Laboratory ljiljana.pasatolic@pnl.gov
This talk describes and demonstrates a global strategy that extends the sensitivity, dynamic range, comprehensiveness, and throughput of proteomic measurements.  The two stage strategy exploits Fourier transform ion cyclotron resonance mass spectrometry (FTICR) to first validate accurate mass tags (AMTs)1 produced by global protein enzymatic digestion for a specific organism, tissue or cell type from potential mass tags identified using conventional tandem mass spectrometry (MS/MS) methods, providing the basis for subsequent measurements without the need for MS/MS.2,3 This global tryptic digest approach bypasses the 2-D PAGE separations and when combined with the enhanced sensitivity and dynamic range of the FTICR instrumentation (i.e., by the application of data-dependent active FTICR dynamic range enhancement methods using external ion m/z selection)4,5 expands the identification of proteins while eliminating the inherent complications and more limited coverage associated with 2-D PAGE.  A key advantage of this approach is, however, that it enables high throughput and high precision quantitative measurements of changes in gene expression based upon stable isotope labeling (i.e., during cell culture using labeled media or Cys-peptide specific labeling methods), predetermined AMTs and the high MMA achievable by FTICR MS.
“Implementation and Uses of Automated de novo Peptide Sequencing by Tandem Mass Spectrometry”
Richard S. Johnson, Senior Staff Scientist, Immunex Corporation, JohnsonR@immunex.com

There are several computer programs that can match peptide tandem mass spectrometry data to their exactly corresponding database sequences, and in most protein identification projects these programs are utilized in the early stages of data interpretation. However, situations frequently arise where tandem mass spectral data cannot be correlated with any database sequences. In these cases, the unmatched data could be due to peptides derived from novel proteins, allelic or species-derived variants of known proteins, or post-translational or chemical modifications. Two additional problems are frequently encountered in high throughput protein identification.  First, it is difficult to quickly sift through large amounts of data to identify those spectra that, due to poor signal or contaminants, can be ignored.  Second, it is important to find incorrect database matches (false positives).  We have chosen to address these difficulties by performing automatic de novo sequencing using a computer program called Lutefisk.  Sequence candidates obtained are used as input in a homology-based database search program called CIDentify to identify variants of known proteins.  Comparison of database-derived sequences with de novo sequences allows for electronic validation of database matches even if the latter are not correct.  Modifications to the original Lutefisk program have been implemented to handle data obtained from triple quadrupole, ion trap and quadrupole / time-of-flight hybrid (Qtof) mass spectrometers.  For example, the linearity of mass errors due to temperature-dependent expansion of the flight tube in a Qtof was exploited such that isobaric amino acids (glutamine / lysine and oxidized methionine / phenylalanine) can be differentiated without careful attention to masscalibration.

 

 

THE Proteome Society
Post Office Box 197
Ross, CA 94957-0197 

Telephone: (415)
459-2266,  Fax: (413) 581- 6411, E-mail: info@proteome.org 
Copyright © 2001-2004, The Proteome Society
webmaster@proteome.org