Buscar

Compendium Introduction to proteomics final

Faça como milhares de estudantes: teste grátis o Passei Direto

Esse e outros conteúdos desbloqueados

16 milhões de materiais de várias disciplinas

Impressão de materiais

Agora você pode testar o

Passei Direto grátis

Você também pode ser Premium ajudando estudantes

Faça como milhares de estudantes: teste grátis o Passei Direto

Esse e outros conteúdos desbloqueados

16 milhões de materiais de várias disciplinas

Impressão de materiais

Agora você pode testar o

Passei Direto grátis

Você também pode ser Premium ajudando estudantes

Faça como milhares de estudantes: teste grátis o Passei Direto

Esse e outros conteúdos desbloqueados

16 milhões de materiais de várias disciplinas

Impressão de materiais

Agora você pode testar o

Passei Direto grátis

Você também pode ser Premium ajudando estudantes
Você viu 3, do total de 166 páginas

Faça como milhares de estudantes: teste grátis o Passei Direto

Esse e outros conteúdos desbloqueados

16 milhões de materiais de várias disciplinas

Impressão de materiais

Agora você pode testar o

Passei Direto grátis

Você também pode ser Premium ajudando estudantes

Faça como milhares de estudantes: teste grátis o Passei Direto

Esse e outros conteúdos desbloqueados

16 milhões de materiais de várias disciplinas

Impressão de materiais

Agora você pode testar o

Passei Direto grátis

Você também pode ser Premium ajudando estudantes

Faça como milhares de estudantes: teste grátis o Passei Direto

Esse e outros conteúdos desbloqueados

16 milhões de materiais de várias disciplinas

Impressão de materiais

Agora você pode testar o

Passei Direto grátis

Você também pode ser Premium ajudando estudantes
Você viu 6, do total de 166 páginas

Faça como milhares de estudantes: teste grátis o Passei Direto

Esse e outros conteúdos desbloqueados

16 milhões de materiais de várias disciplinas

Impressão de materiais

Agora você pode testar o

Passei Direto grátis

Você também pode ser Premium ajudando estudantes

Faça como milhares de estudantes: teste grátis o Passei Direto

Esse e outros conteúdos desbloqueados

16 milhões de materiais de várias disciplinas

Impressão de materiais

Agora você pode testar o

Passei Direto grátis

Você também pode ser Premium ajudando estudantes

Faça como milhares de estudantes: teste grátis o Passei Direto

Esse e outros conteúdos desbloqueados

16 milhões de materiais de várias disciplinas

Impressão de materiais

Agora você pode testar o

Passei Direto grátis

Você também pode ser Premium ajudando estudantes
Você viu 9, do total de 166 páginas

Faça como milhares de estudantes: teste grátis o Passei Direto

Esse e outros conteúdos desbloqueados

16 milhões de materiais de várias disciplinas

Impressão de materiais

Agora você pode testar o

Passei Direto grátis

Você também pode ser Premium ajudando estudantes

Prévia do material em texto

1 
 
Introduction to proteomics 
Basic introduction to biological mass spectrometry and proteomics 
 
 
By 
Martin R. Larsen 
 
Nicolai Bache 
Tine E. Thingholm 
Anne Kjærgaard Callesen 
Kasper Engholm-Keller 
Morten Thaysen-Andersen 
 
 
2 
 
Overview of Proteomics textbook 
 
1. Introduction to proteomics 
 
2. Biological mass spectrometry 
a. Introduction 
b. MALDI – MS 
c. ESI – MS 
d. Tandem MSMS – fragmentation and sequencing of peptides 
e. Hybride instruments 
 
3. Identification of proteins in proteomics 
 
4. Separation methods in proteomics 
a. Protein level 
b. Peptide level 
c. Affinity purification 
 
5. Quantitation in mass spectrometric based proteomics 
a. Label-free quantitation 
b. Absolute quantitation 
c. Chemical isotope labelling 
d. Metabolic labeling 
 
6. Post-translational modifications (PTMs) 
a. Phosphorylation 
b. Glycosylation 
 
7. Clinical proteomics 
a. Introduction 
b. Strategies 
c. Applications 
 
 
 
3 
 
Introduction to proteomics 
1. Proteomics 
In the past decades one of the major focuses of the biological science has been gene technology, 
especially in the field of DNA sequencing and micro-array analysis. Such interests have resulted in an 
exponential increase in the development of techniques for automated high throughput nucleic acid 
sequencing. Within the same time frame, research projects have developed in complexity from the 
characterization of one gene at a time to systematically identifying the entire genetic component of an 
organism. Complete genome sequencing of some unicellular organisms were completed already in the 
late 1970´s (e.g., Sanger F et al., J. Mol. Biol., (1978) 125(2), 225-46). The genome of the first 
eukaryotic organism (Saccharomyces Cerevisiae) was sequenced in 1996 (Goffeau A et al., Science 
1996, 274(5287), 546, 563-7), closely followed by the sequencing of the first multicellular eukaryote 
Caenorhaditis elegans in 1998 (The C. elegans Sequencing Consortium, Science 1998, 282(5396), 
2012-18) and the fruit fly, Drosophila melanogaster (Adams MD et al., Science, 2000, 287(5461), 
2185-96). Today it is routine to sequence entire genomes from your favorite organism and more than 
80 Eukaryotic genomes has been sequenced, including the Human genome which was completed in 
2006 (Gregory SG, Barlow KF, McLay KE, et al., Nature, 2006, 441 (7091): 315–21). The haploid 
human genome contains an estimated 23,000 protein-coding genes, far fewer than had been expected 
before its sequencing (International Human Genome Sequencing Consortium. Nature, 2004, 431 
(7011): 931–45). In fact, only about 1.5% of the genome codes for proteins, while the rest consists of 
RNA genes, regulatory sequences, intron and (controversially) "junk" DNA (International Human 
Genome Sequencing Consortium Nature, 2001, 409 (6822): 860–921). 
The genome defines a delicate framework of highly complex intracellular processes within a cell. Even 
though genetic information provides the basic scaffold for life, there is no direct correlation between 
genomic information and the proteins present in the cell and thereby the phenotype of a living cell or 
organism. A beautiful example of this higher order of complexity is the amazing transformation of a 
caterpillar into a butterfly (see Figure 1.1). They are genetically identical but have two completely 
different phenotypes and from the figure a significant change in the expressed proteins is obvious 
represented by two dimensional gel electrophoresis. In between these phenotypes a large number of 
4 
 
transition states exist, each with a different subset of proteins expressed at the given transition state. 
Another interesting fact is that the identity between the genome of chimpanzee and humans is 98.5% 
indicating a significant similarity; however, the phenotypes are completely different. The differences 
observed at the phenotypic level cannot be accounted for by the genomic differences alone. Thus 
differences between cellular states and cellular function are reflected in differential gene expression 
being manifested at both the mRNA and protein level. However, there are no direct correlation between 
intercellular mRNA levels and protein concentration due to intensive regulatory steps at both the 
mRNA and protein level (Anderson et al. 1997; Steiner et al. 2000; Hegde et al. 2003). Thus protein 
abundance in a biological system is essentially dependent on the rate of protein transcription vs. 
degradation and can only be measured directly at the protein level and not on the level of the mRNA. 
This is true for the majority of the proteins in the cell however, housekeeping proteins such as actin, 
keratin, tubulin have a very high correlation between mRNA and protein. 
 
 
 
 
 
 
 
 
 
 
Figure 1.1: The transformation of a caterpillar into a butterfly. The 2D gels show the different 
protein pattern present in the different steps during the morphological changes. 
 
5 
 
Additional complexity and diversity of the cell is introduced by post-translational modifications (PTM) 
of proteins (i.e., modifications of proteins introduced after the protein has been synthesized). These 
modifications are often chemical groups which are covalently linked to the proteins by the action of 
specific enzymes, e.g. disulfide linkage, glycosylation and phosphorylation. Some of the PTMs are 
reversible modifications e.g., phosphorylation, whereas some are irreversible, e.g., glycosylation. In 
total more than 200 PTMs are known today and many more are likely to exist in nature. PTMs have 
been shown to be extremely important in stabilising and influencing protein structure, defining 
localisation of proteins e.g., via lipid anchors, or they can act as key regulator in diverse cellular 
processes such as metabolic pathways, signal transduction, cell growth, etc. (Hunter 1998; Cohen 2000; 
Hunter 2000; Cohen 2002). Thus, regulation of protein expression and PTMs is directly responsible for 
virtually all cellular functions. 
The human genome has revealed about 23000 coding sequences (proteins) however due to post-
transcriptional processing (splicing) and post-translational processing (PTMs, specific proteolysis etc.) 
the true amount of protein species in a living cell could be in the area of several hundred thousand (see 
Figure 1.2), which puts high demands for any analytical technique to analyze the whole subset of 
proteins – the proteome - of a living cell. 
The term “proteome” was invented in Sydney, Australia, in 1995 and is defined as the entire 
complement of proteins expressed by any given genome, cell, tissue or organism (Wasinger et al. 1995; 
Wilkins et al. 1996) at a given time, and the term proteomics is the large scale study of proteomes. 
More refined definitions have later been applied and today if one is working with a subset of proteins 
they often call this proteomics too, e.g., the mitochondrial proteome, plasma membrane proteome etc. 
The key to proteomics is the possibility to identify and quantify proteins in complex samples in 
different situations, e.g., healthy versus disease or control versus drug treated cells. As such, 
protoemics is very technological minded. In order to achieve identification and quantitation of proteins, 
a large number of strategies have been developed over the last decade. In the beginning of the 
proteomic field (prior to 2000) most of the proteomics work was performed using two-dimensional gel 
electrophoresis (2DE) to separate proteins from each other (see more details in Chapter 4.1) and then 
combine this technique with a sensitive way to identify the proteins in thegels. The latter was initially 
6 
 
performed using technologies like traditional Edman amino acid sequencing and amino acid analysis 
combined with sophisticated database search software programs. However, these methods are not very 
sensitive, with detection limits in the low pico-mole area at best, and they are therefore not able to 
identify proteins present in small amount on the gel. Thus, only very abundant proteins could be 
identified. 
With the introduction of very robust biological mass spectrometry (MS) in the late 1990 it became 
possible to use MS to identify and quantify proteins from both gels and solution samples. Today most 
protein identification and quantitation in proteomics is performed using MS technologies. 
 
 
 
 
 
 
 
 
 
 
 
Figure 1.2: Illustration of the process from gene to gene product. The gene is transcribed into a 
primary RNA product by protein complexes and thereafter the introns are spliced out using 
specific enzymes giving mRNA molecules. The latter process can in many eukaryotic cells take 
place in different ways giving raise to many mRNA molecules from a single gene. This process 
is called alternative splicing. After translation of the mRNA the proteins can become modified 
 
7 
 
by various chemical groups (post translational modifications). The number of gene products can 
be several thousands in a single cell. 
Traditionally proteomics can be divided into three main areas: expression proteomics, modification 
specific proteomics (modificomics) and interactomics. 
Expression proteomics involves the identification and quantitative analysis of proteins at their 
expression level. This means identification of which proteins are present in a given proteome at a given 
time and in want amount relative to the other proteins. The classical approach to address these 
questions is by a comparative gel-based proteomics strategy, see Figure 1.3. Two or more proteomes 
can be compared by two-dimensional gel electrophoresis (2DE) which separate proteins according to 
isoelectric point and molecular weight, and the individual spots, representing individual protein 
isoforms, can be relatively quantified by spot density, using specialised imaging software. The 
individual spots are excised from the gel and the protein is in-gel digested by a proteolytic enzyme. 
Peptides are subsequently extracted from the gel piece and they are analyzed by MS to obtain very 
accurate masses of each peptide. The mass of the peptides can then be used as a fingerprint to identify 
the protein in protein sequence databases (Shevchenko et al. 1996). The 2DE gel-based approach is 
very laborious and have several inherent shortcomings, such as decreased analysis of low and high 
molecular weight proteins, hydrophobic proteins and various modified proteins (Gygi et al. 2000). 
However, specialized buffers and various techniques can overcome some of these problems, including 
the application of very strong detergents in the IEF loading buffer and specialized gels. 
The second strategy is MS-based and is frequently referred to as liquid chromatography (LC)–MS. This 
technique is becoming increasingly popular due to the significantly lower amount of material used for 
the analysis and the direct sequencing of peptides for identification. Here the proteins in a solution are 
processed into peptides using specific proteases and then the peptides are sequenced and identified by 
MS techniques and bioinformatics. However, the main limitation of LC-MS is the huge amount of 
peptides that can be present in a complex protein mixture such as a whole cell lysate. For example if we 
look at the yeast proteome it is expected to include about 6300 proteins which after proteolysis will 
result in > 350.000 peptides. Despite days of LC-MS analysis of the yeast proteome only a subset of 
the expressed proteins has until today been identified with this approach. If we then look at more 
8 
 
complex eukaryotes like human the picture get even more complex. If we only had 23000 gene 
products in human cells we will after processing using a specific protease end up with > 2 million 
peptides which had to be sequenced by LC-MSMS. This is a significant challenge to any analytical 
technique and therefore not readily achievable with the present technology. 
 
 
 
 
 
 
 
 
 
 
 
 
Figure 3: Strategies for expression proteomics. Expression proteomics can be divided into a 
gel-based strategy and a MS driven strategy. See text for further details. 
For the quantification of proteins in LC-MS approaches, a chemical reagent can be introduced onto an 
amino acid functional group (e.g., primary amine), either prior to enzymatic processing or after (see 
later under quantification). Briefly, if two samples are to be compared, the first sample is labelled with 
a light isotopic version of the chemical reagent, and the second sample is labelled with a heavy isotope 
of the same chemical regent. The labels can be introduced on the protein or peptide level depending on 
 
9 
 
the individual strategy. After the labelling and processing of the samples, they are consolidated, and the 
peptides are analysed by multi-dimensional LC/LC-MS and MS/MS to facilitate protein identification 
and quantification. New technology and advances in instrumentation with increased resolution have 
improved this strategy, and it is now becoming the dominating workflow in proteomics. 
 “Modificomics” deals with large-scale identification, characterization and quantification of PTMs on 
proteins in the cell. Virtually all cellular processes are tightly regulated by proteins with distinct 
functions. One of the fastest ways to alter the function of a protein is by PTMs. These can modulate 
many properties: conformational state, stability, activity, interaction partners and subcellular 
distribution. Many of the control mechanisms involved in the dynamic regulation of cellular expression 
patterns in health and disease are dependent on specific PTMs within key proteins. Therefore, the 
determination and quantitative assessment of PTMs in proteins is fundamentally important for 
elucidation of the complex processes that govern cellular events such as cell growth, division, 
differentiation and migration. Such events are implicated in disease development and progression. 
Several new and exciting protein chemistry tools have been developed to be able to identify and 
characterise PTMs, which are typically present on low abundant proteins in the cell, especially if there 
is a low stoichiometry (low occupancy of the modified site). Therefore it can be very difficult to 
characterise modified peptides together with unmodified un-modified peptides. The development of 
methods to enrich and analyse different PTM sub-proteomes i.e. the phosphoproteome, by selective 
affinity methods like immobilised metal affinity chromatography (IMAC) (Neville et al. 1997) and 
titanium dioxide (TiO2) chromatography (Pinkse et al. 2004; Larsen et al. 2005) has greatly improved 
the success and analytical depth of these workflows. Modificomics has significantly benefited from 
instrumental improvements towards sensitivity and analytical speed combined with specialised peptide 
fragmentation capabilities, e.g., MSn, multistage activation (MSA) (Schroeder et al. 2004) and electron 
transfer dissociation (ETD) (Schroeder, M. J., et al., 2005 Journal of Proteome Research 4, 1832-41). 
This improves the success of sequencing modified peptides, e.g. phosphopeptides, which otherwise 
show an all-dominating neutral loss of phosphoric acid, which makes traditional sequencing analysis 
difficult. 
10 
 
“Interactomics”involves elucidation of how proteins interact to form large functional complexes and 
characterization of their dynamical and structural properties. Protein activity is greatly affected by 
protein folding and protein-protein interactions which cannot be directly addressed in the classical 
expression proteomics. Thus it is essential to obtain a fundamental understanding of the function of 
proteins and protein complexes at the molecular level. For this purpose nuclear magnetic resonance 
(NMR), x-ray crystallography and hydrogen-deuterium exchange (1H/2H) combined with MS (HX/MS) 
are three complementary and very powerful techniques to investigate the native dynamic and structural 
properties of proteins in solution. In addition, in order to characterize protein complexes several affinity 
purification strategies has been developed to specifically purify whole complexes and study the 
different subunits using MS approaches. 
 
 
11 
 
2. Biological Mass Spectrometry 
Almost a century ago Thomson developed the very first mass spectrometer (Thomson, 1913) and in 
1918 Arthur Jeffrey Dempster made the first modern mass spectrometer. They were able to accurately 
measure the natural occurring elements and their isotopes. The first attempts to analyse peptides and 
proteins using mass spectrometry date back to the mid-1960’s where electron ionisation (EI) and gas 
chromatograph (GC) mass spectrometry were used (Barber et al. 1965; Biemann et al. 1966). The 
major drawback was that only volatile samples could be ionised and subsequently analysed by a mass 
spectrometry, limiting the technology to small apolar molecules. By contrast, peptides and proteins are 
rather large polar molecules that are not easily volatized and undergo fragmentation upon conventional 
thermal heating. In order to get large polar molecules volatile they had to be derivatized. Even though 
chemical derivatization (CI) techniques allowed a limited number of small biomolecules to be analysed 
by increasing analyte volatility, sensitivity was poor (Kiryushkin et al. 1971). At this time mass 
spectrometry was considered of little use for biological applications. But the situation was improved 
with the introduction of field desorption (FD) allowing un-derivatized peptides and small proteins to be 
analysed (Winkler et al. 1972). However, sensitivity was low, and the technique was demanding and 
required an experienced operator. A milestone was reached with the introduction of plasma desorption 
(PD) (Torgerson et al. 1974) and fast atom bombardment (FAB) (Barber et al. 1981) mass spectrometry 
in the early 1980’s making mass spectrometry a viable tool for the analysis of larger and thermally 
labile biomolecules such as peptides and proteins. The decisive breakthrough in the analysis of larger 
proteins and peptides came with the development of MALDI by Michael Karas and Franz Hillenkamp 
(Karas et al. 1987; Karas et al. 1988) and Electrospray ionisation by John B. Fenn (Fenn et al. 1989). 
These soft ionisation techniques have revolutionized protein mass spectrometry by allowing high-
molecular-weight compounds to be analysed as intact molecular ions with very little fragmentation. 
The scientific impact of these achievements was acknowledged in 2002 and awarded the Nobel Prize in 
Chemistry. The prize was shared between scientists working with MS and nuclear magnetic resonance 
(NMR). Koichi Tanaka and John B. Fenn shared the price in MS for the development of soft laser 
desorption (SLD) and ESI, respectively. However, this is very controversial and many scientists in the 
MS world believe that Michael Karas and Franz Hillenkamp should have been awarded the Nobel prise 
for the development of MALDI instead of Koichi Tanaka. They described the concept of matrix-
12 
 
assisted laser desorption (MALD) (Karas et al. 1985) and later the now widely used solid-phase 
MALDI technique used in modern MALDI mass spectrometers (Karas et al. 1988). The soft laser 
desorption (SLD) method developed by Koichi Tanaka was based on a liquid matrix consisting of a 
mixture of analyte and glycerol with ultra fine cobalt particles as a matrix (Tanaka et al. 1988). This 
technique has not been used in the field of protein mass spectrometry. MALDI quickly became the 
dominating ionisation technique in the field of proteomics together with ESI. 
2.1. Matrix-assisted Laser Desorption/Ionisation 
The early experiments with laser desorption date back to 1960 (Vastola et al. 1970), and the first 
systematic attempts to generate ions of organic molecules with lasers date to the early 1970s 
(Posthumus et al. 1978; Kupka et al. 1980). However, all experiments on laser desorption of organic 
ions revealed an upper limit of approximately 1000 Da for biopolymers that could be desorped as intact 
ions. However, this limit was dependent on the molecular structure and laser parameters, also making 
reproducibility very difficult. The situation changed dramatically with the development of MALDI by 
Hillenkamp and Karas in 1985-1988 (Karas et al. 1985; Karas et al. 1987; Karas et al. 1988). This 
method increased sensitivity, and because the process is independent of the absorption properties and 
size of the compound being analysed, it allows very big molecules to be desorped, ionised and 
subsequently analysed. 
 
Figure 2.1: Ion formation by MALDI. 
Illustration of the principle of MALDI 
from irradiation to ion formation by the 
gas-phase proton transfer model. For 
further description of the MALDI process 
see the text. 
 
 
13 
 
The first step in MALDI is to create a “solid solution” deposit of analyte-doped matrix crystals, where 
the analyte molecules are embedded throughout a matrix so they are completely isolated from one 
another. Matrix is normally a small acidic molecule. In doing so, the matrix is believed to serve two 
major functions: 1) absorption of energy from the laser pulse, which minimizes sample damage by 
absorbing most of the incident energy and increases the efficiency of energy transfer from the laser to 
the analyte, and 2) isolation of the analyte molecules from each other preventing their interaction – 
molecular aggregation (Hillenkamp et al. 1991). When the analyte-doped matrix crystals are irradiated 
by a short laser pulse, it induces a rapid heating of the crystals by the accumulation of a large amount 
of energy through excitation of the matrix molecules. The rapid heating causes a local sublimation of 
the matrix crystals (like a local explotion in vacuum) and an expansion of the matrix into the gas phase, 
bringing intact analyte molecules into an expanding matrix plume (Dreisewerd 2003). An illustration of 
the MALDI process is shown in Figure 2.1. 
Only little internal energy is transferred to the analyte molecules and they may be cooled during the 
expansion process. Ionisation of the analyte molecules can occur at any time during this process, and 
there are several theories trying to explain the process, reviewed in (Zenobi et al. 1998; Dreisewerd 
2003; Karas et al. 2003; Knochenmuss et al. 2003). One of the most widely accepted theories on the 
ion formation mechanism involves a gas-phase proton transfer in the expanding plume with photo-
ionised matrix molecules (Zenobi et al. 1998), see Figure 2.1. 
Several different matrices are used in MALDI MS and there are numerous different sample preparation 
techniques. Some of the most commonly used matrices are listed in Table 1 and their chemical 
structures are shown in Figure 2.2. 
Matrix Chemical name Application 
-cyano (HCCA) -Cyano-4-hydroxycinnamic 
acid 
Peptides, proteins and organic 
compounds 
Sinapic acid (SA) 3,5-Dimethoxy-4-
hydroxycinnamic acidHigher mass biopolymers 
Gentisic acid (DHB) 2,5-Dihydroxybenzoic acid Peptides, proteins and 
14 
 
 carbonhydrates 
PA Picolinic acid Oligonucleotides 
HPA or 3HPA 3-Hydroxypicolinic acid Oligonucleotides 
Trihydroxyacetophenone 2,4,6-Trihydroxyacetophenone Oligonucleotides, peptides 
ATT 6-Aza-2-thiothymine Oligonucleotides 
 
Table 1: The matrices most commonly used for MALDI along with their chemical names and 
applications are listed in this table. 
COOH
OH
HO
DHB
HO
COOH
CN
4-Hydroxy--cyanocinnamic 
acid (4HCCA)
HO
OCH3
CH3O COOH
Sinapinic acid (SA)
COOH
N
Picolinic acid (PA)
COOH
N
HO
3-Hydroxypicolinic acid (3HPA)
OH
OHHO
CH3
2,4,6.Trihydroxy 
acetophenone
N
N
HN
HS
O
CH3
6-Aza-2-thiothymine (ATT)
O
 
Figure 2.2: The chemical structures of the matrices listed in Table 1. 
Matrices are classified as either “hot” or “cold” matrices depending on the degree of fragmentation 
they cause to the sample upon laser desorption. DHB does not give rise to much fragmentation and is 
therefore considered to be a cold matrix, whereas -Cyano-4-hydroxycinnamic acid is a hot matrix. 
Several matrix additives have been shown to improve the efficiency of the ionization of difficult 
analyte molecules in MALDI MS. When analyzing phosphopeptides by MALDI MS the addition of the 
15 
 
mineral acid ortho-phosphoric acid (H3PO4, PA) to the DHB matrix has shown to improve the signal of 
phosphopeptides in particularly for multiphosphorylated peptides (Kjellstrom and Jensen, 2004). In 
addition, ammonium citrate has been shown to improve signals from phosphopeptides/proteins in 
MALDI MS (Asara and Allison 1999). The exact mechanisms behind the addition of additives are not 
known, however it is believed that additive can change the physionchemical properties or both the 
matrix or the analyte molecules and thereby change solubility and incorboration in crystals. 
MALDI can practically be coupled to any mass analyzer, but it is typically combined with time-of-
flight (TOF) analyzers. MALDI is an ion source that generates pulsed ion packages due to the laser 
desorption process and is for that reason very compatible with TOF mass analyzers and mass analyzers 
that allow the storage of ions such as ion trap and FT instruments (see later). 
Some of the obvious advantages using MALDI MS are the rapid sample preparation and analysis (one 
analysis can take less than a minute). It is a very sensitive method and the technique is suited for 
analysis of heterogeneous and semi-complex samples. Besides this, MALDI MS has a high tolerance 
towards salt and other low molecular contaminants making it ideal for analysis of peptides or proteins 
from biological samples. One disadvantage of MALDI is that the use of matrix gives rise to matrix 
background in the spectra below 1000 Da, which can suppress the signals from compounds in the low-
molecular-weight range. This interference varies between different matrices. DHB is good for small 
molecules and peptides (200-1000 Da), since it creates minimal amount of interference in this area. 
Another disadvantage with MALDI is that sequence information by ion fragmentation using this ion 
source relies on singly charged ions, which require much more energy (Hoffmann and Stroobant, 2001) 
and is significantly dependent on the amino acid sequence of the peptide to be analyzed. 
Recently, MALDI has been coupled with offline HPLC using robots, which deposit small fraction of 
the eluted peptides directly onto the MALDI target (Mirgorodskaya, et al., 2005). This has some 
advantages over HPLC coupled to ESI (see later), since the entire sample is not used for spraying, 
however, this is not routine in many laboratories. 
 
 
16 
 
2.2. Electrospray ionisation 
Electrospray ionisation (ESI) is a versatile and commonly used method for the production of gas phase 
ions from a liquid. The ion source operates at atmospheric pressure and produces a continuous flow of 
quasimolecular ions ideal for several state-of-the-art MS configurations. The pioneering experiments in 
1917 on Electrospray ionisation (ESI) by the physicist John Zeleny precede the first description of the 
process by Malcolm Dole in 1968 (Dole et al. 1968). The well-defined breakthrough and introduction 
of electrospray ionisation into a useful technique for mass spectrometry of large biomolecules was 
presented by John Fenn in 1988 at the 36th ASMS conference in San Francisco and published in 1989 
(Fenn et al. 1989). Later it was optimised into nanoflow ESI (nESI) with increased sensitivity by Wilm 
and Mann (Wilm et al. 1996). The nESI source is now routinely used in front of most state-of-the-art 
MS configurations and is well suited for solvent based protein and peptide separation techniques like 
high pressure liquid chromatography (HPLC) and nanoflow HPLC ideal for high throughput nano 
HPLC-MS applications. 
Electrospray ionisation works by introducing the sample solution into the source region of the mass 
spectrometer through a small steel or glass capillary. By applying a high potential (±1700-5000 v) 
between the capillary and the counter electrode (orifice), a strong electric field is created. This induces 
a charge accumulation at the liquid surface at the tip of the capillary (Kebarle 2000), see Figure 2.3. 
The ions will be drawn from the capillary and attracted towards the counter electrode. However, the 
surface tension of the liquid provides an opposing force holding the liquid together which results in the 
formation of a Taylor cone at the tip of the capillary. As the charge density builds up at the tip of the 
Tayler cone, the repulsive electrostatic forces will overcome the surface tension and produce a jet of 
charged droplets, a process known as budding. The charged microdroplets undergo evaporation as they 
progress down a pressure gradient in the ion source reducing their size. Consequently the charge 
density builds up until the Rayleigh limit is reached. At this point the accumulated repulsive coulombic 
forces exceed the surface tension of the microdroplet and a coulomb explosion (droplet fission) occurs. 
The charge to mass ratio (of droplet) is increased in the now smaller daughter droplets relative to the 
parent droplet. However, the increased charge at the surface is balanced by the increased surface-to-
17 
 
volume ratio of the daughter droplet. Upon several consecutive rounds of coulombic fission very small 
charged droplets are formed (Kebarle 2000). 
There are two major competing theories for the final ionisation step of the analyte molecules. The 
charged residue model (CRM) first suggested by Dole et al. (Dole et al. 1968) and the ion evaporation 
mechanism (IEM) based upon the ideas of Iribarne and Thompson (Iribarne et al. 1976; Thomson et al. 
1979). Both are illustrated in Figure 2.4. The CRM model suggests that charged droplets undergo 
several consecutive rounds of coulombic fission and eventually produce droplets containing a single 
analyte molecule. Upon complete solvent evaporation, the molecule is left with the charge carried by 
this droplet (Schmelzeisen-Redeker et al. 1989). The IEM model suggests that as the droplet reaches a 
certain radius the ions are desorbed directly out of the droplet by the strong electric field at the surface 
of the droplet (Thomson et al. 1979). It is most likely a mixture of the two ionisation mechanisms that 
account for the ionisation in ESI. 
 
 
 
 
 
 
 
 
Figure 2.3. Ionisation of proteins and peptides by electrospray ionisation. At the tip of the 
capillary/needle, the liquid forms a Taylor cone. At the tip, small droplets arebudding off, 
forming small charged droplets. As the liquid evaporates the droplets undergo several rounds of 
coulombic fission and eventually form individual analyte ions on the gas phase. See the text for 
further details. Adapted and modified from (Hoffmann et al. 2001) 
18 
 
 
 
 
 
 
 
 
 
 
Figure 2.4. Figure illustrating the two different theories for generating molecular ions in ESI. 
ESI normally produces multiply charged ions, and this allows molecules that have masses exceeding 
100 kDa to be analysed by mass spectrometers with low mass/charge (m/z) range analysers. A standard 
quadrupole analyser has an upper m/z limit of approximately 4000 (Hoffmann et al. 2001). In the 
majority of MS driven proteomic workflows, peptides of approximately 600 – 3500 Da are analysed 
producing mainly 2+, 3+ and 4+ ions with ESI. These multi-charged ions are well suited for collision-
induced dissociation (CID) which is more efficient and requires less collision energy compared to CID 
of singly charged ion (Wysocki et al. 2000). The same is true for electron capture dissociation (ECD) 
and electron transfer dissociation (ETD) where a charge state of 3+ and 4+ is desirable for optimal 
fragmentation efficiency (Zubarev et al. 1998; Mikesh et al. 2006)(see later). 
In positive ion mode the analyte molecules are sprayed at low pH to improve the formation of positive 
ions, whereas higher pH conditions are used for negative ion mode. Proteins and peptides are usually 
analyzed using positive ion mode, whereas oligonucleotides and oligosaccharides are analyzed using 
negative ion mode. The number of positive charges obtained by a molecule is related to the number of 
basic sites on the molecule, however, posttranslational modifications also influence this. 
 
19 
 
In the electrospray ionization process, neutral species, ions and clusters of ions with neutrals are 
formed. The ions need to be separated from the neutrals to aid desolvation before entering the mass 
spectrometer. This can be accomplished simply by positioning the capillary slightly off-axis from the 
entrance to the mass analyzer, since the outer regions of the spray consists mainly of smaller, lighter 
and more desolvated droplets (Cech and Enke, 2001). Another important problem is the cooling created 
by the analyte and the solvent adiabatic expansion, which produces ion clusters. This is prevented by 
the application of a heated, dry nitrogen counter-current or gas curtain. 
There are several advantages using electrospray as an ion source. As mentioned above multiply 
charged ions are produced, which makes the analysis of high-mass ions by mass analyzers with relative 
low m/z range possible, in addition to giving a better mass accuracy. In addition, fragmentation of 
multiply charged peptides requires less energy and is therefore easier. No matrix is needed so no 
signals from matrix will interfere with the analyte signals and in principle electrospray is unlimited in 
mass range (Siuzdak, 1996). However, multiply charged ions are a disadvantage for low resolution 
instruments such as ion traps as no charge stages can be deduced. In addition, the ESI system has a low 
salt tolerance and the purity of the sample needs to be fairly high. Compared to the MALDI techniques 
there is a great loss of sample. The electrospray ion source is compatible with practically any type of 
mass spectrometers. Most commonly used instruments are the quadrupole and the ion trap (Mann and 
Wilm, 1995). The coupling of ESI with LC (LC-ESI) have become fully automated as opposed to LC-
MALDI, which is only semi-automated. 
 
2.3 Mass analyzers used in modern biological mass spectrometry 
Mass spectrometers do not actually measure mass, but rather the mass-to-charge (m/z) ratio of ions, 
expressed in units of Thomson (Th) or atomic mass units (u) for singly charged ions (z = 1). As there 
are many different ion sources, there are a variety of mass analyzers, each working by different or 
similar mechanisms in order to determine the m/z ratio of ions. Some mass analyzers scan the ions 
formed in the ion source, while others have the capability to isolate individual ions due to their m/z 
ratio. 
20 
 
Four important factors come into play, when describing mass analyzers. 
1. The upper mass limit, 
2. The transmission, 
3. The resolution (R) 
4. Mass accuracy. 
The upper mass limit is the highest possible m/z ratio that the particular instrument is capable of 
measuring. The transmission describes the number of ions reaching the detector compared to those 
produced by the ion source. A low transmission indicates a substantial loss of ions from the ion source 
to the detector and results in a decrease in sensitivity. Normally there is a significant loss of ions from 
the ion source to the mass analyzer in any mass spectrometer. The resolution is a term for how well the 
instrument is able to distinguish between signals from two ions of similar m/z ratios. 
The definition of the resolution R is given by the equation: 
 R = m/m 
Where m is the smallest mass difference between two peaks with masses m and m + m, respectively. 
The resolution can also be determined from one peak with mass m as the peak width m at x% of the 
peak height, usually being 50%, designating m as the full width at half-maximum (FWHM) (Figure 
2.5). 
 
 
 
 
 
 
In
te
ns
it
y
m/z
m
x %
2x %
100 %
m
 
21 
 
Figure 2.5. Relationship between two definitions of resolution. This figure was adapted from 
Hoffman and Stroobant, 2001. 
Finally the fourth and very important factor is the mass accuracy. This is a measurement of how well a 
measured m/z ratio correlates with the real (theoretical) ratio. The mass accuracy is highly dependent 
on how well the instrument is calibrated and is therefore somewhat variable (Hoffmann and Stroobant, 
2001). The data can be recalibrated after the analysis if necessary, which will improve the mass 
accuracy. The mass accuracy is also highly influenced by the resolution. The higher the resolution, the 
better the separation of the isotopes and the more accurate peak assignment can be performed and 
consequently better mass accuracy is achieved (Figure 2.6). 
The contributions of these four factors determine the performance and sensitivity of the individual MS 
instruments. 
 
 
 
 
 
 
 
Figure 2.5: Resolution. The three sprectra were calculated for the same molecular formula at 
resolution (R) 200, 2000, and infinity () using the 10% ratio definition (10% above baseline). 
The higher the resolution is, the narrower the peaks will be. This figure was adapted from 
Siuzdak, 1996. 
 
 
R
el
at
iv
e 
in
te
ns
ity
(%
)
R = 200
R = ∞
R = 2000
m/z
 
22 
 
The Quadrupole (Q) 
The principle of the quadrupole was first described by Paul and Steinwegen in 1953 (Paul and 
Steinwegen, 1953). A quadrupole consists of four parallel rods with a circular or hyperbolic section 
(Figure 2.7). A direct current (fixed DC) and a superimposed radio-frequency (alternating RF) 
potential are applied to the rods. 
 
 
 
 
 
 
 
 
 
 
 
 
 
Figure 2.7. Skematic illustration of the principle of the quadrupole. This figure was adapted 
from Skelkton and Gates, 2000. (http://www-methods.ch.cam.ac.uk/meth/ms/theory/quadrupole.html) 
Ions enter the quadrupole in the space between the four rods and travel along the z-axis. At the same 
time ions move circularly in the x-y plan due to the alternating electric field. The opposing rods have 
the same charge, whereas the rods next to each other haveopposite charges (Figure 2.7). Positive ions 
are drawn to the negatively charged rods (- 0). Due to a change of the potential polarity, the ions are 
Nonresonant ion
Resonant ion (detected)
Quadropole rods
Source slits
(entrance to the quadropole)
To detector
Analyte ions 
(with different m/z ratios)
X
Z
Y
- 0
+ 0
r0
 
23 
 
repelled by the rods and change direction before they touch the rods and discharge themselves, which 
would prevent their detection. Instead, they are positioned in the center of the quadrupole. 
The stability of the trajectory of a given ion depends on its m/z ratio. Only ions that are stable will be 
able to pass through the quadrupole. This means that by changing the DC/RF potentials on the rods, the 
quadrupole can select ions of a particular m/z ratio. At a given time only ions of this particular m/z ratio 
are let through the quadrupole and will be scanned by the detector, whereas ions with other m/z ratio 
are unstable and will discharge themselves against the rod surfaces. This relation gives rise to the so-
called stability areas, in which ions are successfully scanned (Figure 2.8). 
 
 
 
 
 
 
 
 
Figure 2.8: The stability areas are created as a function of DC voltage (U) and RF voltage (V) 
for ions with different masses (m1 ‹ m2 ‹ m3). Making U as a linear function of V results in a 
straight line (scan line) intersecting at (0;0). Choosing U and V ratios from this line will give 
good scan of the individual masses (m). Choosing U and V ratios below the scan line will result 
in an overlap of different m ratios, decreasing the resolution. This figure was adapted from 
Hoffmann and Stroobant, 2001. 
 
As illustrated on the figure, keeping U/V constant allows for the detection of several different masses 
(along the scan line). The higher the slope of the line, while still intersecting a stability area, the better 
U
(D
C
 V
ol
ta
ge
)
V (RF Voltage)
m1
m2
m3
Unstable area
Unstable
area
(constant DC/RF)
Scan line
 
24 
 
the resolution. At U = 0, when no direct potential is applied, the resolution is zero. However, ions are 
still transmitted as long as V is within a stability area. The more the instruments are operated at the 
peaks of the figure, the better the resolution will be, but there will be some overlap between the 
different masses below the scan line and therfore the resolution will decrease. The mass range and 
ultimate resolution of a quadrupole instrument depends on the length and diameter of the rods. 
However, the mass range for common quadrupole instruments is about 4000 Th but can be extended up 
to 8000 Th and their resolution operating in normal scan mode is only approaching 3000, classifying 
them as low-resolution instruments (Hoffmann and Stroobant, 2001). An iontrap can achieve a very 
high resolution if operated in enhanced or zoom scan mode, however with a significant drop in speed. 
Quadrupole instruments can be used in tandem mode as well, where instruments are connected linearly. 
Often the quadruple instruments are coupled to a time-of-flight instruments (See section on QqTOF) or 
several quadrupoles are connected (triple quadrupole (QqQ)). 
 
Time-Of-Flight (TOF) 
Time-of-Flight mass spectrometry is one of the simplest methods of mass measurement although there 
are hidden complexities when it comes to higher resolution applications. The idea of linear time-of-
flight (TOF) mass spectrometry was introduced by Stephens in 1946 (Stephens, 1946) and some ten 
years later the design for the first TOF instrument was published by Wiley and McLaren (Wiley and 
McLaren, 1955). A TOF mass spectrometer consists of an ion source in which the ions are ionized and 
accelerated, a field-free flight tube in which the ions are separated according to their m/z ratio and a 
detector, in which the ions are detected (Figure 2.9). 
 
 
 
 
 
25 
 
 
 
 
 
 
 
 
 
Figure 2.9. Schematic presentation of the time-of-flight (TOF) principle using extraction of ion 
packages (MALDI). Ions are produced in the ion source and accelerated in a strong electric 
field before entering the field-free drift region. Finally, the ions reach the detector at different 
times depending on their velocities. Vacc being the acceleration potential for the ions and Ek the 
kinetic energy. 
After ionisation in the ion source, the ions are normally accelerated by a 20 to 25 kV potential (Vs), and 
obtain the kinetic energy of Ek = Vsq = mv
2 /2, where q is the charge of the ion, m is the mass of the ion 
and v is the velocity of the ion. Therefore the smaller ions will travel faster than the larger ions, and 
will reach the detector first. By measuring the flight time of the ions, the m/z can be calculated from t2 
= (m/z) k, where k = d2/(2Vse), d being the distance the ion travels and e the charge of an electron. 
There is no limit to the upper mass range of a TOF analyzer, which makes it highly compatible with 
soft ionization techniques where only molecular ions are produced. MALDI produced ions in pulses 
making it a good choice for coupling with TOF, since the TOF analyzer is a pulsed instrument 
requiring a well-defined starting time (point of initiation). TOF mass analyzers are also very sensitive 
instruments because of a high transmission efficiency due to the ability to detect ions over the whole 
m/z range at the same time without scanning. However, there are limitations to the simple construct of 
Detector
Positive ions
Analyzer Field-free Region
+20.kV
Ion Source
d
Laser beam
Ek = 0Ek = Vacc/dS
dS
Flight tube
sample
0.kV
 
26 
 
TOF, resulting in a poor resolution. This is due to different factors creating the distribution in flight 
times for ions of the same m/z ratio (Hoffmann and Stroobant, 2001). Some of which are: 
Kinetic energy distribution: In theory all ions are subjected to the same acceleration potential (Vacc) 
in the ion source, but in practice there will be variation in their initial kinetic energies. This will result 
in different velocities for ions of identical masses so these will not reach the detector at the same time. 
Time distribution: Variations of the length of the ion formation in the source. This leads to a minor 
time shift between different ions leaving the ion source, so these ions will not reach the detector at the 
same time. 
Space distribution: Variations of the spatial positions of identical ions in relation to the ground grid 
gives a difference in the energy acquired. The ion closest to the ground grid will not be in the energy 
field as long and therefore receive less energy. The ion with the highest kinetic energy will reach the 
detector first. 
Distribution of the ion direction: Identical ions with the same initial kinetic energy positioned at the 
same initial position, but with opposite directed directions will eventually both reach the detector, but 
not at the same time. 
Equipment: Electronics and especially the digitizers. 
The two-stage acceleration strategy (delayed extraction) developed by Wiley and McLaren in 1955 
corrects both initial velocity distribution and the initial spatial distribution whereas the ion mirror 
(reflector) developed in 1970’s by Mamyrin et al. corrects the initial energy distribution (Mamyrin et 
al. 1973; Mamyrin et al. 1979). 
Delayed extraction 
Delayed extraction (DE) reduces the kinetic energy distribution among ions with the same m/z ratio 
leaving the source by introducing a time lag between ion formation and ion extraction from the source, 
and at the same time it improves the pulsed beam definition(t = 0). First the ions are allowed to expand 
and separate according to their initial kinetic energy into a field-free region in the source. For ions of 
the same m/z ratio those with more initial energy will move further into the source and thereby be 
27 
 
closer to the last grid in the ion source than the less energetic ions. When the extraction pulse is applied 
after a certain delay, the less energetic ions will receive more energy because they experience a higher 
electric field than the faster and more energetic ions (see Figure 2.10). Thus, the initially less energetic 
ions receive more kinetic energy and join the initially more energetic ions at the detector. 
The ions are focused at the detector by adjusting the amplitude and delay of the pulsed extraction 
(Mamyrin et al. 1973; Mamyrin et al. 1979). Delayed extraction reduces post source decay (PSD)( 
fragmentation of the ions in the flight tube) by allowing the matrix and ionised molecules to expand 
into the source region prior to acceleration of the analyte ions. The plume density is thereby decreased, 
reducing collisional activation when the ions are accelerated into the flight tube (Kaufmann et al. 
1996). 
 
 
 
 
 
 
Figure 2.10: Delayed extraction. Schematic description of a continuous extraction mode and a 
delayed pulse extraction mode in a linear TOF mass analyser. Ions of a given mass with a correct 
energy are symbolized by ○ and ions with the same mass but to low a energy are symbolized by ●. 
Delayed pulse extraction corrects the energy dispersion of the ions leaving the source with the same 
m/z ratio. Adapted from (Hoffmann et al. 2001). 
 
 
 
 
28 
 
The reflector 
The reflector consists of a series of grids and ring electrodes, creating a retarding field that acts as an 
ion mirror by deflecting the ions and sending them back into the flight tube (Figure 2.11). It corrects 
the energy distribution of ions with the same m/z ratio because ions with higher kinetic energy will 
penetrate more deeply into the reflector than ions with a lower kinetic energy but having the same m/z 
value. The result is that ions with more kinetic energy spend longer time in the reflector than ions with 
less kinetic energy and is therefore overtaken by the slower moving ions. When the ions leave the 
reflector, the kinetic energy is restored to the same absolute value as before entering and the fast ions 
will catch up with the slow-moving ones, with the same m/z, at the detector (see Figure 2.11). 
(Mamyrin et al. 1973; Mamyrin et al. 1979). The optimal resolution is then achieved by placing the 
detector were the two ions catch up. 
 
 
 
 
 
 
 
Figure 2.11: The reflector. Schematic description of a TOF instrument equipped with a 
reflectron; (■) ions of a given mass with correct energy; (□) ions of the same mass but with a 
kinetic energy that is too low. The slower ions reach the reflectron later, but spend less time 
inside the reflector and overtake the faster ions (■). With properly chosen voltages, path lengths 
and fields, both kinds of ions reach the detector simultaneously. 
 
-3000 V
Source
Detector
-3000 V
X
Y
-3000 V +130 Vd/2 D
 
29 
 
Thus the reflector compensates for kinetic energy distribution of the ions whereas delayed extraction 
compensates for the initial time-, velocity- and spatial distribution of the ions created under the 
desorption process. The effect of delayed extraction and the reflector on mass resolution is illustrated in 
Figure 2.12. 
MALDI TOF instruments have due to delayed extraction and reflectron been able to achieve a much 
higher resolution (up to 20.000) (Mann and Talbo, 1996) resulting in an instrument, which has a good 
combination of resolution, sensitivity and fast response. One of the reasons that we are able to identify 
proteins based on Peptide Mass Fingerprinting (PMF, see later) today, is the introduction of delayed 
extraction and reflectons, which give a high mass accuracy. This lowers the level of false positives 
obtained by searching the data in the fast growing public protein sequence databases. (Jensen, et al., 
1996). 
 
 
 
 
 
 
 
 
 
 
Figure 2.12: Illustration of the improvement in resolution when the reflector and delayed 
extraction are used. The spectra were recorded on the 4700 proteomics analyzer from Applied 
Biosystems and recorded using continuous extraction, linear mode (+delayed extraction), 
 
30 
 
reflector mode (-delayed extraction) and reflector mode using both delayed extraction and the 
reflector mode. 
 
Ion Trap 
Paul and Steinwegen, who also invented the quadrupole in the 1950's, were the first to describe the 
principle of the ion trap (Paul and Steinwegen, 1960) as a slight modification of the quadrupole 
instrument. Both analyzers are based on ion motions in RF electric fields. The classical 3D ion trap, or 
Paul trap, consists of a circular electrode with two ellipsoid end caps (Figure 2.13). 
In principal an ion trap is a quadrupole in which its four rods have been bent around itself forming a 
closed loop. The inner rod forms the point at the center of the ion trap and the outer rod forms the 
circular electrode, whereas the two remaining rods form the two end caps. 
 
 
 
 
 
 
 
 
Figure 2.13: The composition of the ion trap mass analyzer. 
The ions produced in the ion source enter the trap through a small orifice in one of the endcaps (Figure 
2.14). An oscillating potential, 0, (also referred to as the fundamental RF, the sum of a direct and a 
alternative potential, see section on quadrupole) is applied to the endcaps and -0 is applied to the ring 
Ring electrode
End capEnd cap
 
31 
 
electrode. However, often 0 is only applied to the ring electrode. The potential creates a three-
dimensional quadrupolar field in which the ions are trapped (Hoffmann and Stroobant, 2001). The 
difference between the quadrupole and the ion trap is, that in the quadrupole only ions of a given m/z 
range are allowed to pass at a given time due to U and V conditions making the trajectory of these ions 
stable (see section on quadrupole). In the ion trap however, all ions of different m/z ratios are present 
and ions of a given m/z ratio are expelled from the trap at a given time by making the trajectory of these 
ions unstable (Glish and Vachet, 2003). The other ions are trapped within the system of the three 
electrodes (Figure 2.14A and B). This is due to the application of a resonant frequency along the z 
axis. Inside a quadrupole these ions would discharge themselves against the rods and be destroyed, but 
in an ion trap they repel each other, resulting in an expansion in their trajectories. The addition of 
helium gas molecules (10-3 Torr) in the ion trap absorbs any excess energy from the ions through 
collisional absorption, preventing ion losses. As with the quadrupole, the volume in which ions of a 
given m/z have stable trajectories, can be described mathematically. By sequentially making ion 
trajectories become unstable, ions are ejected from low m/z to high m/z through a small orifice in the 
opposite endcap where they are detected and used to reconstruct a mass spectrum (mass selective 
instability) (Glish, et al., 2003). 
 
 
 
 
 
 
 
 
Figure 2.14: The principle of the ion trap. The voltages applied to the system are depicted at 
C, but are present at all four times (A-D). A. Ions are trapped inside the three dimensional 
A B
DC
Scan out To 
detector
Top
Ring
Bottom
Ions in
Ions out
Trapped Selected ion
Ejected
Fragmentation
detector
electrodeelectrode
electrode
ions
ions
To 
product ion
V

detector
To 
 
32 
 
quadruple. B. An ion of a particular m/z ratio is selected. C-D. Additional steps in ion trap 
tandem ms (MS2). C. The selected ion is fragmented. D. The fragments are ejected from the ion 
trap. 
Ions can also be ejected from the ion trap by resonance ejection, where the use of higher excitation 
amplitudes increases the oscillation of a given ion resulting in its rejection from the ion trap. The 
combination of mass selective instability and resonance ejection enhances the resolution of the 
instrument (Goeringer, et al., 1992). 
Linear ion trap 
An alternative to the conventional 3D ion trap (Paul trap) is the axially-ejecting linear ion trap. The 
linear ion trap is operated as the Paul trap however this instrument is incorporated into a triple-
quadrupole instrument providing several advantages compared to the Paul trap. In the linear ion trap, a 
RF field applied to elongated electrodes is responsible for trapping the ions in the radial direction, 
whereas a static electric field on the ring-shaped electrodes at the two ends is responsible for trapping 
the ions in the axial direction. Due to a trapping stiffness in the radial direction, which is higher than in 
the axial direction, the ions will arrange themselves in a linear string in the z axis (http://heart-
c704.uibk.ac.at/linear_paul_trap.html; Hager, 2002). This improves the trapping ability of the ion trap. 
In addition, the linear ion trap is also larger than the regular ion trap providing a higher ion capacity. 
The selection of ions for fragmentation is performed by the use of mass-selective instability in 
combination with resonance ejection as described for tandem ion traps (see below). As the Paul trap, it 
can perform MSn due to the ability to store ions, while fragmenting others. In addition, the ability to 
operate as a quadrupole mass filter enables the linear ion trap to form hybrid instruments. 
One disadvantage of the iontraps is the so-called 28% rule. An iontrap is not readily able to select and 
fragment an ion and then scan out all the fragment ions. Due to the mass selective instability an iontrap 
is only able to efficiently scan out the masses beginning from 1/3 of the parant mass ion (28% rule). 
 
 
33 
 
2.4 Hybride and specialized MS instruments in proteomics 
QqTOF mass spectrometer 
In 1996 Morris and co-workers described the first Q-TOF tandem mass spectrometer. The construct 
was built by Morris to improve transmission and the efficiency to detect daughter ions, as well as to 
increase the signal-to-noise ratio. Furthermore, the aim of this construction was to improve resolution 
and mass accuracy, especially of fragment ions, in comparison with already known tandem mass 
spectrometers such as the triple quadrupole instruments (Morris, et al., 1996). They used a quadrupole 
coupled to an orthogonal-acceleration TOF instrument. The orthogonal injection was introduced in 
order to combine a continuous ion source (such as ESI) with a TOF instrument (Figure 2.15). 
O´Halloran and co-workers were the first to report of such an instrument (O´Halloran, et al., 1964). In 
orthogonal TOF, ions formed in the continuous ion source are injected along the y axis. In MS mode 
the ions travel through three quadrupoles operated in RF only mode in order to function as ion guides 
(Q0, Q1 and Q2). The first quadrupole (Q0) is an ion guide, where the ions gently collide with neutral 
gas molecules (collisional cooling or damping usually with at a pressure of 10 mTorr). This decelerates 
and focuses the ions by reducing the energy spread and the ion beam diameter giving a better 
transmission into the second and the third quadrupoles (Q1 and Q2) as well as the TOF analyzer. Q1 
and Q2 focus the ions further before they enter a field-free pusher region between a flat plate and a 
grid. This region is field-free, letting ions continue to move in their original direction in the gap. A 
voltage is then applied to the flat plate (a pusher) accelerating and thereby sending off an ion packet 
nearly in the orthogonal z direction into the flight tube. The ion packet is accelerated to several 
kiloelectronvolts per charge by a uniform DC electric field. 
The deceleration of the ions in the collision cell ensures that all ions of different m/z ratio reach the 
detector, and the pusher ensures that the ions reach the detector in an m/z dependent manner 
(Chernushevich, et al., 1999; Chernushevich, et al., 2001). Q-TOF instruments are able to isolate a 
single isotop for fragmentation, which reduces the complexity of the fragmentation spectra, when 
working with complex samples. 
 
34 
 
 
 
 
 
 
 
 
 
 
Figure 2.15: Schematic diagram of the tandem QqTOF. In MS mode the Q1 quadrupole is 
operated in RF mode only. The figure was adapted from Chernushevich, et al., 2001. 
LTQ Orbitrap XL mass spectrometer 
The orbitrap mass analyzer is with its commercial introduction in 2005 the newest type MS instrument 
in proteomic research (Hu, Q.Z., et al., Journal of Mass Spectrometry, 2005). It has a mass accuracy in 
the low- to sub-ppm range and an extremely high resolution – both comparable to the FT-ICR, which, 
like the orbitrap, also employs Fourier transformation for the calculation of ion m/z values. 
The LTQ-Orbitrap is cheaper than the LTQ-FT-ICR instrument. The running costs of the instrument 
are furthermore much lower due to the lack of the FT-ICR’s super-cooled magnet, making the 
acquisition and maintenance of the instrument in a proteomics lab much more feasible. The LTQ-
Orbitrap XL consists of a front part, which is basically a regular ESI-coupled linear trap quadropole 
(LTQ) and a back part including a so-called C-trap connected to an octopole (for higher energy 
collision induced dissociation – HCD) and the orbitrap (Figure 2.16a). 
Due to the high sensitivity, speed and MSn capabilities of the LTQ, it is the ideal partner for the 
orbitrap mass analyzer in the LTQ-Orbitrap XL instrument. The analytes are ionized in the ESI source 
Z
X
Y
 
35 
 
and guided to the LTQ through the ion optics. After manipulation in the LTQ, the ions are delivered to 
the C-trap. This is a C-shaped quadropole operating in RF mode only which functions to accumulate 
and store ions which are collisionally damped by a low nitrogen pressure before injection into the 
orbitrap by high voltage pulses (Makarov, A., et al., Anal Chem, 2006. 78(7): p. 2113-20.). 
The orbitrap is basically an ion trap, though the ions are not trapped by a radio frequency field electric 
like in a 3D ion trap or by a magnetic field as in a FT-IC. In this type of mass analyzer, the ions are 
attracted towards a central electrode (Figure 2.16a) using an electric field, an attraction 
counterbalanced by the centrifugal force arising from the initial tangential velocity of the ions 
originating from the ion injection into the orbitrap (Scigelova, M. and A. Makarov, Proteomics, 2006). 
The ions are furthermore confined by the two halves of the outer orbitrap electrode (Figure 2.16b). The 
forces imposed on the ions cause them to oscillate in complex spiral patterns not only around the inner 
electrode alone but also back and fourth in the axial direction (the z axis in Figure 2.16b). These axial 
oscillation frequencies are inversely proportional to the square root of the m/z ratios of the ions and are 
detected as an image current on the two halves of the outer electrode separated by an insulating ceramic 
ring (Figure 2.16b-c). Hereby, a single sine wave is generated by each ionic species in the orbitrap. By 
performing a Fourier transformation on the verycomplex waveforms originating from the different 
ions in the trap, the oscillation frequencies and thereby the m/z ratios can be extracted. The resolution 
of the spectra is dependent on the time of analysis, but a resolution of more than 100.000 can be readily 
achieved. 
The XL version of the LTQ-Orbitrap moreover contains a dedicated octopole collision cell coupled to 
the back-end of the C-trap (Figure 2.16a), enabling execution of high-energy collision induced 
dissociation (HCD). Hereby more extensive fragmentation than in regular, low-energy CID can be 
produced, also leading to other types of fragment ions (Olsen, J.V., et al., Nature Methods, 2007; Bean, 
M.F., et al., Anal Chem, 1991). Fragmentation of peptide ions using HCD is advantageous as the LTQ 
is incapable of efficiently trapping low-mass product ions as these cannot be stabilized in the trap – a 
phenomenon known as the “1/3 rule” indicating the loss of product ions with a mass below 1/3 of the 
precursor ion. HCD therefore facilitates the observation of the immonium ions of m/z 216.04 
indicating a phosphotyrosine or the reporter ions from the iTRAQ® reagent used for labeling of 
peptides in quantitative proteomic experiments (m/z 114-117). The intensity of the ions detected after 
36 
 
performing HCD, is nevertheless an order of magnitude lower than by CID in the LTQ, due to 
transmission loss. In addition, the number of ions required for the HCD is significant higher than for 
normal CID in the LTQ. Thus this is still leaving room for major improvements of the application of 
the technique in analysis of low-abundant and complex samples. 
Recently, the LTQ-orbitrap XL has been equipped with capabilities for performing electron transfer 
dissociation (ETD), a gentle fragmentation technique which allow efficient fragmentation of multi-
charged peptides and proteins (see later). 
The very high mass accuracy of the instrument (<5 parts per million (ppm) with external calibration) 
can be improved even further by internal calibration. The polycyclodimethylsiloxane polymer 
(445.120025 Da), which is always present in the ambient air, and is therefore an inevitable ion showing 
up in mass spectra from ESI instruments (Schlosser, A. and R. Volkmer-Engert, Journal of Mass 
Spectrometry, 2003), has been used for internal calibration in real-time, employing its mass as a so-
called lock-mass (Olsen, J.V., et al., Molecular & Cellular Proteomics, 2005). This method relies on 
filling the C-trap with a particular amount of polycyclodimethylsiloxane ions and scanning them along 
with the analyte ions. The average mass accuracy can thus be reduced to the sub-ppm level by 
correcting all measured masses by applying the same ppm deviations for all masses as for the lock-
mass. 
 
 
 
 
 
 
 
 
 
 
 
A)
B)
Orbitrap
HCD collision cell
37 
 
Figure 2.16: A) The LTQ-Orbitrap XL consists of an ion source, ion optics, and a linear ion 
trap connected to the C-trap through the front end, and with the HCD collision cell in the back 
end. From the C-trap, the ions are injected in to the orbitrap. B) The orbitrap mass analyzer. a) 
Central electrode b) Outer end-cap electrode c) Insulating ceramic ring.. 
 
Fourier transform Ion Cyclotron Resonance mass spectrometer (FT ICR) 
In 1974 Fourier transform (FT) MS was described by Comisarow and Marshall (Comisarow and 
Marshall, 1974a and b). The FT instrument consists of four components: The first component is a 
magnet creating a uniform high magnetic field of 4.7 – 13 T, the second component is an analyzer cell, 
where ions are stored, analyzed and detected. The analyzer cell can be a cubic cell consisting of six 
plates positioned inside the magnetic field (Figure 2.17). The third component of an FT instrument is 
an ultra-high vacuum system. Ions generated in the ion source are guided through compartments with a 
series of pumps to gradually lower the pressure before the ion enters the ICR cell. At this point pressure 
will be in the range of 10-13 to 10-14 Bar, and the temperature about absolute zero. The fourth 
component is a sophisticated data system (Amster, et al., 1996). 
As the ions generated in the ion source enter the analyzer cell, they are exposed to the magnetic field 
and to an electric field generated between the electrodes of the analyzer cell. This forces the ions into a 
circular motion (the cyclotron motion) in a plane perpendicular to the field (Figure 2.17). 
The cyclotron motion arises from the interaction of an ion with an unidirectional magnetic field and is 
due to the Lorentz Force: 
F = zv x B 
Where F is the Lorentz Force observed by the ion, when it enters the magnetic field, z the charge of the 
ion, v the velocity of the ion, and B the magnetic field strength (constant). x indicates the cross product 
and not the multiplication symbol. The electric field acts to force the ions away from the center of the 
analyzer cell, whereas the magnetic field prevents the ions from colliding with the walls of the analyzer 
cell. The frequency of this magnetron motion is m/z dependent: 
38 
 
c = zB / 2m 
c being the induced cyclotron frequency, and m the mass of the ion. By inducing a swept RF pulse 
across the excitation plates a packet of ions of a given m/z ratio is excited. By adjusting the pulse to 
have the same frequency as the cyclotron frequency of the ions, ions will absorb energy. This excitation 
results in an increase in the radius of the ion motion bringing them closer to the detector plates, where 
they induce an image current between the detector plates. The frequency of the image current is in 
resonance with the cyclotron frequency of ions of a particular m/z ratio, and the intensity is 
proportional to the number of ions. Ions of one specific m/z ratio are excited together and the image 
current is recorded and analysed by the data system. When the RF goes off resonance for that particular 
m/z ratio, the ions return to their initial motion (relax) and other ions can be excited. The RF sweep can 
also be adjusted to consider all frequencies simultaneously, resulting in a measurement of all the ions at 
the same time. 
This creates a complex frequency vs. time spectrum of all the signals. Deconvolution of these signals 
by FT methods results in a deconvoluted frequency vs. intensity spectrum, which can then be converted 
to a mass spectrum by the following equation: 
m/z = B / 2c 
 
 
 
 
 
 
 
 
Magnetic
Field, B
Trapping
plates
Detector
plates
Excitation on  one frequency (RF) excites one m/z
convoluted
frequency
spectrum
frequency
deconvoluted
spectrum
Mass
spectrum
Induced alternating
currentExcitation
plates
RF
FT MC
Ion 
Analyzer cell
 
39 
 
Figure 2.17. The principle of FT ICR mass spectrometry. 
 
The FT ICR is a powerful instrument with many possibilities. One great advantage of FT MS is the 
ion-trap nature of the instrument, which allows it to measure ions without destroying them. This makes 
it possible to perform subsequent fragmentation or perform other studies on the ions. Another 
dimension to the use of FT ICR mass spectrometry was the introduction of Electron Capture 
Dissociation (ECD), where charge-mediated fragmentation is produced. For proteins, this 
fragmentation occurs solely in the backbone, sparing the side chains, which make it an efficient method 
for the site-specific analyses of labile posttranslational modifications that are usually lost by traditional 
fragmentation (Sze et al., 2002). Proteins containing cysteine residues are an exception, due to their 
high radical affinity. ECD also enables the MSn analysis ofentire proteins and their larger fragments 
(“the top down” approach) as apposed to the traditional analysis of low molecular weight proteolysis 
products (“the bottom up” approach) (McLafferty et al., 2001) 
The FT instrument can also be combined with electrospray ionization giving high throughput 
measurements and also the possibility of coupling with separation techniques such as high-performance 
liquid chromatography (HPLC) (Schmid, et al., 2000). 
 
Linear ion trap- Fourier Transform (LTQ-FT) 
Prior to the development of the LTQ-orbitrap the most sophisticated and accurate MS instrument for 
performing protein identification ad quantitation was the LTQ-FT instrument which shares some of the 
characteristic and features as the LTQ-orbitrap. The difference is the speed of the analysis and the cost 
as the orbitrap do not need to be cooled by liquid helium which is a significant cost for the FT 
instruments. An illustration of the LTQ-FT instrument is shown in Figure 2.17. 
 
 
 
40 
 
 
 
 
 
 
 
 
 
 
Figure 2.17. The construct of the Finnigan LTQ-FT instrument. 
 
2.5 Mass spectrometric sequencing of peptides – tandem MS (MS/MS) 
MS driven proteomics is based on the ability to measure the molecular weight of proteins and peptides 
and to select these for fragmentation by MS/MS to obtain detailed sequence information. In order to 
retrieve structural information on peptides/proteins using MS, the peptide/protein needs to be 
fragmented into smaller parts that reflect the structure and modifications of the original peptide/protein. 
The m/z ratio of these fragments can then be compared to known m/z ratio of particular chemical 
structures stored in databases. This can most readily be achieved if the sequence of the peptide/protein 
is known prior to the analysis from for example genome sequencing. If the sequence of the protein is 
not known one can, based on the sequence information obtained from the tandem MS sequencing, 
search for homologe protein sequences in a database (using e.g., BLAST search), or generate enough 
sequence for designing primers for sequencing the particular protein – a procedure which is becoming 
very fast and in-expenseable. 
210L/sec 210L/sec60 m3/hr
Triple Ported Turbo
400L/sec300L/sec25L/sec
x
y
z
7 T Actively Shielded
Superconducting
Magnet
ICR Cell
2D Ion Trap
 
41 
 
 Fragmentation of peptides/proteins is achieved by transferring extra energy to the stable ions formed 
during ionization (see below). Extensive fragmentation can also be induced during ionization by the 
use of highly energetic ionization techniques. There are many different ways to nduce fragmentation of 
the peptide, e.g., collision induced dissociation/collision activated dissociation (CID/CAD), post source 
decay (PSD), Electron transfer dissociation (ETD), etc. Below is described the basic nomenclature of 
peptide fragmentation and the basis for some of these fragmentation techniques. 
 
Nomenclature for peptide fragmentation 
In 1984 Roepstorff and Fohlman proposed a common nomenclature for the different fragments 
obtained (Roepstorff and Fohlman, 1984). This nomenclature was modified in 1988 by Biemann 
(Biemann, et al., 1988). The cleavage of a bond in the main chain can occur at three different positions, 
in a C-C bond, a C-N bond or in an N-C bond. The resulting daughter ions can be divided into two 
separate groups. Those containing the N-terminal of the original peptide and those containing the C-
terminal of the original peptide. These two groups can each be further divided into 3 subgroups. The 
group of N-terminal containing fragments includes an, bn and cn ions depending on what position on the 
peptide backbone fragmentation has occurred. The group of C-terminal containing fragments includes 
xn, yn and zn ions. n indicating the number of amino acids in the fragment. An overview of the different 
fragment ion that can be obtained by MS/MS of peptides is showed in Figure 2.18. 
 
 
 
 
 
 
 
H2N C
R
H
+
A. Peptide fragmentation
B. Immonium ion
H2N CH
R1
C
O
NH CH C
R2 O
NH CH C
R3 O
OH
a1 b1 c1 a2 b2 c2
x2 y2 z2 x1 y1 z1
H+
H2N C
R
H
+
A. Peptide fragmentation
B. Immonium ion
H2N CH
R1
C
O
NH CH C
R2 O
NH CH C
R3 O
OH
a1 b1 c1 a2 b2 c2
x2 y2 z2 x1 y1 z1
H2N CH
R1
C
O
NH CH C
R2 O
NH CH C
R3 O
OH
a1 b1 c1 a2 b2 c2
x2 y2 z2 x1 y1 z1
H+
42 
 
Figure 2.18: Common nomenclature of peptide fragment ions. The common nomenclature 
for different fragment ions generated in MS/MS was proposed by Roepstorff and Fohlman in 
1984, and slightly modified four years later by Biemann. In CID, primarily y and b ions are 
produced, whereas z and c ions are generated by ECD and ETD. 
There are two additional classes of peptide fragments; fragments created by the cleavage of two bonds 
in the peptide chain, and fragments that are further fragmented in the amino acid lateral chain. 
Fragments of the first class are referred to as internal fragments, since they have lost both the N- and C-
terminal sides. They are usually no more than three or four amino residues in length and they are 
especially abundant if the amino acid proline is present in the peptide sequence. Internal fragments are 
designated with series of simple letters representing the sequences of the fragments. Fragments of the 
second class constitute the immonium ions of individual amino acid residues (Figure 2.18). They 
appear in the lower mass region and are not equally abundant for each amino acid residue, but some are 
commonly seen and these provide important information about a sequence. The immonium ion from 
the tyrosine residue at m/z 136 is one of the immonium ions commonly observed and this ion is 
important, when looking for phosphorylated tyrosine residues in a sequence. Immonium ions are 
represented by the capital one-letter code of the corresponding amino acid residues. Immonium ions are 
not frequently observed in iontrap instruments due to the 28% rule. Three additional fragment ion 
types can be observed in high energy spectra, dn, wn and vn. dn and wn and they are formed by the 
cleavage of the bond between the  and the  carbon atoms of the amino acid side chains, dn having the 
positive charge on the N-terminal and wn having the positive charge on the C-terminal fragment. The vn 
fragment ions are formed by the loss of the entire side chain. 
 
Collision induced dissociation (CID) 
Collisional induced dissociation (CID), also known as collisional activation dissociation (CAD), is 
accomplished by converting some of the kinetic energy from the peptide into vibrational energy by 
collision of the (M + nH)n+ ion with a neutral gas (e.g. Ar, Xe or He) in a collision cell (Shukla and 
Futrell, 2000)(Figure 2.19). The fragmented ions, termed daughter ions, are then separated in a second 
mass analyzer and detected by the detector, giving raise to a fragment ion spectrum. The number and 
43 
 
nature of the ions detected will depend on the collisional energy used in the collision cell, on the size of 
the peptide and on the amino acid sequence of the peptide. High energy collision induced 
fragmentation (several keV), as used by TOF-TOF (Medzihradszky, et al., 2000) and magnetic sector 
instruments (Hoffmann, 2001), induce extensive fragmentation and give rise to other types of 
fragments ions. These hold a lot of information, but also complicate the assignment of spectra to amino 
acid sequences. Low collisional energy ( 100 eV) as used by quadrupole and ion trap instruments, 
induce less fragmentation and subsequently

Outros materiais