Science Talk:
After the Genome, Part 2
The Paradigm Remains the Same
Ernest Beutler, Chair, Department of Molecular and
Experimental Medicine
Having the sequence of the whole genome is a useful
tool [rather than] an entirely new concept (In other
words, I don't agree with the idea that "It's a new
ballgame now that we know the genome"). Knowing the
sequence allows us to move towards our objectives more
rapidlywhether that objective is finding a new
drug target or solving a problem with a genetic disease.
That's all it really isa tool. And it's a tool
that didn't suddenly become available when people announced
that they had completed the sequence. We've been able
to perform genome searches for the last 14 years. It's
just that now, the database is more complete, although
still not entirely so.
There are a lot of different ways to use this tool
to find better drug targets. One is the forward genetic
approachrandom mutagenesis. If you produce a model
with hypertension or diabetes or cancer as an inherited
genotype, you perform linkage analysis. There are lots
of markers that help you see what piece of which chromosomes
have been inherited from which strain. Then you narrow
the field to a single gene by what is known as positional
cloning.
Could you positionally clone a gene before the genome
was available? Sure, but it was a lot more work. It
was a lot harder. Now areas that you would have had
to sequence through are already sequenced. Markers that
you would have had to find are already known. The availability
of the genome allows you to find a mutant gene much
faster, and, once you find the gene that causes the
phenotype, then you may gain some understanding of why
this strain has this particular disease. Once you have
that understanding, then maybe you'll be able to treat
the disease and its human counterpart.
The paradigm hasn't changed. Once you find the pathogenesis
of a disease, then you try to correct that through drugs,
diet, and a variety of different ways. Is [the genome]
a revolution? I don't think so. But it's a very powerful
tool for understanding disease processes, and it will
move things along a lot faster.
|
Making
a Mark
Steve Kay, Professor, Department of Cell Biology
Genomics aims to provide a definition of the gene content
of an organism. Functional genomics takes the genome
sequences and turns them into highly confident statements
about how many genes there are and what they do. It's
going to provide rich biological descriptions of function
on a scale that was unimaginable a few years ago.
What is really exciting is that much of the effort
at TSRI's Institute of Childhood and Neglected Diseases
(ICND) is about neurobiologytrying to understand
the molecular and cellular bases of psychiatric and
neurological disorders. It is extremely difficult to
define gene-by-gene how each may contribute to complex
behavior, for example, the sleep/wake cycle and how
it relates to depression; learning and memory; autism
and neurodegeneration; cerebellum disorders like epilepsy
and migraines; deafness.
One of the best ways of defining how genes contribute
to these complex biological phenomena is to use an in
vivo standarda working physiological system
to see how the genes work. At the ICND, we are fortunate
to have a partnership with the Genomics Institute of
the Novartis Research Foundation (GNF), which will make
these models available for researchers. This is critically
important for finding new targets for therapeutic intervention.
We can assay complex [phenotypes] and begin to determine
which genes are important for biological function and
which are important for dysfunction.
Genes want to express themselves and they do it differentially.
Every single cell has the same DNA content, but they
look different because of differential expression.
Differential expression mapping is a powerful way
of assigning function. A lot of people in the ICND are
interested in using gene expression analysis as a way
of assigning function to genes. What we did recently
was to identify a potassium channel that plays a key
role in regulating the sleep/wake cycle. This provides
a target for therapy.
The investment that the institute has made in facilities,
recruitment, and opportunities for interacting with
GNF in the past three years has provided the infrastructure
we needed to carry out functional genomics. Now we are
seeing it pay off in publications and in making a mark.
|
Proteomics,
Mass Spectrometry, and Post-Translational Modification
John Yates, Professor, Department of Cell Biology
The genome sequence tells you a lot about an organism,
but it doesn't tell you everything. The next level up
is the proteome. What makes proteomics more challenging
than genomics is the fact that [the proteome is dynamic]depending
on the given cellular state, environment, and so forth,
you get a different proteome expressed. Although our
genome is the same in every single cell in the body,
the kinds of proteins that are expressed in those cells
will be very different, depending on what the cell is
supposed to be doing.
Lots of proteins come together to form complexes in
order to carry out some physiological function. We need
to know what proteins are coming together, when they
come together, what's driving them together, what is
the stoichiometry of those complexeshow many of
each of these components are in there, things like that.
We also need information about post-translational modifications,
[which] turn out to be extremely interesting. There
are lots and lots of modifications, and the most common
modification that people correlate with regulation is
phosphorylation.
Phosphorylation is very widespread and it's important
to know when it's on and when it's off and how that
correlates with particular physiological processes.
You'll also find that acetylation and methylation may
have regulatory roles. There's a lot of interest now
in how those things might involve regulatory processesÉ
Even though you may have X amount of a particular protein,
how much of it is actually active? That may be a function
of the modification state of that particular protein.
Correlating those pieces of information will actually
be quite interesting.
[In proteomics], you need to be able to measure quickly,
robustly, and in a high-throughput fashion that kind
of information, and mass spectrometry [enables one to
do just that].
We use tandem mass spectrometry, [which] allows you
to select peptides from the mixture and get fragmentation
information that tells you the sequence. What my lab
invented a long time ago is a way to take that mass
spectrometry fragmentation pattern and correlate it
with sequences in the database. So, as genomes get sequenced,
that information goes into the database and it becomes
very easy to associate that mass spectrometry data to
those sequences in the database, and you can do this
in a very high-throughput manner.
It also enables new approaches for looking at protein
complexes, looking at the components of the cells, looking
at where proteins might localize, [and studying post-translational
modification]. If the computer program knows to check
for something like phosphorylation, it'll go and search
the sequence database. Every time it sees a residue
that could potentially be phosphorylated it's going
to ask the question, "is this phosphorylated or not?".
And you can do that with almost any kind of modification
in this world [with the possible exception of] carbohydrates...
|
The
Study of the Glycome
James Paulson, Professor, Department of Molecular
Biology
The Consortium for Functional Glycomics is focused
on information carried by carbohydrates that mediate
communication between cells. As part of that, we want
to understand the regulation of the genes responsible
for the synthesis of the carbohydrate structures required
for mediating cell communication.
To that end, we have established a core resource that
operates within the existing TSRI Gene Microarray Facility.
The purpose is to create a custom array of genes that
are related to the scope of the consortium. We've just
finished assembling a list of about 1,500 human and
murine genes that include glycosyl transferases, which
are the enzymes that synthesize carbohydrates, and carbohydrate
binding proteins, the proteins that bind the structures
that appear on the surfaces of cells. And then there
are classes of related genes of interest.
The array will be produced by Affymetrix [a firm that
develops technology for acquiring, analyzing, and managing
genetic information] who will take the list we developed
and create a custom gene chip [an array of nucleotides
on a solid surface]. We're doing that for two reasons.
We'll have a well focused set of genes that are well
annotated, and it's also cost-effective. Within two
months, we should have [the chips] in-house.
Once they are available, investigators will apply
to use the chips for experiments within the scope of
the consortium. We will either send them the chips (if
they have their own array facility), or they can send
us the RNA needed to analyze gene expression using the
chips, and we can do the work here and send them the
data. In either case, the (raw and analyzed) data will
be posted on a public web site.
Now that the genome has been completed, there is a
general interest in the other layers of complexity that
define an organism as complex as a human. In the post-genomic
era, interest has shifted towards these other layers
of complexity including glycomics, the study of the
glycome [all the carbohydrates in the human body]. I
don't think we're ready yet to do the same kind of comprehensive
analysis with the glycome that has been done with the
genome, because to do that, you would need to categorize
every single carbohydrate that is produced on every
single cell type in the organism. And even every glycoprotein
produced in every cell.
We know, based on a lot of work through the years,
that most of that information will not be particularly
useful. It will be re-documenting over and over again
very similar carbohydrate structures. That's why our
focus has been functional glycomics, where we're trying
to sort out the sugar structures that do make a difference,
that do participate in biological interactions. Use
of the glyco-gene microarray will be particularly valuable
to understanding how glycosylation of a cell changes
during differentiation and / or activation.
|
|
|