Computer Simulations Of HD Protein Polyglutamine

There is a new interest in computer simulations of the HD protein.
The two articles below describe computer experiments of the portion of the huntingtin protein responsible for HD.

It has been known since 1993 that HD is caused by an expanded set of CAG repeats in the IT15 gene. It is normal to have 20 or so repeats. See the full amino acid sequence of normal huntingtin below, and note the series of repeated q's in the first row. The q's stand for glutamine, which is the amino acid corresponding to a CAG sequence in DNA. HD occurs when the number of q's exceeds 36 or so. Thus, the stretch of q's, polyglutamine, is somehow responsible for the disease. Scientists have spent a great deal of resources studying polyglutamine, but a molecule of just glutamines is difficult to work with, since it is insoluble. Hence, computer simulations are an attractive alternative.

Max Perutz, the Nobel prize winning father of X-ray crystallography, was the first to apply computer modeling of polyglutamine strands (Perutz, et. al, 1993). Dr Perutz found the now famous "Polar Zipper" structure (motif) that can arise from a series of polyglutamine strands (Figure). This structure inspired us to look for other motifs that could arise from polyglutamine and to study the thermodynamics of these, the goal being to develop a computer model of the aberrant polyglutamine interaction in order to design small molecules that inhibit this interaction (Lathrop, et. al., 1998). We abandoned the computer simulations in favor of mouse models some years ago, but have recently taken up the quest again. It therefore came as a pleasant surprise to find the two papers below, both of which appeared today. They both use computer simulations to study polyglutamine dynamics.

Now that such models have been developed, it is possible to study the effect of various compounds, as modeled on the computer, for their potential for preventing polar sheets from developing. Be prepared for the lights to dim, since we have found molecular dynamics simulations to be extremely compute intensive. And expect to hear more about the role of computer simulations in HD research in the near future.

The sequence of amino acids that defines the huntingtin protein in a normal individual:
LOCUS NP_002102 3144 aa linear PRI 26-OCT-2004
DEFINITION huntingtin [Homo sapiens].
ACCESSION NP_002102
VERSION NP_002102.2 GI:4753163
DBSOURCE REFSEQ: accession NM_002111.4
.dataTable {font-family:monospace} .right {text-align:right} td {padding:0}

1matleklmka feslksfqqq qqqqqqqqqq qqqqqqqqqq pppppppppp pqlpqpppqa
61qpllpqpqpp ppppppppgp avaeeplhrp kkelsatkkd rvnhcltice nivaqsvrns
121pefqkllgia melfllcsdd aesdvrmvad eclnkvikal mdsnlprlql elykeikkng
181aprslraalw rfaelahlvr pqkcrpylvn llpcltrtsk rpeesvqetl aaavpkimas
241fgnfandnei kvllkafian lksssptirr taagsavsic qhsrrtqyfy swllnvllgl
301lvpvedehst llilgvlltl rylvpllqqq vkdtslkgsf gvtrkemevs psaeqlvqvy
361eltlhhtqhq dhnvvtgale llqqlfrtpp pellqtltav ggigqltaak eesggrsrsg
421siveliaggg sscspvlsrk qkgkvllgee ealeddsesr sdvsssalta svkdeisgel
481aassgvstpg saghdiiteq prsqhtlqad svdlascdlt ssatdgdeed ilshsssqvs
541avpsdpamdl ndgtqasspi sdssqttteg pdsavtpsds seivldgtdn qylglqigqp
601qdedeeatgi lpdeaseafr nssmalqqah llknmshcrq psdssvdkfv lrdeatepgd
661qenkpcrikg digqstddds aplvhcvrll sasflltggk nvlvpdrdvr vsvkalalsc
721vgaavalhpe sffsklykvp ldtteypeeq yvsdilnyid hgdpqvrgat ailcgtlics
781ilsrsrfhvg dwmgtirtlt gntfsladci pllrktlkde ssvtcklact avrncvmslc
841sssyselglq liidvltlrn ssywlvrtel letlaeidfr lvsfleakae nlhrgahhyt
901gllklqervl nnvvihllgd edprvrhvaa aslirlvpkl fykcdqgqad pvvavardqs
961svylkllmhe tqppshfsvs titriyrgyn llpsitdvtm ennlsrviaa vshelitstt
1021raltfgccea lcllstafpv ciwslgwhcg vpplsasdes rksctvgmat miltllssaw
1081fpldlsahqd alilagnlla asapkslrss waseeeanpa atkqeevwpa lgdralvpmv
1141eqlfshllkv inicahvldd vapgpaikaa lpsltnppsl spirrkgkek epgeqasvpl
1201spkkgseasa asrqsdtsgp vttskssslg sfyhlpsylk lhdvlkatha nykvtldlqn
1261stekfggflr saldvlsqil elatlqdigk cveeilgylk scfsrepmma tvcvqqllkt
1321lfgtnlasqf dglssnpsks qgraqrlgss svrpglyhyc fmapythftq aladaslrnm
1381vqaeqendts gwfdvlqkvs tqlktnltsv tknradknai hnhirlfepl vikalkqytt
1441ttcvqlqkqv ldllaqlvql rvnyclldsd qvfigfvlkq feyievgqfr eseaiipnif
1501fflvllsyer yhskqiigip kiiqlcdgim asgrkavtha ipalqpivhd lfvlrgtnka
1561dagkeletqk evvvsmllrl iqyhqvlemf ilvlqqchke nedkwkrlsr qiadiilpml
1621akqqmhidsh ealgvlntlf eilapsslrp vdmllrsmfv tpntmasvst vqlwisgila
1681ilrvlisqst edivlsriqe lsfspylisc tvinrlrdgd ststleehse gkqiknlpee
1741tfsrfllqlv gilledivtk qlkvemseqq htfycqelgt llmclihifk sgmfrritaa
1801atrlfrsdgc ggsfytldsl nlrarsmitt hpalvllwcq illlvnhtdy rwwaevqqtp
1861krhslsstkl lspqmsgeee dsdlaaklgm cnreivrrga lilfcdyvcq nlhdsehltw
1921livnhiqdli slsheppvqd fisavhrnsa asglfiqaiq srcenlstpt mlkktlqcle
1981gihlsqsgav ltlyvdrllc tpfrvlarmv dilacrrvem llaanlqssm aqlpmeelnr
2041iqeylqssgl aqrhqrlysl ldrfrlstmq dslspsppvs shpldgdghv sletvspdkd
2101wyvhlvksqc wtrsdsalle gaelvnripa edmnafmmns efnlsllapc lslgmseisg
2161gqksalfeaa revtlarvsg tvqqlpavhh vfqpelpaep aaywsklndl fgdaalyqsl
2221ptlaralaqy lvvvsklpsh lhlppekekd ivkfvvatle alswhliheq iplsldlqag
2281ldccclalql pglwsvvsst efvthacsli ycvhfileav avqpgeqlls perrtntpka
2341iseeeeevdp ntqnpkyita acemvaemve slqsvlalgh krnsgvpafl tpllrniiis
2401larlplvnsy trvpplvwkl gwspkpggdf gtafpeipve flqekevfke fiyrintlgw
2461tsrtqfeetw atllgvlvtq plvmeqeesp peedtertqi nvlavqaits lvlsamtvpv
2521agnpavscle qqprnkplka ldtrfgrkls iirgiveqei qamvskreni athhlyqawd
2581pvpslspatt galishekll lqinperelg smsyklgqvs ihsvwlgnsi tplreeewde
2641eeeeeadapa psspptspvn srkhragvdi hscsqfllel ysrwilpsss arrtpailis
2701evvrsllvvs dlfternqfe lmyvtltelr rvhpsedeil aqylvpatck aaavlgmdka
2761vaepvsrlle stlrsshlps rvgalhgvly vlecdllddt akqlipvisd yllsnlkgia
2821hcvnihsqqh vlvmcatafy lienypldvg pefsasiiqm cgvmlsgsee stpsiiyhca
2881lrglerllls eqlsrldaes lvklsvdrvn vhsphramaa lglmltcmyt gkekvspgrt
2941sdpnpaapds esvivamerv svlfdrirkg fpcearvvar ilpqflddff ppqdimnkvi
3001geflsnqqpy pqfmatvvyk vfqtlhstgq ssmvrdwvml slsnftqrap vamatwslsc
3061ffvsastspw vaailphvis rmgkleqvdv nlfclvatdf yrhqieeeld rrafqsvlev
3121vaapgspyhr lltclrnvhk vttc

References:

Perutz MF, Staden R, Moens L, De Baere I. Polar zippers. Curr Biol. 1993 May 1;3(5):249-53.

Lathrop RH, Casale M, Tobias DJ, Marsh JL, Thompson LM. Modeling protein homopolymeric repeats: possible polyglutamine structural motifs for Huntington's disease. Proc Int Conf Intell Syst Mol Biol. 1998;6:105-14.

Khare SD, Ding F, Gwanmesia KN, Dokholyan NV.
Two publications on computer simulations of the HD protein
Khare SD, Ding F, Gwanmesia KN, Dokholyan NV. Molecular origin of polyglutamine aggregation in neurodegenerative diseases. PLoS Comput Biol. 2005 Aug;1(3):e30.

Expansion of polyglutamine (polyQ) tracts in proteins results in protein aggregation and is associated with cell death in at least nine neurodegenerative diseases. Disease age of onset is correlated with the polyQ insert length above a critical value of 35-40 glutamines. The aggregation kinetics of isolated polyQ peptides in vitro also shows a similar critical-length dependence. While recent experimental work has provided considerable insights into polyQ aggregation, the molecular mechanism of aggregation is not well understood. Here, using computer simulations of isolated polyQ peptides, we show that a mechanism of aggregation is the conformational transition in a single polyQ peptide chain from random coil to a parallel beta-helix. This transition occurs selectively in peptides longer than 37 glutamines. In the beta-helices observed in simulations, all residues adopt beta-strand backbone dihedral angles, and the polypeptide chain coils around a central helical axis with 18.5 +/- 2 residues per turn. We also find that mutant polyQ peptides with proline-glycine inserts show formation of antiparallel beta-hairpins in their ground state, in agreement with experiments. The lower stability of mutant beta-helices explains their lower aggregation rates compared to wild type. Our results provide a molecular mechanism for polyQ-mediated aggregation.

Armen RS, Bernard BM, Day R, Alonso DO, Daggett V. Characterization of a possible amyloidogenic precursor in glutamine-repeat neurodegenerative diseases. Proc Natl Acad Sci U S A. 2005 Sep 12;

Several neurodegenerative diseases are linked to expanded repeats of glutamine residues, which lead to the formation of amyloid fibrils and neuronal death. The length of the repeats correlates with the onset of Huntington's disease, such that healthy individuals have <38 residues and individuals with >38 repeats exhibit symptoms. Because it is difficult to obtain atomic-resolution structural information for poly(L-glutamine) (polyQ) in aqueous solution experimentally, we performed molecular dynamics simulations to investigate the conformational behavior of this homopolymer. In simulations of 20-, 40-, and 80-mer polyQ, we observed the formation of the "alpha-extended chain" conformation, which is characterized by alternating residues in the alphaL and alphaR conformations to yield a sheet. The structural transition from disordered random-coil conformations to the alpha-extended chain conformation exhibits modest length and temperature dependence, in agreement with the exp erimental observation that aggregation depends on length and temperature. We propose that fibril formation in polyQ may occur through an alpha-sheet structure, which was proposed by Pauling and Corey [Pauling, L. & Corey, R. B. (1951) Proc. Natl. Acad. Sci. USA 37, 251-256]. Also, we propose an atomic-resolution model of how the inhibitory peptide QBP1 (polyQ-binding peptide 1) may bind to polyQ in an alpha-extended chain conformation to inhibit fibril formation.