Genome polyprotein

Details

Name
Genome polyprotein
Synonyms
  • 3.6.1.15
  • P2A
Gene Name
Not Available
Organism
HHAV
Amino acid sequence
>lcl|BSEQ0012706|Genome polyprotein
MNMSKQGIFQTVGSGLDHILSLADIEEEQMIQSVDRTAVTGASYFTSVDQSSVHTAEVGS
HQIEPLKTSVDKPGSKKTQGEKFFLIHSADWLTTHALFHEVAKLDVVKLLYNEQFAVQGL
LRYHTYARFGIEIQVQINPTPFQQGGLICAMVPGDQSYGSIASLTVYPHGLLNCNINNVV
RIKVPFIYTRGAYHFKDPQYPVWELTIRVWSELNIGTGTSAYTSLNVLARFTDLELHGLT
PLSTQMMRNEFRVSTTENVVNLSNYEDARAKMSFALDQEDWKSDPSQGGGIKITHFTTWT
SIPTLAAQFPFNASDSVGQQIKVIPVDPYFFQMTNTNPDQKCITALASICQMFCFWRGDL
VFDFQVFPTKYHSGRLLFCFVPGNELIDVTGITLKQATTAPCAVMDITGVQSTLRFRVPW
ISDTPYRVNRYTKSAHQKGEYTAIGKLIVYCYNRLTSPSNVASHVRVNVYLSAINLECFA
PLYHAMDVTTQVGDDSGGFSTTVSTEQNVPDPQVGITTMRDLKGKANRGKMDVSGVQAPR
GSYQQQLNDPVLAKKVPETFPELKPGESRHTSDHMSIYKFMGRSHFLCTFTFNSNNKEYT
FPITLSSTSNPPHGLPSTLRWFFNLFQLYRGPLDLTIIITGATDVDGMAWFTPVGLAVDP
WVEKESALSIDYKTALGAVRFNTRRTGNIQIRLPWYSYLYAVSGALDGLGDKTDSTFGLF
LFEIANYNHSDEYLSFSCYLSVTEQSEFYFPRAPLNSNAMLSTESMMSRIAAGDLESSVD
DPRSEEDRRFESHIECRKPYKELRLEVGKQRLKYAQEELSNEVLPPPRKMKGLFSQAKIS
LFYTEEHEIMKFSWRGVTADTRALRRFGFSLAAGRSVWTLEMDAGVLTGRLIRLNDEKWT
EMKDDKIVSLIEKFTSNKYWSKVNFPHGMLDLEEIAANSKDFPNMSETDLCFLLHWLNPK
KINLADRMLGLSGVQEIKEQGVGLIAECRTFLDSIAGTLKSMMFGFHHSVTVEIINTVLC
FVKSGILLYVIQQLNQDEHSHIIGLLRVMNYADIGCSVISCGKVFSKMLETVFNWQMDSR
MMELRTQSFSNWLRDICSGITIFKSFKDAIYWLYTKLKDFYEVNYGKKKDILNILKDNQQ
KIEKAIEEADNFCILQIQDVEKFDQYQKGVDLIQKLRTVHSMAQVDPNLGVHLSPLRDCI
ARVHQKLKNLGSINQAMVTRCEPVVCYLYGKRGGGKSLTSIALATKICKHYGVEPEKNIY
TKPVASDYWDGYSGQLVCIIDDIGQNTTDEDWSDFCQLVSGCPMRLNMASLEEKGRHFSS
PFIIATSNWSNPSPKTVYVKEAIDRRLHFKVEVKPASFFKNPHNDMLNVNLAKTNDAIKD
MSCVDLIMDGHNISLMDLLSSLVMTVEIRKQNMSEFMELWSQGISDDDNDSAVAEFFQSF
PSGEPSNWKLSSFFQSVTNHKWVAVGAAVGILGVLVGGWFVYKHFSRKEEEPIPAEGVYH
GVTKPKQVIKLDADPVESQSTLEIAGLVRKNLVQFGVGEKNGCVRWVMNALGVKDDWLLV
PSHAYKFEKDYEMMEFYFNRGGTYYSISAGNVVIQSLDVGFQDVVLMKVPTIPKFRDITQ
HFIKKGDVPRALNRLATLVTTVNGTPMLISEGPLKMEEKATYVHKKNDGTTVDLTVDQAW
RGKGEGLPGMCGGALVSSNQSIQNAILGIHVAGGNSILVAKLVTQEMFQNIDKKIESQRI
MKVEFTQCSMNVVSKTLFRKSPIHHHIDKTMINFPAAMPFSKAEIDPMAMMLSKYSLPIV
EEPEDYKEASVFYQNKIVGKTQLVDDFLDLDMAITGAPGIDAINMDSSPGFPYVQEKLTK
RDLIWLDENGLLLGVHPRLAQRILFNTVMMENCSDLDVVFTTCPKDELRPLEKVLESKTR
AIDACPLDYTILCRMYWGPAISYFHLNPGFHTGVAIGIDPDRQWDELFKTMIRFGDVGLD
LDFSAFDASLSPFMIREAGRIMSELSGTPSHFGTALINTIIYSKHLLYNCCYHVCGSMPS
GSPCTALLNSIINNINLYYVFSKIFGKSPVFFCQALRILCYGDDVLIVFSRDVQIDNLDL
IGQKIVDEFKKLGMTATSADKNVPQLKPVSELTFLKRSFNLVEDRIRPAISEKTIWSLMA
WQRSNAEFEQNLENAQWFAFMHGYEFYQKFYYFVQSCLEKEMIEYRLKSYDWWRMRFYDQ
CFICDLS
Number of residues
2227
Molecular Weight
251897.82
Theoretical pI
6.56
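The residue count reported above can be checked directly against the FASTA record. The following is a minimal sketch, assuming a single-record FASTA string; `parse_fasta` is a hypothetical helper written for this illustration, and the `excerpt` contains only the first and last sequence lines of the record, not the full sequence.

```python
def parse_fasta(text):
    """Return (header, sequence) for a single-record FASTA string."""
    lines = [ln.strip() for ln in text.strip().splitlines() if ln.strip()]
    if not lines or not lines[0].startswith(">"):
        raise ValueError("expected a FASTA header line starting with '>'")
    header = lines[0][1:]          # drop the leading '>'
    sequence = "".join(lines[1:])  # sequence lines are simply concatenated
    return header, sequence


# Excerpt only (assumption for brevity): first and last sequence lines.
excerpt = """>lcl|BSEQ0012706|Genome polyprotein
MNMSKQGIFQTVGSGLDHILSLADIEEEQMIQSVDRTAVTGASYFTSVDQSSVHTAEVGS
CFICDLS
"""

header, seq = parse_fasta(excerpt)
db, accession, name = header.split("|")
print(accession, name, len(seq))

# The full record has 37 lines of 60 residues followed by a final line of 7.
full_length = 37 * 60 + len("CFICDLS")
print(full_length)
```

Run on the complete sequence instead of the excerpt, `len(seq)` should agree with the "Number of residues" field.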
GO Classification
Functions
ATP binding / cysteine-type endopeptidase activity / ion channel activity / RNA binding / RNA helicase activity / RNA-directed RNA polymerase activity / structural molecule activity
Processes
pore formation by virus in membrane of host cell / protein oligomerization / RNA-protein covalent cross-linking / suppression by virus of host gene expression / suppression by virus of host MAVS activity / suppression by virus of host MAVS activity by MAVS proteolysis / transcription, DNA-templated / viral entry into host cell / viral RNA genome replication / virion attachment to host cell
Components
host cell cytoplasmic vesicle membrane / host cell mitochondrial outer membrane / integral to membrane of host cell / membrane / viral capsid
General Function
Structural molecule activity
Specific Function
Capsid proteins VP1, VP2, and VP3 form a closed capsid enclosing the viral positive-strand RNA genome. All three proteins contain a beta-sheet structure called a beta-barrel jelly roll. Together they form an icosahedral capsid (T=3) composed of 60 copies each of VP1, VP2, and VP3, with a diameter of approximately 300 Angstroms. VP1 is situated at the 12 fivefold axes, whereas VP2 and VP3 are located at the quasi-sixfold axes. The capsid interacts with HAVCR1 to mediate virion attachment to the target cell (By similarity).
Protein VP0: The VP0 precursor is a component of immature procapsids. The N-terminal domain of VP0, protein VP4, is needed for the assembly of 12 pentamers into the icosahedral structure. Unlike in other picornaviruses, HAV VP4 does not seem to be myristoylated and has not been detected in mature virions, supposedly owing to its small size (By similarity).
VP1-2A precursor: A component of immature procapsids that corresponds to an extended form of the structural protein VP1. The C-terminal domain of VP1-2A, protein 2A, acts as an assembly signal that allows multimerization of VP1-2A and formation of pentamers of VP1-VP2-VP3 trimers. It is proteolytically removed from the precursor by a host protease and does not seem to be found in mature particles (By similarity).
Protein 2B and 2BC precursor: Affect membrane integrity and cause an increase in membrane permeability.
Protein 2C: Associates with and induces structural rearrangements of intracellular membranes. It displays RNA-binding, nucleotide-binding, and NTPase activities (By similarity).
Protein 3A: Via its hydrophobic domain, serves as a membrane anchor for the 3AB and 3ABC precursors.
3AB precursor: Interacts with the 3CD precursor and with RNA structures found at both the 5'- and 3'-termini of the viral genome. Since the 3AB precursor contains the hydrophobic domain 3A, it probably anchors the whole viral replicase complex to the intracellular membranes on which viral RNA synthesis occurs (By similarity).
3ABC precursor: Targeted to the mitochondrial membrane, where protease 3C activity cleaves and inhibits the host antiviral protein MAVS, thereby disrupting activation of IRF3 through the IFIH1/MDA5 pathway. In vivo, the protease activity of the 3ABC precursor is more efficient in cleaving the 2BC precursor than that of protein 3C. The 3ABC precursor may therefore play a role in the proteolytic processing of the polyprotein (By similarity).
Protein 3B: Covalently linked to the 5'-end of both the positive-strand and negative-strand genomic RNAs. It acts as a genome-linked replication primer (By similarity).
Protease 3C: Cysteine protease that generates mature viral proteins from the precursor polyprotein. In addition to its proteolytic activity, it binds to viral RNA and thus influences viral genome replication. RNA and substrate bind cooperatively to the protease. It also cleaves host proteins such as PCBP2 (By similarity).
RNA-directed RNA polymerase 3D-POL: Replicates genomic and antigenomic RNA by recognizing replication-specific signals.
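The relationships between the precursors and mature products named in this description can be written down as a small lookup table. This is an illustrative sketch only: the left-to-right order in `MATURE_ORDER` is the canonical picornavirus polyprotein layout, which is an assumption from general virology and is not stated in this record, and all names in the code are hypothetical.

```python
# Assumed canonical picornavirus order (not stated explicitly in this record).
MATURE_ORDER = ["VP4", "VP2", "VP3", "VP1", "2A", "2B", "2C", "3A", "3B", "3C", "3D"]

# Precursors mentioned in the text, mapped to the mature products they span.
PRECURSORS = {
    "VP0": ["VP4", "VP2"],
    "VP1-2A": ["VP1", "2A"],
    "2BC": ["2B", "2C"],
    "3AB": ["3A", "3B"],
    "3ABC": ["3A", "3B", "3C"],
    "3CD": ["3C", "3D"],
}


def is_contiguous(parts, order=MATURE_ORDER):
    """True if `parts` is an uninterrupted run within the polyprotein order."""
    start = order.index(parts[0])
    return order[start:start + len(parts)] == parts


def capsid_copy_numbers(pentamers=12, protomers_per_pentamer=5):
    """Copies of each capsid protein: one VP1-VP2-VP3 protomer per pentamer slot."""
    protomers = pentamers * protomers_per_pentamer
    return {name: protomers for name in ("VP1", "VP2", "VP3")}


for name, parts in PRECURSORS.items():
    print(name, "->", "+".join(parts), "contiguous:", is_contiguous(parts))

print(capsid_copy_numbers())
```

The copy-number helper restates the arithmetic in the text: 12 pentamers of 5 protomers each give 60 copies of each of VP1, VP2, and VP3.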
Pfam Domain Function
Transmembrane Regions
Not Available
Cellular Location
Virion
Gene sequence
>lcl|BSEQ0007431|6684 bp
ACTCAGGGGCATTTAGGTTTTTCCTCATTCTTAAACAATAATGAATATGTCCAAACAAGG
AATTTTCCAGACTGTTGGGAGTGGCCTTGACCACATCCTGTCTTTGGCAGATATTGAGGA
AGAGCAAATGATTCAGTCCGTTGATAGGACTGCAGTGACTGGAGCTTCTTACTTCACTTC
TGTGGACCAATCTTCAGTTCATACTGCTGAGGTTGGCTCACATCAAATTGAACCTTTGAA
AACCTCTGTTGATAAACCTGGTTCTAAGAAAACTCAGGGGGAAAAGTTTTTCCTGATTCA
TTCTGCTGATTGGCTCACTACACATGCTCTCTTTCATGAAGTTGCAAAATTGGATGTGGT
GAAACTACTGTATAATGAGCAGTTTGCCGTCCAAGGTTTGTTGAGATACCATACATATGC
AAGATTTGGCATTGAGATTCAAGTTCAGATAAATCCCACACCCTTTCAGCAAGGAGGACT
AATTTGTGCCATGGTTCCTGGTGACCAAAGTTATGGTTCAATAGCATCCTTGACTGTTTA
TCCTCATGGTCTGTTAAATTGCAATATCAACAATGTAGTTAGAATAAAGGTTCCATTTAT
TTATACTAGAGGTGCTTATCATTTTAAAGATCCACAGTACCCAGTTTGGGAATTGACAAT
CAGAGTTTGGTCAGAGTTGAATATTGGAACAGGAACTTCAGCTTACACTTCACTCAATGT
TTTAGCTAGGTTTACAGATTTGGAGTTGCATGGATTAACTCCTCTTTCTACACAGATGAT
GAGAAATGAATTTAGGGTCAGTACTACTGAAAATGTTGTAAATTTGTCAAATTATGAAGA
TGCAAGGGCAAAAATGTCTTTTGCTTTGGATCAGGAAGATTGGAAGTCTGATCCTTCCCA
AGGTGGTGGAATTAAAATTACTCATTTTACTACCTGGACATCCATTCCAACCTTAGCTGC
TCAGTTTCCATTTAATGCTTCAGATTCAGTTGGACAACAAATTAAAGTTATTCCAGTGGA
CCCATACTTTTTCCAAATGACAAACACTAATCCTGATCAAAAATGTATAACTGCCTTGGC
CTCTATTTGTCAGATGTTCTGCTTTTGGAGGGGAGATCTTGTTTTTGATTTTCAGGTTTT
TCCAACCAAATATCATTCAGGTAGACTGTTGTTTTGTTTTGTTCCTGGGAATGAGTTAAT
AGATGTTACTGGAATTACATTAAAACAGGCAACTACTGCTCCTTGTGCAGTGATGGACAT
TACAGGAGTGCAGTCAACCTTGAGATTTCGTGTTCCTTGGATTTCTGATACACCTTATCG
AGTGAATAGGTACACGAAGTCAGCACATCAAAAAGGTGAGTACACTGCCATTGGGAAGCT
TATTGTGTATTGTTATAACAGACTGACTTCTCCTTCTAATGTTGCCTCTCATGTTAGAGT
TAATGTTTATCTTTCAGCAATTAATTTGGAATGTTTTGCTCCTCTTTACCATGCTATGGA
TGTTACTACACAGGTTGGAGATGATTCAGGAGGTTTCTCAACAACAGTTTCTACAGAGCA
GAATGTTCCTGATCCCCAAGTTGGGATAACAACCATGAGGGATTTAAAAGGAAAAGCCAA
TAGGGGAAAGATGGATGTTTCAGGAGTGCAAGCACCTCGTGGGAGCTATCAGCAACAATT
GAACGATCCAGTTTTAGCAAAGAAAGTACCTGAGACATTTCCTGAATTGAAGCCTGGAGA
GTCCAGACATACATCAGATCACATGTCTATTTATAAATTCATGGGAAGGTCTCATTTTTT
GTGCACTTTTACTTTCAATTCAAATAATAAAGAGTACACATTTCCAATAACCCTGTCTTC
GACTTCTAATCCTCCTCATGGTTTACCATCAACATTAAGGTGGTTCTTCAATTTGTTTCA
GTTGTATAGAGGACCATTGGATTTAACAATTATAATCACAGGAGCCACTGATGTGGATGG
TATGGCCTGGTTTACTCCAGTGGGCCTTGCTGTCGACCCTTGGGTGGAAAAGGAGTCAGC
TTTGTCTATTGATTATAAAACTGCCCTTGGAGCTGTTAGATTTAATACAAGAAGAACAGG
AAACATTCAAATTAGATTGCCGTGGTATTCTTATTTGTATGCCGTGTCTGGAGCACTGGA
TGGCTTGGGGGATAAGACAGATTCTACATTTGGATTGTTTCTATTCGAGATTGCAAATTA
CAATCATTCTGATGAATATTTGTCCTTCAGTTGTTATTTGTCTGTCACAGAGCAATCAGA
GTTCTATTTTCCTAGAGCTCCATTAAATTCAAATGCTATGTTGTCCACTGAATCCATGAT
GAGTAGAATTGCAGCTGGAGACTTGGAGTCATCAGTGGATGATCCCAGATCAGAGGAGGA
TAGAAGATTTGAGAGTCATATAGAATGTAGGAAACCATACAAAGAATTGAGACTGGAGGT
TGGGAAACAAAGACTCAAATATGCTCAGGAAGAGTTATCAAATGAAGTGCTTCCACCTCC
TAGGAAAATGAAGGGGTTATTTTCACAAGCTAAAATTTCTCTTTTTTATACTGAGGAGCA
TGAAATAATGAAGTTTTCTTGGAGAGGAGTGACTGCTGATACTAGGGCTTTGAGAAGATT
TGGATTCTCTCTGGCTGCTGGTAGAAGTGTGTGGACTCTTGAAATGGATGCTGGAGTTCT
TACTGGAAGATTGATCAGATTGAATGATGAGAAATGGACAGAAATGAAGGATGATAAGAT
TGTTTCATTAATTGAAAAGTTCACAAGCAATAAATATTGGTCTAAAGTGAATTTTCCACA
TGGAATGTTGGATCTTGAAGAAATTGCTGCCAATTCTAAGGATTTTCCAAATATGTCTGA
GACAGATTTGTGTTTCCTGTTACATTGGCTAAATCCAAAGAAAATCAATTTAGCAGATAG
AATGCTTGGATTGTCTGGAGTGCAGGAAATTAAGGAACAGGGTGTTGGACTGATAGCAGA
GTGTAGAACTTTCTTGGATTCTATTGCTGGGACTTTGAAATCTATGATGTTTGGGTTTCA
TCATTCTGTGACTGTTGAAATTATAAATACTGTGCTTTGTTTTGTTAAGAGTGGAATCCT
GCTTTATGTCATACAACAATTGAACCAAGATGAACACTCTCACATAATTGGTTTGTTGAG
AGTTATGAATTATGCAGATATTGGCTGTTCAGTTATTTCATGTGGTAAAGTTTTTTCCAA
AATGTTAGAAACAGTTTTTAATTGGCAAATGGATTCTAGAATGATGGAGCTGAGGACTCA
GAGCTTCTCTAATTGGTTAAGAGATATTTGTTCAGGAATTACTATTTTTAAAAGTTTTAA
GGATGCCATATATTGGTTATATACAAAATTGAAGGATTTTTATGAAGTAAATTATGGCAA
GAAAAAGGATATTCTTAATATTCTCAAAGATAATCAGCAAAAAATAGAAAAAGCCATTGA
AGAAGCAGACAATTTTTGCATTTTGCAAATTCAAGATGTAGAGAAATTTGATCAGTATCA
GAAAGGGGTTGATTTAATACAAAAGCTGAGAACTGTCCATTCAATGGCGCAAGTTGACCC
CAATTTGGGGGTTCATTTGTCACCTCTCAGAGATTGCATAGCAAGAGTCCACCAAAAGCT
CAAGAATCTTGGATCTATAAATCAGGCCATGGTAACAAGATGTGAGCCAGTTGTTTGCTA
TTTGTATGGCAAAAGAGGGGGAGGGAAAAGCTTGACTTCAATTGCATTGGCAACCAAAAT
TTGTAAACACTATGGTGTTGAACCTGAGAAAAATATTTACACCAAACCTGTGGCCTCAGA
TTATTGGGATGGATATAGTGGACAATTAGTTTGCATTATTGATGATATTGGCCAAAACAC
AACAGATGAAGATTGGTCAGATTTTTGTCAATTAGTGTCAGGATGCCCAATGAGATTGAA
TATGGCTTCTCTAGAGGAGAAGGGCAGACATTTTTCCTCTCCTTTTATAATAGCAACTTC
AAATTGGTCAAATCCAAGTCCAAAAACAGTTTATGTTAAGGAAGCAATTGATCGTAGGCT
TCATTTTAAGGTTGAAGTTAAACCTGCTTCATTTTTTAAAAATCCTCACAATGATATGTT
GAATGTTAATTTGGCCAAAACAAATGATGCAATTAAGGACATGTCTTGTGTTGATTTAAT
AATGGATGGACACAATATTTCATTGATGGATTTACTTAGTTCCTTAGTGATGACAGTTGA
AATTAGGAAACAGAATATGAGTGAATTCATGGAGTTGTGGTCTCAGGGAATTTCAGATGA
TGACAATGATAGTGCAGTGGCTGAGTTTTTCCAGTCTTTTCCATCTGGTGAACCATCAAA
TTGGAAGTTATCTAGTTTTTTCCAATCTGTCACTAATCACAAGTGGGTTGCTGTGGGAGC
TGCAGTTGGCATTCTTGGAGTGCTTGTGGGAGGATGGTTTGTGTATAAGCATTTTTCCCG
CAAAGAGGAAGAACCAATTCCAGCTGAAGGGGTTTATCATGGCGTGACTAAGCCCAAACA
AGTGATTAAATTGGATGCAGATCCAGTAGAGTCCCAGTCAACTCTAGAAATAGCAGGATT
AGTTAGGAAAAATCTGGTTCAGTTTGGAGTTGGTGAGAAAAATGGATGTGTGAGATGGGT
CATGAATGCCTTAGGAGTGAAGGATGATTGGTTGTTAGTACCTTCTCATGCTTATAAATT
TGAAAAGGATTATGAAATGATGGAGTTTTACTTCAATAGAGGTGGAACTTACTATTCAAT
TTCAGCTGGTAATGTTGTTATTCAATCTTTAGATGTGGGATTTCAAGATGTTGTTTTAAT
GAAGGTTCCTACAATTCCCAAGTTTAGAGATATTACTCAACACTTTATTAAGAAAGGAGA
TGTGCCTAGAGCCTTAAATCGCTTGGCAACATTAGTGACAACCGTTAATGGAACTCCTAT
GTTAATTTCTGAGGGACCATTAAAGATGGAAGAAAAAGCCACTTATGTTCATAAGAAGAA
TGATGGTACTACAGTTGATTTGACTGTAGATCAGGCATGGAGAGGAAAAGGTGAAGGTCT
TCCTGGAATGTGTGGTGGGGCCCTAGTGTCATCAAATCAGTCCATACAGAATGCAATTTT
GGGTATTCATGTTGCTGGAGGAAATTCAATTCTTGTGGCAAAGCTGGTTACTCAAGAAAT
GTTTCAAAACATTGATAAGAAAATTGAAAGTCAGAGAATAATGAAAGTGGAATTTACTCA
ATGTTCAATGAATGTAGTCTCCAAAACGCTTTTTAGAAAGAGTCCCATTCATCACCACAT
TGATAAAACCATGATTAATTTTCCTGCAGCTATGCCTTTCTCTAAAGCTGAAATTGATCC
AATGGCTATGATGTTGTCCAAATATTCATTACCTATTGTGGAGGAACCAGAGGATTACAA
GGAAGCTTCAGTTTTTTATCAAAACAAAATAGTAGGCAAGACTCAGCTAGTTGATGACTT
TTTAGATCTTGATATGGCTATTACAGGGGCTCCAGGCATTGATGCTATCAATATGGATTC
ATCTCCTGGGTTTCCTTATGTTCAAGAAAAATTGACCAAAAGAGATTTAATTTGGTTGGA
TGAAAATGGTTTGCTGTTAGGAGTTCACCCAAGATTGGCCCAGAGAATTTTATTTAATAC
TGTCATGATGGAAAATTGTTCTGACTTAGATGTTGTTTTTACAACTTGTCCAAAAGATGA
ATTGAGACCATTAGAGAAAGTTTTGGAATCAAAAACAAGAGCCATTGATGCTTGTCCTTT
GGATTATACAATTCTATGTCGAATGTATTGGGGTCCAGCTATCAGTTATTTCCATTTGAA
TCCAGGGTTTCACACAGGTGTTGCTATTGGCATAGATCCTGATAGACAGTGGGATGAATT
ATTTAAAACAATGATAAGATTTGGAGATGTTGGTCTTGATTTAGATTTCTCTGCTTTTGA
TGCCAGTCTTAGTCCATTTATGATTAGGGAAGCAGGTAGAATCATGAGTGAATTATCTGG
AACACCATCTCATTTTGGAACAGCTCTTATCAATACTATCATTTATTCTAAACATCTGCT
GTACAACTGTTGTTATCATGTTTGTGGTTCAATGCCTTCTGGGTCTCCTTGCACAGCTTT
GTTGAATTCAATTATTAATAATATTAATCTGTATTATGTGTTTTCTAAAATATTTGGAAA
GTCTCCAGTTTTCTTTTGTCAAGCTTTGAGGATCCTTTGTTACGGAGATGATGTTTTGAT
AGTTTTTTCCAGAGATGTTCAAATTGACAATCTTGACTTGATTGGACAGAAAATTGTAGA
TGAGTTCAAAAAACTTGGCATGACAGCCACCTCAGCTGATAAAAATGTGCCTCAACTGAA
GCCAGTTTCAGAATTGACTTTTCTCAAAAGATCTTTCAATTTGGTGGAGGATAGAATTAG
ACCTGCAATTTCAGAAAAGACAATTTGGTCTTTGATGGCTTGGCAGAGAAGTAACGCTGA
GTTTGAGCAGAATTTAGAAAATGCTCAGTGGTTTGCTTTTATGCATGGCTATGAGTTCTA
TCAGAAATTTTATTATTTTGTTCAGTCCTGTTTGGAGAAAGAGATGATAGAATATAGACT
TAAATCTTATGATTGGTGGAGAAT
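As a consistency check between the gene sequence and the amino acid sequence, the opening of the coding region can be translated with the standard genetic code. The sketch below is a minimal illustration, not a general gene finder: it assumes the first `ATG` in the fragment is the start codon, which holds for the first line of this record but not for sequences in general. `translate_from_first_atg` is a hypothetical helper.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_from_first_atg(dna):
    """Translate from the first ATG until a stop codon or the end of the input."""
    dna = dna.upper()
    start = dna.find("ATG")
    if start == -1:
        return ""
    protein = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for i in range(start, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence in this record.
first_line = "ACTCAGGGGCATTTAGGTTTTTCCTCATTCTTAAACAATAATGAATATGTCCAAACAAGG"
print(translate_from_first_atg(first_line))  # MNMSKQ
```

The result matches the first six residues of the amino acid sequence (`MNMSKQ`); the bases before the `ATG` belong to the untranslated leader.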
Chromosome Location
Not Available
Locus
Not Available
External Identifiers
Resource: Link
UniProtKB ID: P06441
UniProtKB Entry Name: POLG_HAVLA
GenBank Protein ID: 329597
GenBank Gene ID: K02990
General References
  1. Najarian R, Caput D, Gee W, Potter SJ, Renard A, Merryweather J, Van Nest G, Dina D: Primary structure and gene organization of human hepatitis A virus. Proc Natl Acad Sci U S A. 1985 May;82(9):2627-31.

Drug Relations

DrugBank ID: DB04634
Name: N-BENZYLOXYCARBONYL-L-SERINE-BETALACTONE
Drug group: experimental
Pharmacological action?: unknown
Actions: (none listed)