ProGlyProt IDBC147
Organism Information
Organism NameHaemophilus influenzae
DomainBacteria
ClassificationFamily: Pasteurellaceae
Order: Pasteurellales
Class: Gammaproteobacteria
Division or phylum: "Proteobacteria"
Taxonomic ID (NCBI)727
Genome Sequence (s)
EMBLU08876
Gene Information
Gene Namehmw1
NCBI Gene ID
Protein Information
Protein NameHMW1 (Adhesin)
UniProtKB/SwissProt IDQ48031
NCBI RefSeq
EMBL-CDSAAA20527.1
UniProtKB Sequence>tr|Q48031|Q48031_HAEIN Adhesin OS=Haemophilus influenzae GN=hmw1A PE=1 SV=1 MNKIYRLKFSKRLNALVAVSELARGCDHSTEKGSEKPARMKVRHLALKPLSAMLLSLGVT SIPQSVLASGLQGMDVVHGTATMQVDGNKTIIRNSVDAIINWKQFNIDQNEMVQFLQENN NSAVFNRVTSNQISQLKGILDSNGQVFLINPNGITIGKDAIINTNGFTASTLDISNENIK ARNFTFEQTKDKALAEIVNHGLITVGKDGSVNLIGGKVKNEGVISVNGGSISLLAGQKIT ISDIINPTITYSIAAPENEAVNLGDIFAKGGNINVRAATIRNQGKLSADSVSKDKSGNIV LSAKEGEAEIGGVISAQNQQAKGGKLMITGDKVTLKTGAVIDLSGKEGGETYLGGDERGE GKNGIQLAKKTSLEKGSTINVSGKEKGGRAIVWGDIALIDGNINAQGSGDIAKTGGFVET SGHDLFIKDNAIVDAKEWLLDPDNVSINAETAGRSNTSEDDEYTGSGNSASTPKRNKEKT TLTNTTLESILKKGTFVNITANQRIYVNSSINLSNGSLTLWSEGRSGGGVEINNDITTGD DTRGANLTIYSGGWVDVHKNISLGAQGNINITAKQDIAFEKGSNQVITGQGTITSGNQKG FRFNNVSLNGTGSGLQFTTKRTNKYAITNKFEGTLNISGKVNISMVLPKNESGYDKFKGR TYWNLTSLNVSESGEFNLTIDSRGSDSAGTLTQPYNLNGISFNKDTTFNVERNARVNFDI KAPIGINKYSSLNYASFNGNISVSGGGSVDFTLLASSSNVQTPGVVINSKYFNVSTGSSL RFKTSGSTKTGFSIEKDLTLNATGGNITLLQVEGTDGMIGKGIVAKKNITFEGGNITFGS RKAVTEIEGNVTINNNANVTLIGSDFDNHQKPLTIKKDVIINSGNLTAGGNIVNIAGNLT VESNANFKAITNFTFNVGGLFDNKGNSNISIAKGGARFKDIDNSKNLSITTNSSSTYRTI ISGNITNKNGDLNITNEGSDTEMQIGGDVSQKEGNLTISSDKINITKQITIKAGVDGENS DSDATNNANLTIKTKELKLTQDLNISGFNKAEITAKDGSDLTIGNTNSADGTNAKKVTFN QVKDSKISADGHKVTLHSKVETSGSNNNTEDSSDNNAGLTIDAKNVTVNNNITSHKAVSI SATSGEITTKTGTTINATTGNVEITAQTGSILGGIESSSGSVTLTATEGALAVSNISGNT VTVTANSGALTTLAGSTIKGTESVTTSSQSGDIGGTISGGTVEVKATESLTTQSNSKIKA TTGEANVTSATGTIGGTISGNTVNVTANAGDLTVGNGAEINATEGAATLTTSSGKLTTEA SSHITSAKGQVNLSAQDGSVAGSINAANVTLNTTGTLTTVKGSNINATSGTLVINAKDAE LNGAALGNHTVVNATNANGSGSVIATTSSRVNITGDLITINGLNIISKNGINTVLLKGVK IDVKYIQPGIASVDEVIEAKRILEKVKDLSDEEREALAKLGVSAVRFIEPNNTITVDTQN EFATRPLSRIVISEGRACFSNSDGATVCVNIADNGR
Sequence length1536 AA
Subcellular LocationSurface
FunctionHigh-molecular-weight protein (virulence exoprotein) that is secreted by the bacterial two-partner secretion pathway and mediates adherence to respiratory epithelium, an essential early step in pathogenesis.
Protein Structure
PDB ID2ODL
Glycosylation Status
Glycosylation TypeN (Asn) linked
Experimentally Validated Glycosite(s) in Full Length ProteinN444, N484, N498, N546, N560, N570, N605, N609, N636, N642, N709, N773, N801, N806, N828, N835, N912, N928, N946, N952, N964, N973, N995, N1004, N1029, N1044, N1131, N1156, N1348, N1352, N1366
Experimentally Validated Glycosite(s) in Mature ProteinN444, N484, N498, N546, N560, N570, N605, N609, N636, N642, N709, N773, N801, N806, N828, N835, N912, N928, N946, N952, N964, N973, N995, N1004, N1029, N1044, N1131, N1156, N1348, N1352, N1366
Glycosite(s) Annotated Protein Sequence>tr|Q48031|Q48031_HAEIN Adhesin OS=Haemophilus influenzae GN=hmw1A PE=1 SV=1 MNKIYRLKFSKRLNALVAVSELARGCDHSTEKGSEKPARMKVRHLALKPLSAMLLSLGVT SIPQSVLASGLQGMDVVHGTATMQVDGNKTIIRNSVDAIINWKQFNIDQNEMVQFLQENN NSAVFNRVTSNQISQLKGILDSNGQVFLINPNGITIGKDAIINTNGFTASTLDISNENIK ARNFTFEQTKDKALAEIVNHGLITVGKDGSVNLIGGKVKNEGVISVNGGSISLLAGQKIT ISDIINPTITYSIAAPENEAVNLGDIFAKGGNINVRAATIRNQGKLSADSVSKDKSGNIV LSAKEGEAEIGGVISAQNQQAKGGKLMITGDKVTLKTGAVIDLSGKEGGETYLGGDERGE GKNGIQLAKKTSLEKGSTINVSGKEKGGRAIVWGDIALIDGNINAQGSGDIAKTGGFVET SGHDLFIKDNAIVDAKEWLLDPDN*(444)VSINAETAGRSNTSEDDEYTGSGNSASTPKRNKEKT TLTN*(484)TTLESILKKGTFVN*(498)ITANQRIYVNSSINLSNGSLTLWSEGRSGGGVEINNDITTGD DTRGAN*(546)LTIYSGGWVDVHKN*(560)ISLGAQGNIN*(570)ITAKQDIAFEKGSNQVITGQGTITSGNQKG FRFNN*(605)VSLN*(609)GTGSGLQFTTKRTNKYAITNKFEGTLN*(636) ISGKVN*(642)ISMVLPKNESGYDKFKGR TYWNLTSLNVSESGEFNLTIDSRGSDSAGTLTQPYNLNGISFNKDTTFN*(709)VERNARVNFDI KAPIGINKYSSLNYASFNGNISVSGGGSVDFTLLASSSNVQTPGVVINSKYFN*(773)VSTGSSL RFKTSGSTKTGFSIEKDLTLN*(801)ATGGN*(806)ITLLQVEGTDGMIGKGIVAKKN*(828)ITFEGGN*(835)ITFGS RKAVTEIEGNVTINNNANVTLIGSDFDNHQKPLTIKKDVIINSGNLTAGGNIVNIAGNLT VESNANFKAITN*(912)FTFNVGGLFDNKGNSN*(928)ISIAKGGARFKDIDNSKN*(946)LSITTN*(952)SSSTYRTI ISGN*(964)ITNKNGDLN*(973)ITNEGSDTEMQIGGDVSQKEGN*(995)LTISSDKIN*(1004)ITKQITIKAGVDGENS DSDATNNAN*(1029)LTIKTKELKLTQDLN*(1044)ISGFNKAEITAKDGSDLTIGNTNSADGTNAKKVTFN QVKDSKISADGHKVTLHSKVETSGSNNNTEDSSDNNAGLTIDAKNVTVNNN*(1131)ITSHKAVSI SATSGEITTKTGTTIN*(1156)ATTGNVEITAQTGSILGGIESSSGSVTLTATEGALAVSNISGNT VTVTANSGALTTLAGSTIKGTESVTTSSQSGDIGGTISGGTVEVKATESLTTQSNSKIKA TTGEANVTSATGTIGGTISGNTVNVTANAGDLTVGNGAEINATEGAATLTTSSGKLTTEA SSHITSAKGQVNLSAQDGSVAGSINAAN*(1348)VTLN*(1352)TTGTLTTVKGSNIN*(1366)ATSGTLVINAKDAE LNGAALGNHTVVNATNANGSGSVIATTSSRVNITGDLITINGLNIISKNGINTVLLKGVK IDVKYIQPGIASVDEVIEAKRILEKVKDLSDEEREALAKLGVSAVRFIEPNNTITVDTQN EFATRPLSRIVISEGRACFSNSDGATVCVNIADNGR
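The annotated sequence marks each glycosylated asparagine with a trailing `*(position)` tag, for example `N*(444)`. A minimal Python sketch of how such a string might be parsed and sanity-checked follows. The function name `parse_annotated` and its `start` parameter are hypothetical helpers introduced for illustration, not part of any ProGlyProt tooling, and the input is a short fragment copied from the entry rather than the full 1536-residue sequence.

```python
import re

# Marker appended to a glycosylated Asn in the annotated sequence, e.g. "N*(444)".
MARKER = re.compile(r"\*\((\d+)\)")


def parse_annotated(annotated, start=1):
    """Strip glycosite markers from an annotated sequence.

    Returns (plain_sequence, sites), where each site is a tuple
    (labelled_position, residue, counted_position). `start` is the
    position of the first residue in `annotated` (hypothetical parameter,
    useful when only a fragment is supplied).
    """
    text = "".join(annotated.split())  # drop spaces and line breaks
    plain = []
    sites = []
    i = 0
    while i < len(text):
        match = MARKER.match(text, i)
        if match:
            if not plain:
                raise ValueError("marker found before any residue")
            labelled = int(match.group(1))
            counted = start + len(plain) - 1  # position of the preceding residue
            sites.append((labelled, plain[-1], counted))
            i = match.end()
        else:
            plain.append(text[i])
            i += 1
    return "".join(plain), sites


# Fragment copied from the annotated sequence above; it begins at residue 438.
fragment = ("WLLDPDN*(444)VSINAETAGRSNTSEDDEYTGSGNSASTPKRNKEKT "
            "TLTN*(484)TTLESILKKGTFVN*(498)ITANQ")
plain, sites = parse_annotated(fragment, start=438)
for labelled, residue, counted in sites:
    status = "ok" if residue == "N" and labelled == counted else "MISMATCH"
    print(f"{residue}{labelled}: counted position {counted} [{status}]")
```

Comparing the labelled position with the counted one is a cheap way to catch markers that drifted during copy-and-paste. Passing the full annotated sequence with the default `start=1` would run the same check on all 31 sites.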
Sequence Around Glycosites (21 AA)DAKEWLLDPDNVSINAETAGR
KRNKEKTTLTNTTLESILKKG
ESILKKGTFVNITANQRIYVN
ITTGDDTRGANLTIYSGGWVD
YSGGWVDVHKNISLGAQGNIN
NISLGAQGNINITAKQDIAFE
SGNQKGFRFNNVSLNGTGSGL
KGFRFNNVSLNGTGSGLQFTT
AITNKFEGTLNISGKVNISMV
EGTLNISGKVNISMVLPKNES
GISFNKDTTFNVERNARVNFD
PGVVINSKYFNVSTGSSLRFK
GFSIEKDLTLNATGGNITLLQ
KDLTLNATGGNITLLQVEGTD
MIGKGIVAKKNITFEGGNITF
AKKNITFEGGNITFGSRKAVT
ESNANFKAITNFTFNVGGLFD
GGLFDNKGNSNISIAKGGARF
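Each entry in the list above is a 21-residue window: the modified asparagine plus ten residues on either side. The listing covers the first 18 sites (N444 through N928). As a rough sketch, the windows for any site could be regenerated from the plain sequence as shown below. `glycosite_window` and its parameters are hypothetical names chosen for this example, and the input is a single 60-residue stretch (residues 421 to 480) copied from the UniProtKB sequence.

```python
def glycosite_window(seq, site, flank=10, start=1):
    """Return the residues within `flank` positions of `site`.

    `site` uses the entry's 1-based numbering; `start` is the position of
    seq[0]. Windows are clipped at the ends of `seq`, so a site near a
    terminus yields fewer than 2 * flank + 1 residues.
    """
    idx = site - start  # convert to a 0-based index into seq
    if not 0 <= idx < len(seq):
        raise ValueError(f"site {site} lies outside the supplied sequence")
    lo = max(0, idx - flank)
    hi = min(len(seq), idx + flank + 1)
    return seq[lo:hi]


# Residues 421-480 of HMW1, copied from the sequence field of this entry.
row = "SGHDLFIKDNAIVDAKEWLLDPDNVSINAETAGRSNTSEDDEYTGSGNSASTPKRNKEKT"
window = glycosite_window(row, 444, start=421)
print(window)  # DAKEWLLDPDNVSINAETAGR
```

The printed window matches the first line of the list, with the glycosylated Asn at the centre (index 10 of the 21 residues).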
Glycosite Sequence Logo
Technique(s) used for Glycosylation DetectionDigoxigenin (DIG)-glycan detection
Technique(s) used for Glycosylated Residue(s) DetectionMS-MS (tandem mass spectrometry)
Protein Glycosylation- ImplicationGlycosylation protects HMW1 against premature degradation during the process of secretion and facilitates HMW1 tethering to the bacterial surface, a prerequisite for HMW1-mediated adherence.
Glycan Information
Glycan AnnotationThe unusual carbohydrate modification includes glucose, galactose, and possibly mannose, and accounts for 7–8 kDa of the molecular mass. The 31 modification sites carry 47 hexose units (162 Da each), indicating that sites are modified with either a hexose or a dihexose. HMW1C can transfer glucose and galactose to HMW1 and can also generate hexose-hexose bonds.
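The figures in the glycan annotation can be cross-checked with simple arithmetic. The sketch below assumes, as the annotation suggests but does not state outright, that every site carries exactly one or two hexose units; under that assumption the counts of mono- and dihexose sites follow directly. The function name `split_mono_di` is hypothetical.

```python
HEXOSE_DA = 162  # mass added per hexose unit, as quoted in the annotation


def split_mono_di(n_sites, n_hexoses):
    """Return (mono_sites, di_sites).

    Assumption (not stated outright in the entry): each site carries
    exactly one or two hexose units.
    """
    if not n_sites <= n_hexoses <= 2 * n_sites:
        raise ValueError("counts are incompatible with 1-2 hexoses per site")
    di = n_hexoses - n_sites  # every hexose beyond one per site implies a dihexose
    mono = n_sites - di
    return mono, di


n_sites, n_hexoses = 31, 47
added_mass_da = n_hexoses * HEXOSE_DA
mono, di = split_mono_di(n_sites, n_hexoses)

print(f"added mass: {added_mass_da} Da ({added_mass_da / 1000:.2f} kDa)")
print(f"monohexose sites: {mono}, dihexose sites: {di}")
```

With 47 hexoses at 162 Da each the added mass comes to 7614 Da, which falls inside the 7–8 kDa range quoted above. Under the one-or-two assumption, the 31 sites split into 15 monohexose and 16 dihexose sites.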
Technique(s) used for Glycan Identification
Protein Glycosylation linked (PGL) gene(s)
OST Gene Namehmw1C (R2846_0712)
OST NCBI Gene ID
OST GenBank Gene Sequence
OST Protein NameHMW1C (ApHMW1C Actinobacillus pleuropneumoniae)
OST UniProtKB/SwissProt IDE3GU42
OST NCBI RefSeq
OST EMBL-CDSADO96126.1
OST UniProtKB Sequence>tr|E3GU42|E3GU42_HAEI2 Adhesin glycotransferase protein Hmw1C OS=Haemophilus influenzae (strain R2846 / 12) GN=hmw1C PE=4 SV=1 MTKENLQSVPQNTTASLVESNNDQTSLQILKQPPKPNLLRLEQHVAKKDYELACRELMAI LEKMDANFGGVHDIEFDAPAQLAYLPEKLLIHFATRLANAITTLFSDPELAISEEGALKM ISLQRWLTLIFASSPYVNADHILNKYNINPDSEGGFHLATDNSSIAKFCIFYLPESNVNM SLDALWAGNQQLCASLCFALQSSRFIGTASAFHKRAVVLQWFPKKLAEIANLDELPANIL HDVYMHCSYDLAKNKHDVKRPLNELVRKHILTQGWQDRYLYTLGKKDGKPVMMVLLEHFN SGHSIYRTHSTSMIAAREKFYLVGLGHEGVDNIGREVFDEFFEISSNNIMERLFFIRKQC ETFQPAVFYMPSIGMDITTIFVSNTRLAPIQAVALGHPATTHSEFIDYVIVEDDYVGSED CFSETLLRLPKDALPYVPSALAPQKVDYVLRENPEVVNIGIAATTMKLNPEFLLTLQEIR DKAKVKIHFHFALGQSTGLTHPYVKWFIESYLGDDATAHPHAPYHDYLAILRDCDMLLNP FPFGNTNGIIDMVTLGLVGVCKTGDEVHEHIDEGLFKRLGLPEWLIADTRETYIECALRL AENHQERLELRRYIIENNGLQKLFTGDPRPLGKILLKKTNEWKRKHLSKK
OST EC Number (BRENDA)
OST Genome Context
Characterized Accessory Gene(s)HMW1C is a novel glycosyltransferase that forms both hexose-hexose and hexose-Asn bonds.
PGL Additional LinksCAZy
Literatures
Reference(s)1) Choi, K.J., Grass, S., Paek, S., St Geme, J.W., 3rd and Yeo, H.J. (2010) The Actinobacillus pleuropneumoniae HMW1C-like glycosyltransferase mediates N-linked glycosylation of the Haemophilus influenzae HMW1 adhesin. PLoS One, 5, e15888. [PubMed: 21209858]
2) Grass, S., Lichti, C.F., Townsend, R.R., Gross, J. and St Geme, J.W., 3rd. (2010) The Haemophilus influenzae HMW1C protein is a glycosyltransferase that transfers hexose residues to asparagine sites in the HMW1 adhesin. PLoS Pathog,
Additional Comments
Year of Identification2003
Year of Validation2008