CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-011486
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Trichohyalin 
Protein Synonyms/Alias
  
Gene Name
 TCHH 
Gene Synonyms/Alias
 THH; THL; TRHY 
Created Date
 July 27, 2013 
Organism
 Homo sapiens (Human) 
NCBI Taxa ID
 9606 
Lysine Modification
Position
Peptide
Type
References
551ERREQLLKREEEKRLubiquitination[1]
Reference
 [1] Refined preparation and use of anti-diglycine remnant (K-ε-GG) antibody enables routine quantification of 10,000s of ubiquitination sites in single proteomics experiments.
 Udeshi ND, Svinkina T, Mertins P, Kuhn E, Mani DR, Qiao JW, Carr SA.
 Mol Cell Proteomics. 2013 Mar;12(3):825-31. [PMID: 23266961
Functional Description
 Intermediate filament-associated protein that associates in regular arrays with keratin intermediate filaments (KIF) of the inner root sheath cells of the hair follicle and the granular layer of the epidermis. It later becomes cross-linked to KIF by isodipeptide bonds. It may serve as scaffold protein, together with involucrin, in the organization of the cell envelope or even anchor the cell envelope to the KIF network. It may be involved in its own calcium-dependent postsynthetic processing during terminal differentiation. 
Sequence Annotation
 DOMAIN 23 48 EF-hand 1.
 DOMAIN 49 84 EF-hand 2.
 REPEAT 314 326 1-1; approximate.
 REPEAT 327 339 1-2; approximate.
 REPEAT 340 351 1-3; approximate.
 REPEAT 352 364 1-4.
 REPEAT 365 377 1-5.
 REPEAT 378 383 2-1.
 REPEAT 384 389 2-2.
 REPEAT 390 395 2-3.
 REPEAT 396 401 2-4.
 REPEAT 402 407 2-5.
 REPEAT 408 413 2-6.
 REPEAT 414 419 2-7.
 REPEAT 420 425 2-8.
 REPEAT 906 935 4-1.
 REPEAT 936 965 4-2.
 REPEAT 966 995 4-3.
 REPEAT 996 1025 4-4.
 REPEAT 1026 1055 4-5.
 REPEAT 1056 1085 4-6.
 REPEAT 1086 1115 4-7.
 REPEAT 1116 1145 4-8.
 REPEAT 1146 1175 4-9.
 REPEAT 1176 1204 4-10.
 REGION 1 91 S-100-like.
 REGION 314 377 5 X 13 AA tandem repeats of R-R-E-Q-E-E-
 REGION 378 425 8 X 6 AA tandem repeats of R-R-E-Q-Q-L.
 REGION 425 683 9 X 28 AA approximate tandem repeats.
 REGION 906 1204 10 X 30 AA tandem repeats.
 REGION 1292 1894 23 X 26 AA approximate tandem repeats.  
Keyword
 Calcium; Citrullination; Complete proteome; Keratinization; Metal-binding; Polymorphism; Reference proteome; Repeat. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1943 AA 
Protein Sequence
MSPLLRSICD ITEIFNQYVS HDCDGAALTK KDLKNLLERE FGAVLRRPHD PKTVDLILEL 60
LDLDSNGRVD FNEFLLFIFK VAQACYYALG QATGLDEEKR ARCDGKESLL QDRRQEEDQR 120
RFEPRDRQLE EEPGQRRRQK RQEQERELAE GEEQSEKQER LEQRDRQRRD EELWRQRQEW 180
QEREERRAEE EQLQSCKGHE TEEFPDEEQL RRRELLELRR KGREEKQQQR RERQDRVFQE 240
EEEKEWRKRE TVLRKEEEKL QEEEPQRQRE LQEEEEQLRK LERQELRRER QEEEQQQQRL 300
RREQQLRRKQ EEERREQQEE RREQQERREQ QEERREQQLR REQEERREQQ LRREQEEERR 360
EQQLRREQEE ERREQQLRRE QQLRREQQLR REQQLRREQQ LRREQQLRRE QQLRREQQLR 420
REQQLRREQE EERHEQKHEQ ERREQRLKRE QEERRDWLKR EEETERHEQE RRKQQLKRDQ 480
EEERRERWLK LEEEERREQQ ERREQQLRRE QEERREQRLK RQEEEERLQQ RLRSEQQLRR 540
EQEERREQLL KREEEKRLEQ ERREQRLKRE QEERRDQLLK REEERRQQRL KREQEERLEQ 600
RLKREEVERL EQEERREQRL KREEPEEERR QQLLKSEEQE ERRQQQLRRE QQERREQRLK 660
REEEEERLEQ RLKREHEEER REQELAEEEQ EQARERIKSR IPKWQWQLES EADARQSKVY 720
SRPRKQEGQR RRQEQEEKRR RRESELQWQE EERAHRQQQE EEQRRDFTWQ WQAEEKSERG 780
RQRLSARPPL REQRERQLRA EERQQREQRF LPEEEEKEQR RRQRREREKE LQFLEEEEQL 840
QRRERAQQLQ EEEDGLQEDQ ERRRSQEQRR DQKWRWQLEE ERKRRRHTLY AKPALQEQLR 900
KEQQLLQEEE EELQREEREK RRRQEQERQY REEEQLQQEE EQLLREEREK RRRQERERQY 960
RKDKKLQQKE EQLLGEEPEK RRRQEREKKY REEEELQQEE EQLLREEREK RRRQEWERQY 1020
RKKDELQQEE EQLLREEREK RRLQERERQY REEEELQQEE EQLLGEERET RRRQELERQY 1080
RKEEELQQEE EQLLREEPEK RRRQERERQC REEEELQQEE EQLLREEREK RRRQELERQY 1140
REEEEVQQEE EQLLREEPEK RRRQELERQY REEEELQQEE EQLLREEQEK RRQERERQYR 1200
EEEELQRQKR KQRYRDEDQR SDLKWQWEPE KENAVRDNKV YCKGRENEQF RQLEDSQLRD 1260
RQSQQDLQHL LGEQQERDRE QERRRWQQRD RHFPEEEQLE REEQKEAKRR DRKSQEEKQL 1320
LREEREEKRR RQETDRKFRE EEQLLQEREE QPLRRQERDR KFREEELRHQ EQGRKFLEEE 1380
QRLRRQERER KFLKEEQQLR CQEREQQLRQ DRDRKFREEE QQLSRQERDR KFREEEQQVR 1440
RQERERKFLE EEQQLRQERH RKFREEEQLL QEREEQQLHR QERDRKFLEE EQQLRRQERD 1500
RKFREQELRS QEPERKFLEE EQQLHRQQRQ RKFLQEEQQL RRQERGQQRR QDRDRKFREE 1560
EQLRQEREEQ QLSRQERDRK FRLEEQKVRR QEQERKFMED EQQLRRQEGQ QQLRQERDRK 1620
FREDEQLLQE REEQQLHRQE RDRKFLEEEP QLRRQEREQQ LRHDRDRKFR EEEQLLQEGE 1680
EQQLRRQERD RKFREEEQQL RRQERERKFL QEEQQLRRQE LERKFREEEQ LRQETEQEQL 1740
RRQERYRKIL EEEQLRPERE EQQLRRQERD RKFREEEQLR QEREEQQLRS QESDRKFREE 1800
EQLRQEREEQ QLRPQQRDGK YRWEEEQLQL EEQEQRLRQE RDRQYRAEEQ FATQEKSRRE 1860
EQELWQEEEQ KRRQERERKL REEHIRRQQK EEQRHRQVGE IKSQEGKGHG RLLEPGTHQF 1920
ASVPVRSSPL YEYIQEQRSQ YRP 1943 
Gene Ontology
 GO:0005813; C:centrosome; IDA:HPA.
 GO:0005737; C:cytoplasm; IDA:HPA.
 GO:0045111; C:intermediate filament cytoskeleton; IDA:HPA.
 GO:0005886; C:plasma membrane; IDA:HPA.
 GO:0005509; F:calcium ion binding; TAS:UniProtKB.
 GO:0031424; P:keratinization; IEA:UniProtKB-KW. 
Interpro
 IPR011992; EF-hand-like_dom.
 IPR018247; EF_Hand_1_Ca_BS.
 IPR002048; EF_hand_dom.
 IPR001751; S100/CaBP-9k_CS.
 IPR013787; S100_Ca-bd_sub. 
Pfam
 PF01023; S_100 
SMART
  
PROSITE
 PS00018; EF_HAND_1
 PS50222; EF_HAND_2
 PS00303; S100_CABP 
PRINTS