CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-015476
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Thyroid adenoma-associated protein 
Protein Synonyms/Alias
 Gene inducing thyroid adenomas protein 
Gene Name
 THADA 
Gene Synonyms/Alias
 GITA; KIAA1767 
Created Date
 July 27, 2013 
Organism
 Homo sapiens (Human) 
NCBI Taxa ID
 9606 
Lysine Modification
Position
Peptide
Type
References
136TDLYSYRKVTDNISSubiquitination[1]
454LEWHIKGKYTCLGCLubiquitination[1]
566ESLQYMVKILQTSIDubiquitination[1]
575LQTSIDAKTGQEQSFubiquitination[1]
627LVSDARIKQGLIHQHubiquitination[1, 2]
702ESSQVLYKLEQSKSKubiquitination[1, 3]
709KLEQSKSKREPENELubiquitination[1]
1008NDYFNQAKILKEHDSubiquitination[1, 3]
1011FNQAKILKEHDSFDMubiquitination[1]
1154KCSDPSSKLCATRRSubiquitination[1]
1440TDITVCTKAKLWLAKubiquitination[1]
1578AASGLGEKGVPPLLCubiquitination[1]
1659VALRLASKVISHHMQubiquitination[1]
1897PPAAEFVKTVEFTRLubiquitination[1]
Reference
 [1] Refined preparation and use of anti-diglycine remnant (K-ε-GG) antibody enables routine quantification of 10,000s of ubiquitination sites in single proteomics experiments.
 Udeshi ND, Svinkina T, Mertins P, Kuhn E, Mani DR, Qiao JW, Carr SA.
 Mol Cell Proteomics. 2013 Mar;12(3):825-31. [PMID: 23266961]
 [2] Ubiquitin ligase substrate identification through quantitative proteomics at both the protein and peptide levels.
 Lee KA, Hammerle LP, Andrews PS, Stokes MP, Mustelin T, Silva JC, Black RA, Doedens JR.
 J Biol Chem. 2011 Dec 2;286(48):41530-8. [PMID: 21987572]
 [3] A proteome-wide, quantitative survey of in vivo ubiquitylation sites reveals widespread regulatory roles.
 Wagner SA, Beli P, Weinert BT, Nielsen ML, Cox J, Mann M, Choudhary C.
 Mol Cell Proteomics. 2011 Oct;10(10):M111.013284. [PMID: 21890473
Functional Description
  
Sequence Annotation
 MOD_RES 1024 1024 Phosphoserine.
 MOD_RES 1161 1161 Phosphoserine.  
Keyword
 Alternative splicing; Chromosomal rearrangement; Coiled coil; Complete proteome; Phosphoprotein; Polymorphism; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1953 AA 
Protein Sequence
MGVKKKKEMQ VAALTICHQD LETLKSFADV EGKNLASLLL HCVQLTDGVS QIHYIKQIVP 60
LLEKADKNGM CDPTIQSCLD ILAGIYLSLS LKNPLKKVLA SSLNSLPDFF LPEAMHRFTS 120
RLQEELNTTD LYSYRKVTDN ISSCMENFNL GRASVNNLLK NVLHFLQKSL IEILEENRKC 180
AGNHIIQTQL MNDLLVGIRV SMMLVQKVQD FQGNLWKTSD SPIWQNMCGL LSIFTKVLSD 240
DDLLQTVQST SGLAIILFIK TMFHPSEKIP HLISSVLLRS VDCTSVPEWF MSSCRSLCCG 300
DISQSAVLFL CQGTLAMLDW QNGSMGRSGE ALLLDTAHVL FTLSSQIKEP TLEMFLSRIL 360
ASWTNSAIQV LESSSPSLTD SLNGNSSIVG RLLEYVYTHW EHPLDALRHQ TKIMFKNLLQ 420
MHRLTVEGAD FVPDPFFVEL TESLLRLEWH IKGKYTCLGC LVECIGVEHI LAIDKTIPSQ 480
ILEVMGDQSL VPYASDLLET MFRNHKSHLK SQTAESSWID QWHETWVSPL LFILCEGNLD 540
QKSYVIDYYL PKLLSYSPES LQYMVKILQT SIDAKTGQEQ SFPSLGSCNS RGALGALMAC 600
LRIARAHGHL QSATDTWENL VSDARIKQGL IHQHCQVRID TLGLLCESNR STEIVSMEEM 660
QWIQFFITYN LNSQSPGVRQ QICSLLKKLF CRIQESSQVL YKLEQSKSKR EPENELTKQH 720
PSVSLQQYKN FMSSICNSLF EALFPGSSYS TRFSALTILG SIAEVFHVPE GRIYTVYQLS 780
HDIDVGRFQT LMECFTSTFE DVKILAFDLL MKLSKTAVHF QDSGKLQGLF QAALELSTST 840
KPYDCVTASY LLNFLIWQDA LPSSLSAYLT QQVACDNGDR PAAVVERNTL MVIKCLMENL 900
EEEVSQAENS LLQAAAAFPM YGRVHCITGA LQKLSLNSLQ LVSEWRPVVE KLLLMSYRLS 960
TVVSPVIQSS SPEGLIPMDT DSESASRLQM ILNEIQPRDT NDYFNQAKIL KEHDSFDMKD 1020
LNASVVNIDT STEIKGKEVK TCDVTAQMVL VCCWRSMKEV ALLLGMLCQL LPMQPVPESS 1080
DGLLTVEQVK EIGDYFKQHL LQSRHRGAFE LAYTGFVKLT EVLNRCPNVS LQKLPEQWLW 1140
SVLEEIKCSD PSSKLCATRR SAGIPFYIQA LLASEPKKGR MDLLKITMKE LISLAGPTDD 1200
IQSTVPQVHA LNILRALFRD TRLGENIIPY VADGAKAAIL GFTSPVWAVR NSSTLLFSAL 1260
ITRIFGVKRA KDEHSKTNRM TGREFFSRFP ELYPFLLKQL ETVANTVDSD MGEPNRHPSM 1320
FLLLLVLERL YASPMDGTSS ALSMGPFVPF IMRCGHSPVY HSREMAARAL VPFVMIDHIP 1380
NTIRTLLSTL PSCTDQCFRQ NHIHGTLLQV FHLLQAYSDS KHGTNSDFQH ELTDITVCTK 1440
AKLWLAKRQN PCLVTRAVYI DILFLLTCCL NRSAKDNQPV LESLGFWEEV RGIISGSELI 1500
TGFPWAFKVP GLPQYLQSLT RLAIAAVWAA AAKSGERETN VPISFSQLLE SAFPEVRSLT 1560
LEALLEKFLA AASGLGEKGV PPLLCNMGEK FLLLAMKENH PECFCKILKI LHCMDPGEWL 1620
PQTEHCVHLT PKEFLIWTMD IASNERSEIQ SVALRLASKV ISHHMQTCVE NRELIAAELK 1680
QWVQLVILSC EDHLPTESRL AVVEVLTSTT PLFLTNPHPI LELQDTLALW KCVLTLLQSE 1740
EQAVRDAATE TVTTAMSQEN TCQSTEFAFC QVDASIALAL ALAVLCDLLQ QWDQLAPGLP 1800
ILLGWLLGES DDLVACVESM HQVEEDYLFE KAEVNFWAET LIFVKYLCKH LFCLLSKSGW 1860
RPPSPEMLCH LQRMVSEQCH LLSQFFRELP PAAEFVKTVE FTRLRIQEER TLACLRLLAF 1920
LEGKEGEDTL VLSVWDSYAE SRQLTLPRTE AAC 1953 
Gene Ontology
  
Interpro
 IPR016024; ARM-type_fold.
 IPR019442; DUF2428_death-receptor-like. 
Pfam
 PF10350; DUF2428 
SMART
  
PROSITE
  
PRINTS