Search Database

Select Protein (1 found)

UniProt ID Gene Symbol Protein Name Organism Length Action
P02462 COL4A1 Collagen alpha-1(IV) chain [Cleaved into: … Homo sapiens (Human) 1669 aa

Protein Details: P02462 (COL4A1)

Protein Information
AccessionP02462
Protein NamesCollagen alpha-1(IV) chain [Cleaved into: Arresten]
Gene SymbolCOL4A1
OrganismHomo sapiens (Human)
Length1669 aa
IsoformsNo isoforms
Related PMIDs No related PMIDs
Database SourcesCysModDBdbPTMSwissPalm
These studies detected palmitoylation of this protein in the samples.
Protein Sequence
Types:
Experimental Database High Prediction Non-palmitylated Cys
1-501MGPRLSVWLL11LLPAALLLHE21EHSRAAAKGG31CAGSGCGKCD41CHGVKGQKGE
51-10051RGLPGLQGVI61GFPGMQGPEG71PQGPPGQKGD81TGEPGLPGTK91GTRGPPGASG
101-150101YPGNPGLPGI111PGQDGPPGPP121GIPGCNGTKG131ERGPLGPPGL141PGFAGNPGPP
151-200151GLPGMKGDPG161EILGHVPGML171LKGERGFPGI181PGTPGPPGLP191GLQGPVGPPG
201-250201FTGPPGPPGP211PGPPGEKGQM221GLSFQGPKGD231KGDQGVSGPP241GVPGQAQVQE
251-300251KGDFATKGEK261GQKGEPGFQG271MPGVGEKGEP281GKPGPRGKPG291KDGDKGEKGS
301-350301PGFPGEPGYP311GLIGRQGPQG321EKGEAGPPGP331PGIVIGTGPL341GEKGERGYPG
351-400351TPGPRGEPGP361KGFPGLPGQP371GPPGLPVPGQ381AGAPGFPGER391GEKGDRGFPG
401-450401TSLPGPSGRD411GLPGPPGSPG421PPGQPGYTNG431IVECQPGPPG441DQGPPGIPGQ
451-500451PGFIGEIGEK461GQKGESCLIC471DIDGYRGPPG481PQGPPGEIGF491PGQPGAKGDR
501-550501GLPGRDGVAG511VPGPQGTPGL521IGQPGAKGEP531GEFYFDLRLK541GDKGDPGFPG
551-600551QPGMPGRAGS561PGRDGHPGLP571GPKGSPGSVG581LKGERGPPGG591VGFPGSRGDT
601-650601GPPGPPGYGP611AGPIGDKGQA621GFPGGPGSPG631LPGPKGEPGK641IVPLPGPPGA
651-700651EGLPGSPGFP661GPQGDRGFPG671TPGRPGLPGE681KGAVGQPGIG691FPGPPGPKGV
701-750701DGLPGDMGPP711GTPGRPGFNG721LPGNPGVQGQ731KGEPGVGLPG741LKGLPGLPGI
751-800751PGTPGEKGSI761GVPGVPGEHG771AIGPPGLQGI781RGEPGPPGLP791GSVGSPGVPG
801-850801IGPPGARGPP811GGQGPPGLSG821PPGIKGEKGF831PGFPGLDMPG841PKGDKGAQGL
851-900851PGITGQSGLP861GLPGQQGAPG871IPGFPGSKGE881MGVMGTPGQP891GSPGPVGAPG
901-950901LPGEKGDHGF911PGSSGPRGDP921GLKGDKGDVG931LPGKPGSMDK941VDMGSMKGQK
951-1000951GDQGEKGQIG961PIGEKGSRGD971PGTPGVPGKD981GQAGQPGQPG991PKGDPGISGT
1001-10501001PGAPGLPGPK1011GSVGGMGLPG1021TPGEKGVPGI1031PGPQGSPGLP1041GDKGAKGEKG
1051-11001051QAGPPGIGIP1061GLRGEKGDQG1071IAGFPGSPGE1081KGEKGSIGIP1091GMPGSPGLKG
1101-11501101SPGSVGYPGS1111PGLPGEKGDK1121GLPGLDGIPG1131VKGEAGLPGT1141PGPTGPAGQK
1151-12001151GEPGSDGIPG1161SAGEKGEPGL1171PGRGFPGFPG1181AKGDKGSKGE1191VGFPGLAGSP
1201-12501201GIPGSKGEQG1211FMGPPGPQGQ1221PGLPGSPGHA1231TEGPKGDRGP1241QGQPGLPGLP
1251-13001251GPMGPPGLPG1261IDGVKGDKGN1271PGWPGAPGVP1281GPKGDPGFQG1291MPGIGGSPGI
1301-13501301TGSKGDMGPP1311GVPGFQGPKG1321LPGLQGIKGD1331QGDQGVPGAK1341GLPGPPGPPG
1351-14001351PYDIIKGEPG1361LPGPEGPPGL1371KGLQGLPGPK1381GQQGVTGLVG1391IPGPPGIPGF
1401-14501401DGAPGQKGEM1411GPAGPTGPRG1421FPGPPGPDGL1431PGSMGPPGTP1441SVDHGFLVTR
1451-15001451HSQTIDDPQC1461PSGTKILYHG1471YSLLYVQGNE1481RAHGQDLGTA1491GSCLRKFSTM
1501-15501501PFLFCNINNV1511CNFASRNDYS1521YWLSTPEPMP1531MSMAPITGEN1541IRPFISRCAV
1551-16001551CEAPAMVMAV1561HSQTIQIPPC1571PSGWSSLWIG1581YSFVMHTSAG1591AEGSGQALAS
1601-16501601PGSCLEEFRS1611APFIECHGRG1621TCNYYANAYS1631FWLATIERSE1641MFKKPTPSTL
1651-16691651KAGELRTHVS1661RCQVCMRRT
Palmitoylation Sites Details
Position Database Domains Literature (PMID/Cell-Tissue) Mass(PMID/Cell-Tissue) Prediction Scores
31 - - -
GPS-Palm: 0.82
Deep-Palm: 0.81
36 - - -
GPS-Palm: 0.70
Deep-Palm: 0.44
39 - - -
Deep-Palm: 0.56
41 - - -
GPS-Palm: 0.65
Deep-Palm: 0.45
125 - - -
GPS-Palm: 0.67
Deep-Palm: 0.29
434 - - -
Deep-Palm: 0.18
467 - - -
GPS-Palm: 0.68
Deep-Palm: 0.24
470 - - -
GPS-Palm: 0.79
Deep-Palm: 0.33
1460 SWISSPALM DBPTM CYSMODDB C-terminal tandem repeated domain in type 4 procollagen - -
Deep-Palm: 0.14
1493 SWISSPALM DBPTM CYSMODDB C-terminal tandem repeated domain in type 4 procollagen - -
GPS-Palm: 0.84
Deep-Palm: 0.82
1505 - -
Jurkat T (32651440)
Deep-Palm: 0.14
1511 - - -
Deep-Palm: 0.05
1548 - - -
Deep-Palm: 0.42
1551 - - -
Deep-Palm: 0.30
1570 - - -
Deep-Palm: 0.06
1604 - - -
Deep-Palm: 0.96
1616 - - -
Deep-Palm: 0.29
1622 - - -
Deep-Palm: 0.07
1662 - - -
GPS-Palm: 0.86
Deep-Palm: 0.03
1665 - - -
GPS-Palm: 0.94
Deep-Palm: 0.02
Score Interpretation:
GPS-Palm: Thresholds - High (≥0.8920), Medium (≥0.7766), Low (≥0.6484), Very Low (<0.6484)
Deep-Palm: Higher score indicates higher probability of palmitoylation (High ≥0.9)
Conservation score for cysteine
PhyloP for Cysteine
PhastCons Conservation Scores for Cysteine
TCGA Cysteine Mutation Information

Note: Mutations indicate amino acid changes that may create potential palmitoylation sites.

Position Amino Acid Change Frequency Type Function Cancer Type
359 G → C 0.005618 SNP Missense Mutation PAAD
385 G → C 0.002033 SNP Missense Mutation LUSC
512 P → _G513insYNDFCF*SF 0.002294 INS Nonsense Mutation OV
522 G → C 0.001764 SNP Missense Mutation LUAD
547 G → C 0.001764 SNP Missense Mutation LUAD
597 R → C 0.002427 SNP Missense Mutation BLCA
621 G → C 0.002506 SNP Missense Mutation COAD
633 G → C 0.002506 SNP Missense Mutation COAD
715 R → C 0.002141 SNP Missense Mutation SKCM
933 G → C 0.001764 SNP Missense Mutation LUAD
936 G → C 0.002288 SNP Missense Mutation STAD
948 G → C 0.002033 SNP Missense Mutation LUSC
1002 G → C 0.001887 SNP Missense Mutation UCEC
1086 S → C 0.002545 SNP Missense Mutation GBM
1272 G → C 0.001764 SNP Missense Mutation LUAD
1315 F → C 0.001887 SNP Missense Mutation UCEC
1344 G → C 0.001764 SNP Missense Mutation LUAD
1402 G → C 0.002033 SNP Missense Mutation LUSC
1423 G → C 0.001764 SNP Missense Mutation LUAD
1495 R → C 0.002141 SNP Missense Mutation SKCM
1522 W → C 0.001764 SNP Missense Mutation LUAD
1604 C → Y 0.002545 SNP Missense Mutation GBM
1604 C → S 0.002033 SNP Missense Mutation LUSC
1618 G → C 0.001764 SNP Missense Mutation LUAD
1656 R → C 0.001887 SNP Missense Mutation UCEC
1616* C → ? 0.002294 SNP Nonsense Mutation OV