Biology:KIAA0825

From HandWiki
Short description: Protein-coding gene in the species Homo sapiens



KIAA0825 is a protein that in humans is encoded by the gene of the same name, located on chromosome 5 at 5q15. The gene is a possible risk factor for type II diabetes and is associated with high blood glucose levels. It mutates relatively quickly compared with other protein-coding genes; however, one region, known as DUF4495, is highly conserved across the species that have the gene. The protein is predicted to travel between the nucleus and the cytoplasm.

General information

The Isoforms of C5orf36

KIAA0825 is a gene that appears to be a genetic factor increasing the risk of type II diabetes, possibly by raising blood glucose levels.[1] It has also been identified as a possible oncogene.[2] The gene has one common alias, C5orf36. It is about 478 kb long and contains 22 exons. It produces 10 different variants: 9 alternatively spliced and one unspliced. The longest experimentally confirmed mRNA is 7240 bp long and produces a protein 1275 amino acids long.[3] The protein is predicted to weigh about 147.8 kDa. The gene has orthologs in most animals, including Aplysia californica, but is not found outside animals, with the possible exception of Plasmodiophora brassicae.
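
As a rough check on these figures, the coding sequence needed for a 1275-residue protein can be compared with the 7240 bp mRNA. The sketch below assumes three nucleotides per codon plus one stop codon; the function and variable names are illustrative, and the split between the 5' and 3' untranslated regions is not given in the source.

```python
def coding_length_nt(protein_length_aa: int) -> int:
    """Nucleotides in a coding sequence: 3 per residue plus one stop codon."""
    return 3 * (protein_length_aa + 1)


mrna_length_nt = 7240     # longest experimentally confirmed mRNA (from the text)
protein_length_aa = 1275  # length of the encoded protein (from the text)

cds_nt = coding_length_nt(protein_length_aa)
utr_nt = mrna_length_nt - cds_nt  # combined 5' and 3' untranslated sequence

print(f"coding sequence: {cds_nt} nt")
print(f"untranslated:    {utr_nt} nt ({utr_nt / mrna_length_nt:.0%} of the mRNA)")
```

Under these assumptions the coding sequence is 3828 nt, leaving 3412 nt (about 47% of the transcript) as untranslated sequence.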

Protein information

The protein has a predicted molecular weight of 147.8 kDa.[4][5] It does not contain a known nuclear localization signal but does contain a nuclear export signal.[6] The subcellular localization of the protein is predicted to be the nucleus and the cytoplasm.[7] This suggests that the protein might shuttle back and forth across the nuclear membrane.
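
The predicted weight can be sanity-checked against the 1275-residue length reported for the longest isoform. The sketch below uses the common rule of thumb of about 110 Da per amino acid residue, which is an assumption and not the method used by the cited tools; an exact figure would require the actual sequence.

```python
# Rule-of-thumb average mass of one amino acid residue in a protein.
# This value is an assumption for illustration, not taken from the source.
AVERAGE_RESIDUE_MASS_DA = 110.0


def estimate_mass_kda(length_aa: int,
                      residue_mass_da: float = AVERAGE_RESIDUE_MASS_DA) -> float:
    """Approximate protein mass in kDa from its length in residues."""
    return length_aa * residue_mass_da / 1000.0


length_aa = 1275       # protein length (from the text)
predicted_kda = 147.8  # predicted weight (from the text)

estimate_kda = estimate_mass_kda(length_aa)
# Average residue mass implied by the predicted weight.
implied_residue_mass_da = predicted_kda * 1000.0 / length_aa

print(f"rule-of-thumb estimate: {estimate_kda:.1f} kDa")
print(f"implied residue mass:   {implied_residue_mass_da:.1f} Da")
```

The rule of thumb gives roughly 140 kDa, somewhat below the predicted 147.8 kDa, which corresponds to an average of about 116 Da per residue.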

Secondary structure

A 3-D structure prediction created by I-TASSER. Green indicates the conserved DUF4495 region.

Several programs suggest that the secondary structure of the protein consists mainly of helices with only a few beta sheets.[8][9][10][11] Analysis of the protein composition also suggests relatively low levels of glycine,[12] which could indicate a fairly rigid structure relative to other proteins. The tertiary structure is harder to predict, partly because of the size of the protein. The 3-D structure shown is a prediction made by I-TASSER. It is a possible structure with a C-score of -1.06; C-scores typically range from -5 to 2, with higher values indicating greater confidence.[13][14][15] The predicted structure has two main parts, which may interact depending on the state of the protein (e.g. whether or not it is phosphorylated).

Expression

mRNA expression data from the Human Protein Atlas, calculated as transcripts per million (TPM).
This shows the expression levels of C5orf36 in human tissue. It is provided by the Human Protein Atlas.

The mRNA for KIAA0825 is expressed at relatively low levels in comparison to other mRNAs.[16] The protein, however, is expressed at relatively high levels, especially in parts of the brain as well as the adrenal glands and the thyroid.[17] This would suggest that the protein is not readily degraded and remains in the cell for long periods, so that continuous transcription of the DNA into mRNA is unnecessary. No current findings suggest that different isoforms are differentially expressed in different tissues.
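
Transcripts per million (TPM), the unit used for the mRNA data, normalizes read counts first by transcript length and then by the sample total, so that the values in a sample sum to one million. A minimal sketch follows; the gene names, read counts, and lengths are made-up illustrative values, not Human Protein Atlas data.

```python
def tpm(counts: dict, lengths_kb: dict) -> dict:
    """Convert raw read counts to transcripts per million (TPM)."""
    # Step 1: reads per kilobase of transcript, correcting for transcript length.
    rates = {gene: counts[gene] / lengths_kb[gene] for gene in counts}
    # Step 2: scale so that all values in the sample sum to one million.
    total = sum(rates.values())
    return {gene: rate / total * 1_000_000 for gene, rate in rates.items()}


# Hypothetical sample: three genes with invented counts and lengths.
counts = {"geneA": 500, "geneB": 1000, "geneC": 250}
lengths_kb = {"geneA": 2.0, "geneB": 7.24, "geneC": 0.5}

values = tpm(counts, lengths_kb)
for gene, value in values.items():
    print(f"{gene}: {value:.0f} TPM")
```

Because of the length correction, a long transcript with many reads (geneB here) can still have a lower TPM than a short transcript with fewer reads (geneC).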

Regulation

Analysis of the promoter offers some insight into the expression of KIAA0825.[18] One possible regulator is the NeuroD1 transcription factor. This factor is an important regulator of the insulin gene, and a mutation in the NeuroD1 gene can lead to type II diabetes.[19] This could explain why KIAA0825 is expressed at lower levels in patients with type II diabetes. Another possible regulator is myeloid zinc finger 1 (MZF-1), which is tied to myeloid leukemia because it delays retinoic acid-induced apoptosis of cells.[20] There are also several sites where vertebrate SMAD family transcription factors can bind. These transcription factors are thought to be responsible for nucleocytoplasmic dynamics.[21] SMAD transcription factors could therefore affect KIAA0825, since its predicted subcellular localization suggests that it shuttles across the nuclear envelope.

Function

Two proteins have been found to interact with KIAA0825. One is interleukin enhancer-binding factor 3 (ILF3),[22] a factor that complexes with other proteins, regulates gene expression, and stabilizes mRNAs.[23] The other is the amyloid-beta precursor protein,[24] an integral membrane protein found most commonly in the synapses of neurons. Neither of these proteins is understood well enough to indicate for certain the role of KIAA0825 in human cells. However, they suggest that KIAA0825 could serve a variety of roles in different parts of the cell.

Orthology

KIAA0825 orthologs can be found in virtually all animals but not in plants, bacteria, or protozoa. The gene is most highly conserved in vertebrates, especially mammals, but genes containing a region similar to DUF4495 can be found in the California sea hare (Aplysia californica), one of the simpler animals. The size of the protein is well conserved, especially in mammals, staying between about 1250 and 1300 amino acids. This suggests that the protein folds back on itself to form structures important for its function.

No paralogs of KIAA0825 have been found in humans or in any other species.

References

  1. "Impact of diabetes-related gene polymorphisms on the clinical characteristics of type 2 diabetes Chinese Han population". Oncotarget 7 (51): 85464–85471. December 2016. doi:10.18632/oncotarget.13399. PMID 27863428. 
  2. "Open reading frames associated with cancer in the dark matter of the human genome". Cancer Genomics & Proteomics 11 (4): 201–13. July–August 2014. PMID 25048349. 
  3. NCBI Resource Coordinators (January 2017). "Database Resources of the National Center for Biotechnology Information". Nucleic Acids Research 45 (D1): D12–D17. doi:10.1093/nar/gkw1071. PMID 27899561. 
  4. "Methods and algorithms for statistical analysis of protein sequences". Proceedings of the National Academy of Sciences of the United States of America 89 (6): 2002–6. March 1992. doi:10.1073/pnas.89.6.2002. PMID 1549558. Bibcode: 1992PNAS...89.2002B.
  5. Brendel, Volker. "SDSC Biology Workbench". Department of Mathematics, Stanford University, CA. http://workbench.sdsc.edu/. Retrieved 17 April 2017. 
  6. "Analysis and prediction of leucine-rich nuclear export signals". Protein Engineering, Design & Selection 17 (6): 527–36. June 2004. doi:10.1093/protein/gzh062. PMID 15314210. 
  7. "PSORT: a program for detecting sorting signals in proteins and predicting their subcellular localization". Trends in Biochemical Sciences 24 (1): 34–6. January 1999. doi:10.1016/s0968-0004(98)01336-x. PMID 10087920. 
  8. "Predicting transmembrane beta-barrels in proteomes". Nucleic Acids Research 32 (8): 2566–77. 28 April 2004. doi:10.1093/nar/gkh580. PMID 15141026. 
  9. "The PredictProtein server". Nucleic Acids Research 32 (Web Server issue): W321–6. July 2004. doi:10.1093/nar/gkh377. PMID 15215403. 
  10. "Analysis of the accuracy and implications of simple methods for predicting the secondary structure of globular proteins". Journal of Molecular Biology 120 (1): 97–120. March 1978. doi:10.1016/0022-2836(78)90297-8. PMID 642007. 
  11. "Analysis of Conformations of Amino Acid Residues and Prediction of Backbone Topography in Proteins". Israel Journal of Chemistry 12 (1–2): 239–286. 1974. doi:10.1002/ijch.197400022. 
  12. "Methods and algorithms for statistical analysis of protein sequences". Proceedings of the National Academy of Sciences of the United States of America 89 (6): 2002–6. March 1992. doi:10.1073/pnas.89.6.2002. PMID 1549558. Bibcode: 1992PNAS...89.2002B.
  13. "I-TASSER server for protein 3D structure prediction". BMC Bioinformatics 9 (1): 40. January 2008. doi:10.1186/1471-2105-9-40. PMID 18215316. 
  14. "I-TASSER: a unified platform for automated protein structure and function prediction". Nature Protocols 5 (4): 725–38. April 2010. doi:10.1038/nprot.2010.5. PMID 20360767. 
  15. "The I-TASSER Suite: protein structure and function prediction". Nature Methods 12 (1): 7–8. January 2015. doi:10.1038/nmeth.3213. PMID 25549265. 
  16. "Proteomics. Tissue-based map of the human proteome". Science 347 (6220): 1260419. January 2015. doi:10.1126/science.1260419. PMID 25613900. 
  17. "Proteomics. Tissue-based map of the human proteome". Science 347 (6220): 1260419. January 2015. doi:10.1126/science.1260419. PMID 25613900. 
  18. "Genomatix". https://www.genomatix.de/index.html. Retrieved 7 May 2017. 
  19. "Effects of distamycin A on human leukocytes in vitro". Cytogenetics and Cell Genetics 23 (1–2): 103–7. 1 January 1999. doi:10.1128/MCB.19.1.704. PMID 83927. 
  20. "The myeloid zinc finger gene (MZF-1) delays retinoic acid-induced apoptosis and differentiation in myeloid leukemia cells". Leukemia 12 (5): 690–8. May 1998. doi:10.1038/sj.leu.2401005. PMID 9593266. 
  21. "Smad transcription factors". Genes & Development 19 (23): 2783–810. December 2005. doi:10.1101/gad.1350705. PMID 16322555. 
  22. "Multiple myeloma-associated chromosomal translocation activates orphan snoRNA ACA11 to suppress oxidative stress". The Journal of Clinical Investigation 122 (8): 2793–806. August 2012. doi:10.1172/JCI63051. PMID 22751105. 
  23. "Proteomic analysis of interleukin enhancer binding factor 3 (Ilf3) and nuclear factor 90 (NF90) interactome". Biochimie 95 (6): 1146–57. June 2013. doi:10.1016/j.biochi.2013.01.004. PMID 23321469. 
  24. "Interactions of pathological hallmark proteins: tubulin polymerization promoting protein/p25, beta-amyloid, and alpha-synuclein". The Journal of Biological Chemistry 286 (39): 34088–100. September 2011. doi:10.1074/jbc.M111.243907. PMID 21832049.