By Author
  By Title
  By Keywords

April 2017, Volume 67, Issue 4

Review Articles

In silico analysis of fragile histidine triad involved in regression of carcinoma

Muhammad Asif Rasheed  ( Department of Biosciences, COMSATS Institute of Information Technology, Sahiwal, Pakistan. )
Fatima Tariq  ( Department of Biosciences, COMSATS Institute of Information Technology, Sahiwal, Pakistan. )
Sara Afzal  ( Department of Biosciences, COMSATS Institute of Information Technology, Sahiwal, Pakistan. )
Shazia Mannanv  ( Department of Biosciences, COMSATS Institute of Information Technology, Sahiwal, Pakistan. )


Hepatocellular carcinoma (HCCa) is a primary malignancy of the liver. Many different proteins are involved in HCCa including insulin growth factor (IGF) II , signal transducers and activators of transcription (STAT) 3, STAT4, mothers against decapentaplegic homolog 4 (SMAD 4), fragile histidine triad (FHIT) and selective internal radiation therapy (SIRT) etc. The present study is based on the bioinformatics analysis of FHIT protein in order to understand the proteomics aspect and improvement of the diagnosis of the disease based on the protein. Different information related to protein were gathered from different databases, including National Centre for Biotechnology Information (NCBI) Gene, Protein and Online Mendelian Inheritance in Man (OMIM) databases, Uniprot database, String database and Kyoto Encyclopedia of Genes and Genomes (KEGG) database. Moreover, the structure of the protein and evaluation of the quality of the structure were included from Easy modeler programme. Hence, this analysis not only helped to gather information related to the protein at one place, but also analysed the structure and quality of the protein to conclude that the protein has a role in carcinoma.
Keywords: Hepatocellular carcinoma, Fragile histidine triad, FHIT, Proteomics, Bioinformatics.


In our previous study,1 we presented the analysis of mothers against decapentaplegic homolog 4 (SMAD4) protein, while the current study is based on the analysis of fragile histidine triad (FHIT) protein involved in hepatocellular carcinoma. The proteomics studies are helpful in understanding the cell processes, and the effect of proteins on the cell processes. Moreover, it can also be helpful in studying the effect of the environment and the cell processes on the proteins. Hence, through applying the proteomics tools and identifying the proteins in a disease and the healthy individuals\\\' samples, we may identify the biomarkers to differentiate both the classes. Such discoveries may also lead to protein-based diagnostic tools and better understanding of a disease state. For example, protein expression pattern or profile during cancer is different compared to a normal healthy situation. Such unique proteins which are present in the diseased condition and not in the healthy cells can be used as markers for the disease study as well as target during the disease treatment. The analysis of the proteins using a proteomics approach is very useful and has different applications. For example, proteins isolated from body tissues gives a complete idea of the tissue situation as well as provided a good basis to study other processes such as post-translational modifications, protein functionality or protein complexes.2
Bioinformatics introduce new algorithms in the field of proteomics to handle large and heterogeneous data sets. With the integrated use of "-omics" disciplines, the identification and the characterisation of candidate genes, proteins and molecules involved in a given disease will probably represent one of the milestones of future healthcare.3 Proteomics deals with the identification of the proteins produced by cells in normal and diseased conditions, while metabolomics monitors the role of small molecules (lipids, sugars and amino acids) involved in daily cellular function.4 There are approximately 30,000 genes in the human genome and the number of proteins is likely at least three times higher, as resulting from alternative splicing and post-translational modifications.5

Hepatocellular Carcinoma

Hepatocellular carcinoma (HCCa) is a primary malignancy of the liver. Most cases of HCCa are secondary to either a viral hepatitide infection (hepatitis B or C) or cirrhosis (alcoholism) which is the most common cause of hepatic cirrhosis.6 In countries where hepatitis is not endemic, most malignant cancers in the liver are not primary HCCa but metastasis of cancer from elsewhere in the body, e.g., the colon. HCCa is the fifth most common cancer and the third most common cause of cancer death worldwide. There were few effective therapeutic options available for those suffering from advanced disease7 but progress has been seen in surgical treatment of hepatocellular carcinoma.8 Moreover, many immune-based approaches have shown efficacy in achieving disease regression and representing the most promising new treatment approach.9 Administration of levocarnitine and/or branched chain amino acids during invasive treatments reduced blood ammonia (NH3) concentration and suppressed the albumen.10 HCCa is reported to be the second most common cause of non-islet cell tumour hypoglycaemia (NICTH) which was controlled by using systemic chemotherapy.11 This type of cancer occurs more often in men than women and is usually seen in people with age of 50 or older.12 However, age varies in different parts of the world. Moreover, different risk factors of HCCa include hepatitis B virus (HBV) and alcohol intake, passive smoking, indoor air pollution and pesticide exposure. Moreover, fruit and tea intake may significantly lower the risk of HCCs.13 Furthermore, different factors influencing the prognosis of patients with HCCa were explored.14 Hence in order to prevent HCCa, clinicians should provide health education to overcome such risk factors and encourage HBV and hepatitis C virus (HCV) carriers to undergo annual physical examination and receive adequate treatment.15 Specifically, hepatitis B should be treated in order to prevent HCCa.16
Different up-regulated proteins during the disease include insulin growth factor (IGF) II, a disintegrin and metalloproteases (ADAM) 9, signal transducers and activators of transcription (STAT) 3, suppressors of cytokine signalling (SOCS) 3, and cyclin D1 while the down-regulated proteins during the disease include collagen I, SMAD 4, fragile histidine triad (FHIT), and SOCS1.17 Other proteins include, POU class 5 homeobox 1 (OCT4), baculoviral IAP repeat containing 5 (BIRC5), cyclin D1 (CCND1), ATP binding cassette subfamily G member 2 (BCRP), SRY-box 2 (Sox2), Glutathione S-transferases (GST), NCK adaptor protein 1 (NCK1), human leukocyte antigen DQ (HLA-DQ), miR-106b, c-Myc, Ki67 and selective internal radiation therapy (SIRT).

Analysis of FHIT by Different Databases and Software Tools

The bioinformatics analysis of FHIT protein was performed by using different tools, software and databases related to bioinformatics. Different details related to the proteins were gathered from different databases and software including the National Centre of Biotechnology Information (NCBI) gene, protein and Online Mendelian Inheritance in Man (OMIM) databases and UniProt database. Basic Local Alignment Search Tool (BLAST) was used to get the closest proteins related to FHIT in protein databank (PDB) database. Moreover, Kyoto Encyclopaedia of Genes and Genomes (KEGG) pathway database was used to show that the protein is involved in tumour suppression. Furthermore, String database was used to show the interacting proteins with FHIT while easy modeller programme was used to model the protein structure and to draw Ramachandaran plot.

FASTA Sequence of the Protein

The FASTA sequence of FHIT protein consists of 147 amino acids taken from NCBI protein database, which is: >gi|21595364|gb|AAH32336.1| FHIT protein [Homo sapiens]

Chromosome Location

The protein is located at chromosome 3 and the exact location of the protein is 3p14.2. Moreover, the molecular weight of the protein is 16858 Da. Different domains of FHIT protein include Histidine triad (HIT) domain, HIT-like domain and Histidine triad conserved site domain.19
BLAST was used to collect eight closest sequences compared to the query FASTA sequence of FHIT protein in PDB database. The PDB codes of selected proteins with other details were noted (Table).

Hence, the template chosen to predict the structure of the protein was 1FHIA. This template was predicted by Pace H.C. et al in 1998 by X-ray diffraction method with the resolution of 3.10 Å.20 The structure of FHIT was predicted by using easy modeller software (Figure-1),

while the quality of the predicted structure was checked by Ramachandran plot (Figure-2).

There were three most favourable regions in the plot and most of the residues were present in most favourable regions, suggesting a good quality protein structure.

Role of FHIT in other Carcinoma

Among 50% of oesophageal, stomach and colon carcinomas, inconsistent FHIT transcripts were identified.21 Moreover, loss of FHIT promoted carcinogens in human bronchial epithelial cells.22 Furthermore, in a series of small cell lung cancers (SCLCs) and non-small cell (NSCLC) types, FHIT gene structure and transcription were also analysed.23 The study was performed by reverse transcription polymerase chain reaction (RT-PCR) where in 11 of 14 SCLC tumours, abnormal-sized transcripts were found. In 9 of these cases, both normal and abnormal sized transcripts were present. Moreover, abnormal transcripts were found in 18 of 25 NSCLC tumours. One or 2 abnormal sized bands which were accompanied by a normal sized transcript suggested the presence of normal cells within the tumours. These researchers reported loss of heterozygosity for microsatellite markers internal to and flanking the FHIT locus. Furthermore, 11 of 12 tumours that exhibited abnormal FHIT transcripts showed allelic loss at 1 or more of the loci. Hence, the researchers suggested the inactivation of the FHIT gene by a mechanism of loss of 1 allele and altered expression of the remaining alleles in these tumours. They further concluded that accumulation of high levels of intracellular diadenosine tetraphosphate and the stimulation of deoxyribonucleic acid (DNA) synthesis and proliferation resulted in the loss of function of the FHIT

(Figure-3) which may occur as a consequence of physical, chemical, and biological agents. FHIT was associated with breast cancer,24 thyroid tumours,25 lung cancer,26,27 and acute lymphoblastic leukaemia.28 Hence, the tumour suppressor gene proteins (FHIT and others) expression can be one of the factors influencing prognosis and valuable for clinical treatment.29
String is a known and predicted protein-protein interaction database which shows the highest interaction of FHIT with ferredoxin reductase (FDXR)

(Figure-4) and it has also been detected by in vitro assay.30
Since the discovery of FHIT gene in 1996, more than 350 studies have been published.31 Hence, FHIT is altered in many human tumours, particularly in those caused by environmental carcinogens, such as those present in tobacco smoke.32 In many of these tumours, particularly in those induced by tobacco or other environmental carcinogens, alterations of FHIT occur very early during the multistep process of carcinogenesis. Infection with FHIT recombinant viruses may cause regression, and prevention of tumours in experimental animals showed that FHIT-negative cancer cells are very sensitive to the expression of FHIT.33 Thus, it is logical to predict the development of a gene therapy approach for the treatment and prevention of FHIT-negative human cancers. Moreover, the analysis of the protein using a proteomics approach is very useful and may have different applications. Bioinformatics introducing new algorithms in the field of proteomics to handle large and heterogeneous data sets and bioinformatics analysis of FHIT protein involved in different carcinomas may lead to better diagnosis as well as the treatment of the disease.


This review paper presented a summary of findings by a number of studies regarding significant role of FHIT in carcinogenesis. It is clear that the protein has a role in tumour regression. Moreover, bioinformatics has really made the proteomics research easy and efficient by introducing new algorithms in the field of proteomics to handle large and heterogeneous data sets. Proteins and molecules involved in a given disease will probably represent one of the milestones of future healthcare.
Disclaimer: None.
Conflicts of Interest: None.
Source of Funding: None.


1. Rasheed MA, Afzal S, Tariq F, Mannan S. In silico analysis of SMAD4 involved in hepatocellular carcinoma. Asian J Agri Biol 2014; 2: 28-33.
2. Görg A, Weiss W, Dunn MJ. Current two-dimensional electrophoresis technology for proteomics. Proteomics 2004; 4: 3665-85.
3. Tanke HJ. Genomics and proteomics: The potential role of oral diagnostics. Ann N Y Acad Sci 2007; 1098, 330-4.
4. Garcia I, Tabak LA. Beyond the "omics": Translating science into improved health. J Am Dent Assoc 2008; 139, 392-5.
5. Wright JT, Hart TC. The genome projects: Implications for dental practice and education. J Dent Educ 2002; 66, 659-71.
6. Kumar V, Fausto N, Abbas A, (editors). Robbins & Cotran Pathologic Basis of Disease. 7th ed. Philadelphia: Saunders, 2003; 914-7.
7. Parkin DM, Bray F, Ferlay J, Pisani P. Global cancer statistics, 2002. CA Cancer J Clin 2005; 55: 74-108.
8. Li KY, Liu LX, Yin DL. Progress in surgical treatment of hepatocellular carcinoma. Zhonghua Wai Ke Za Zhi 2016; 54: 148-52.
9. Liu D, Staveley-O K. Immune-based Therapy Clinical Trials in Hepatocellular Carcinoma. J Clin Cell Immunol 2015; 6: 376.
10. Iwasa M, Sugimoto R, Ishihara T, Sekoguchi-Fujikawa N, Yoshikawa K, Mifuji-Moroka R, et al. Usefulness of Levocarnitine and/or Branched-Chain Amino Acids during Invasive Treatment for Hepatocellular Carcinoma. J Nutr Sci Vitaminol 2015; 61: 433-40.
11. Huang J, Chang P. Refractory hypoglycemia controlled by systemic chemotherapy with advanced hepatocellular carcinoma: A case report. Oncol Lett 2016; 11: 898-900.
12. Parkin DM, Ohshima H, Srivatanukul P, Vatanasapt V. Cholangiocarcinoma: epidemiology, mechanisms of carcinogenese and prevention. Cancer Epidemiol Biomarkers Prev 1993; 2: 537-44.
13. Niu J, Lin Y, Guo Z, Niu M, Su C. The Epidemiological Investigation on the Risk Factors of Hepatocellular Carcinoma. Medicine (Baltimore) 2016; 95: e2758.
14. Rong WQ, Yu WW, Wu JX, Wu F, Wang LM, Tian F, et al. Analysis of prognostic factors in patients with hepatocellular carcinoma (?5 cm) underwent hepatectomy. Zhonghua Wai Ke Za Zhi 2016; 54: 89-93.
15. Jane S, Lin M, Chiu W, Lai L, Chen P, Chen M. Early detection of unhealthy behaviors, the prevalence and receipt of antiviral treatment for disabled adult hepatitis B and C carriers. BMC Public Health 2016; 16: 146.
16. Li YW, Yang FC, Lu HQ, Zhang JS. Hepatocellular carcinoma and hepatitis B surface protein. World J Gastroenterol 2016; 22: 1943-52
17. Tannapfel A, Anhalt K, Hausermann P, Sommerer F, Benicke M, Uhlmann D, et al. Identification of novel proteins associated with hepatocellular carcinomas using protein microarrays. J Pathol 2003; 201: 238-49.
18. First accessed in March-April, 2013 [title missing]
19. First accessed in March-April, 2013. [title missing]
20. Pace HC, Garrison PN, Robinson AK, Barnes LD, Draganescu A, Rösler A, et al. Genetic, biochemical, and crystallographic characterization of Fhit-substrate complexes as the active signaling form of Fhit. Proc Natl Acad Sci USA 1998; 95: 5484-9.
21. Ohta M, Inoue H, Cotticelli MG, Kastury K, Baffa R, Palazzo J, et al. The FHIT gene, spanning the chromosome 3p14.2 fragile site and renal carcinoma-associated t(3;8) breakpoint, is abnormal in digestive tract cancers. Cell 1996; 84: 587-97.
22. Boylston J, Brenner C. A knockdown with smoke model reveals FHIT as a repressor of Heme oxygenase 1. Cell Cycle 2014; 13: 2913-30.
23. Sozzi G, Veronese ML, Negrini M, Baffa R, Cotticelli MG, Inoue H, et al. The FHIT gene 3p14.2 is abnormal in lung cancer. Cell 1996; 85: 17-26.
24. Zaki S, Abdel-Azeez H, El Nagar M, Metwally K, Ahmed M. Analysis of FHIT gene methylation in egyptian breast cancer women: association with clinicopathological features. Asian Pac J Cancer Prevent 2015; 16: 1235-9.
25. Koc M, Aktimur R, Kagan Gokakin A, Atabey M, Koyuncu A, Elagoz S, et al. Expression of FHIT, p16, p53 and EGFR as prognostic markers in thyroid tumors of uncertain malignant potential. J BUON 2015; 20: 567-72.
26. Yu Y, Liu X, Yang Y, Zhao X, Xue J, Zhang W, et al. Effect of FHIT loss and p53 mutation on HPV-infected lung carcinoma development. Oncol Lett 2015; 10: 392-8.
27. Wu D, Hsu N, Wang Y, Lee M, Cheng Y, Chen C, et al. c-Myc suppresses microRNA-29b to promote tumor aggressiveness and poor outcomes in non-small cell lung cancer by targeting FHIT. Oncogene 2014; 34: 2072-82.
28. Malak CA, Elghanam DM, Elbossaty WF. FHIT Gene Expression in Acute Lymphoblastic Leukemia and its Clinical Significance. Asian Pac J Cancer Prev 2015; 16: 8197-201.
29. Chen Y, Wang X, Li F, Zhang L, Ma L, Liu Y. Relationship between expression of P27, Fragile Histidine Triad (FHIT), phosphatase and tensin homolog deleted on chromosome ten (PTEN), P73, and prognosis in esophageal squamous cell carcinoma. Ann Diagn Pathol 2015; 19: 33-6.
30. Trapasso F, Pichiorri F, Gaspari M, Palumbo T, Aqeilan RI, Gaudio E, et al. Fhit interaction with ferredoxin reductase triggers generation of reactive oxygen species and apoptosis of cancer cells. J Biol Chem 2008; 283: 13736-44.
31. Huebner K, Croce CM. Cancer and the FRA3B/FHIT fragile locus: it\\\'s a HIT. Br J Cancer 2003; 88: 1501-6.
32. Izzotti A, Pulliero A. Molecular damage and lung tumors in cigarette smoke-exposed mice. Ann N Y Acad Sci 2015; 1340: 75-83
33. Huebner K, Croce CM. FRA3B and other common fragile sites: the weakest links. Nat Rev Cancer 2001; 1: 214-21.
34. Berman HM, Westbrook J, Feng Z, Gilliland G, Bhat TN, Weissig H et al. The Protein Data Bank. Nucleic Acids Res. 2000; 28: 235-42. (Cited 2013 March-April). Available from URL:
35. Easy modeller software. (Cited 2013 March-April). Available from URL:
36. Kanehisa M, Goto S, Sato Y, Furumichi M, Tanabe M. KEGG for integration and interpretation of large-scale molecular data sets. Nucleic Acids Res. 2012; 40(Database issue): D109-14. (Cited 2013 March-April). Available from URL:
37. STRING database. (Cited 2013 March-April). Available from URL:

Journal of the Pakistan Medical Association has agreed to receive and publish manuscripts in accordance with the principles of the following committees: