An Overview of Genetic Polymorphism and Lung Cancer Risk

ISSN 2472-0429

Advances in Cancer Prevention

  • Review Article   
  • Adv Cancer Prev 2016, Vol 1(2): 106
  • DOI: 10.4172/2472-0429.1000106

An Overview of Genetic Polymorphism and Lung Cancer Risk

Kouya Shiraishi*, Takayuki Honda and Takashi Kohno
Division of Genome Biology, National Cancer Center Research, Tokyo, Japan
*Corresponding Author: Kouya Shiraishi, Division of Genome Biology, National Cancer Center Research Institute, 5-1-1 Tsukiji, Chuo-ku, Tokyo 104-0045, Japan, Tel: +81-3-3542-2511, Fax: +81-3-3542-0807, Email: [email protected]

Received Date: Jan 20, 2016 / Accepted Date: Feb 22, 2016 / Published Date: Feb 29, 2016


The incidence and mortality rates of lung cancer in Asia have increased over the past few decades. Owing to the development of molecular-targeted drugs, the prognosis for patients with advanced stage lung cancer has improved. Despite this improvement the 5-year survival rate is very low. Accordingly, it is necessary to identify groups at high risk for lung cancer for prevention. Both environmental and genetic factors are related to the development of lung cancer. For example, cigarette smoking, passive (secondhand) smoking, asbestos exposure, and air pollution are associated with lung cancer. Genetic polymorphisms, including Single Nucleotide Polymorphisms (SNPs), underlie inter-individual differences in cancer susceptibility, and genetic loci for lung cancer risk have been identified by Genome-Wide Association Studies (GWAS) in both European/American and Asian populations. Recent GWAS of lung cancer have identified genetic susceptibility variants on Chromosome 15q25 (CHRNA), 5p15 (TERT), 3q28 (TP63), 6q22 (DCBLD1), 6p21 (BAT3-MSH2 and BTNL2), 10q25 (VTI1A), and 17q.24 (BPTF); however, loci significantly associated with lung cancer risk differ among ethnicities. Here, we review previous GWAS and discuss genetic factors that may be useful for the early detection or prevention of lung cancer.

Keywords: Lung cancer; Genome-wide association study; Single nucleotide polymorphism; Cancer susceptibility


Lung cancer is a leading cause of cancer-related mortality in developed and developing countries, and its incidence is increasing. Advanced cancer genome technologies can be used to detect alterations in oncogenes, such as EGFR, KRAS, BRAF, HER2/ERBB2, ALK, RET, and ROS1 [1,2]. Despite the development of moleculartargeted drugs for oncogenic mutations, there are few efficient therapies for the advanced stage of lung cancer. Owing to acquired resistance to therapy, recurrence rates are still high. In fact, the 5-year survival rate for stage IV lung cancer is less than 20%, in contrast to the 71.4% 5-year survival rate for stage IA [3]. These results suggest that earlier detection and treatment of lung cancer would significantly improve outcomes and reduce mortality. Cigarette smoke is a major cause of lung cancer. Cigarette smoke, including secondhand smoke, is associated with a substantially elevated risk of mortality [4]. Lung cancer types are typically histologically classified as Small Cell Lung Cancer (SCC) and non-small cell lung cancer, which includes Adenocarcinoma (ADC) and Squamous Cell Carcinoma (SQC) [5]. Smoking is more weakly associated with the development of ADC than with the development of SCC and SQC [6], indicating that the mechanisms of carcinogenesis differ among the histological types. Several Genome-Wide Association Studies (GWAS) report that inherited genetic factors (i.e., genetic polymorphisms) increase the risk of lung cancer [7-14]. Based on the results of meta-analyses and subgroup analyses, these genetic factors may have different effect sizes with respect to lung cancer risk depending on the population, smoking status, and histological type. In this review, we summarize GWAS related to lung cancer and review the putative roles of genetic factors in lung cancer development.

GWAS of lung cancer risk

The results of previous GWAS are summarized in the Table 1. Several studies report that three chromosomal loci, 15q24-25.1, 5p15.33, and 6p21, are associated with lung cancer risk in European and American populations [7-14], while four, 3q28, 5p15.33, 6p21, and 17q24.2, are associated with ADC risk in Japanese and/or Korean populations [10,11]. In addition, loci at 5q32, 10p14, 13q12.12, 20q13.2, and 22q12.2 are associated with lung cancer risk in the Chinese population [12,15] and loci at 10q25 and 6p21 are associated with susceptibility to lung cancer in females who have never smoked in the Asian population [16]. Loci at 12p13.33 and 12q23.1 are associated with SQC risk in individuals of European ancestry [17] and in the Chinese population [18]. However, the associations for some susceptibility loci were not validated in independent samples, and further verification is needed.

The chromosomal 15q24–25.1 region contains nicotinic acetylcholine receptor subunit genes, i.e., CHRNA3 (cholinergic receptor, nicotinic, alpha 3) and CHRNA5. These subunits are expressed in pulmonary epithelial cells and bind to nicotine and nitrosamines, including potential lung carcinogens in cigarette smoke and food [19,20]. The binding induces proliferation of cancer cells [20]. In Asia, associations between SNPs in these genes and lung cancer risk have been reported [21,22], but studies have yielded conflicting results [23]. Because the frequency of risk alleles in the Asian population is much lower than that in European populations, the conflicting results probably reflect the lower statistical power in these studies. At minimum, the contribution of the CHRNA risk alleles to lung cancer risk differs between Asian and European populations. Thus, it is necessary to investigate a cohort of subjects or large sample sets in Asian populations.

The 5p15.33 region contains the TERT (Telomerase Reverse Transcriptase) and CLPTM1L (cleft lip and palate transmembrane protein 1) genes. TERT protein functions in telomere replication and maintenance and promotes epithelial cell proliferation [24]. The risk allele frequency of the TERT SNP rs2736100 is similar among various ethnic populations, and associations have been detected in Europeans, Americans, and Asians [9-14,23]. According to a recent meta-analysis of various lung cancer types, this SNP is more strongly associated with ADC than with SQC or SCC [25]. Similarly, the association between lung cancer and TERT SNPs is stronger (i.e., associations with greater odds ratios) in Asian than in Caucasian populations (Table 1). The frequency of driver mutations in ADC is different among Caucasians and Asians. EGFR is activated by inflame deletion mutations in exon 19 and a point mutation in exon 21 (e.g., L858R) in 40–55% of ADC cases in the East Asian population and in 5–15% of ADC cases in the American population [26]. The association for the TERT SNP rs2736100 SNP in TERT intron 2 was reported to be stronger in ADC with an EGFR mutation than in ADC without an EGFR mutation [27]. Therefore, genetic modifiers and/or environmental factors might contribute to differences among histological types. The rs2736100 SNP is associated with susceptibility to other cancer types, including cancers of the brain, bladder, prostate, uterine cervix, and skin, as well as testicular cancer and chronic lymphocytic leukemia [28,29]. There are conflicting results regarding the association between the TERT SNPs and telomere length in leukocytes [28,30]. However, variants in the TERT promoter region (rs2853669 and rs2735940) and intron 4 (rs10069690) are likely to affect telomere length in leucocytes or noncancerous tissues [31,32]. The rs2853669 SNP is associated with ADC risk (rs2853669, odds ratio (OR) = 1.38); however, neither rs2853669 nor rs10069690 is statistically associated with ADC risk in the Japanese population (OR=1.06 and 1.07, respectively) [11]. To elucidate the effects of TERT SNPs, additional studies are needed to determine the relationships among TERT SNPs, ADC risk, and telomere length in non-cancerous or normal lung tissues.

Suscepti-bility locus Reported Gene(s) Strongest SNP-
Risk Allele
Risk allele frequency* Odds ratio Histology
EAS EUR SAS Asian Caucasian
(1) Common variants
3q28 TP63 rs4488809-C/
1.19-1.31 1.13 ADC
5p15.33 TERT rs2736100-G/
1.27-1.46 1.12-1.14 ADC
5p15.33 CLPTM1L rs31489-C/
0.99-1.33 1.12-1.15 NSCLC
(ACD and/or SQC)
5q32 STK32A rs2895680-C 0.338 0.233 0.429 1.14 - Lung cancer
6q22.1 DCBLD1 rs9387478-C 0.502 0.495 0.266 - 1.18 ADC
TRNAA-UGC rs4324798-A 1.000 0.071 1.000 - 1.16 ADC
6p21.32 HLA class II rs2395185-T 0.311 0.321 0.299 - 1.17 ADC
BTNL2 rs3817963-G 0.231 0.276 0.333 - 1.11-1.18 ADC
6p21.33 BAT3-MSH5-APOM rs3117582-C 0.000 0.075 0.000 - 1.22-1.24 ADC, SQC, SCC
10p14 Near GATA3 rs1663689-A 0.560 0.806 0.655 - - Lung cancer
10q25.2 VTI1A rs7086803-A 0.294 0.028 0.050 1.28 - ADC
12p13.33 RAD52 rs6489769-G 0.465 0.364 0.330 - 1.20 SQC
12q23.1 SLC17A8-NR1H4 rs12296850-A 0.743 0.940 0.806 0.78 - SQC
13q12.12 MIPEP rs753955-G 0.337 0.627 0.687 1.18 - Lung cancer
15q25.1 CHRNA3,CHRNA5, CHRNB4, PSMA4, AGPHD1 rs8034191-C/
0.99-1.06 1.29-1.35 ADC, SQC, SCC
17q24.2 BPTF rs7216064-A 0.692 0.214 0.269 1.16-1.20 - ADC
20q13.2 CYP24A1 rs4809957-T 0.394 0.791 0.625 1.13 - Lung cancer
22q12.2 MTMR3 rs36600-A 0.093 0.273 0.221 1.29 - Lung cancer
(2) Rare variants
13q13.1 BRCA2 rs11571833-T 0.000 0.011 0.0082 - 2.47 SQC
22q12.1 CHEK2 rs17879961-C 1.000 0.0050 1.000 - 0.38 SQC
6p21.33 BAT2 rs9469031-C 0.985 0.9960 0.989 1.92 - ADC
6p21.33 FKBPL rs200847762-C 0.9990 0.000 0.000 4.76 - ADC
20q11.21 BPIFB1 rs6141383-A 0.016 1.000 0.0020 1.72 - ADC
*EAS, East Asia in 1000 Genomes super population; EUR, Europe in 1000 Genomes super population; SAS, South Asia in 1000 Genomes super population.

Table 1: Summary of loci identified in previous GWAS for lung cancer risk.

CLPTM1L is located near TERT. This gene was identified by screening and ovarian cancer cell line for Cisplatin (CDDP) resistancerelated genes. A recent meta-analysis suggested that the association between rs31489 located in CLPTM1L and lung cancer risk was stronger in a population of European ancestry than in Asians [29]. CLPTM1L is required for lung tumorigenesis in a conditional KRasG12D transgenic mouse model [33]. The frequency of KRASmutated ADC is different among Caucasians and Asians. KRAS protein is activated by a single amino acid substitution (at codons 12 or 13) in 20–30% of ADC cases in the American population and in 8– 10% of ADC cases in the East Asian population [26]. CLPTM1L SNPs might be preferentially associated with the risk of KRAS-mutated ADC.

The 3q28 region contains TP63; it encodes a member of the tumor suppressor TP53 (also known as p53) gene family, which is involved in development, differentiation, and the cellular stress response [34]. The risk allele frequencies of the rs10937405 and rs4488809 SNPs, which are located in intron 1 of TP63, are similar among ethnic populations, and previous GWAS show that TP63 SNPs are risk-conferring factors for the development of non-small cell carcinoma, especially for ADC and East Asian populations [10,35]. The rs4488809 SNP was identified in an eQTL (expression quantitative trait loci) analysis of TP63 in lung tissue [36]. Therefore, inter-individual differences in the accumulation of DNA damage and the response to genotoxic stress might contribute to ADC development.

The 17q24.3 region contains BPTF (bromodomain PHD finger transcription factor), which encodes a nucleosome remodeling factor that regulates transcription via specific recognition of methylated histone proteins [37]. According to a GWAS, variants of BPTF are associated with ADC risk, and the risk allele frequency of the rs7216064 SNP is different among ethnic populations [11]. The association for this variant was validated in one study of female lung cancer patients with no history of smoking in Asian populations [16]. BPTF protein is required for c-MYC transcriptional activity [38,39] and somatic alterations in BPTF have been detected in a small percentage of ADC tissue samples [2,30,39]. However, further studies are needed to determine whether BPTF is responsible for susceptibility to ADC.

Associations between lung cancer and SNPs in the 6p21 region have been found in GWAS of Europeans and Asians [8-16]. This region contains BAT3 (HLA-B associated transcript 3) and MSH5 (mutS homolog 5). The BAT3 protein forms a complex with a histone acetyltransferase, p300, which acetylates the p53 protein in response to DNA damage, whereas MSH5 is involved in DNA mismatch repair. Variants in BAT3-MSH5 are associated with lung cancer risk in European populations, irrespective of histological type; however, these BAT3 and MSH5 SNPs are mono-morphic in Asian populations. In our own GWAS, variants of BTNL2 (butyrophilin-like 2) and the HLADQA1 locus located in the HLA (Human Leukocyte Antigen)-class II region in 6p21 were significantly associated with ADC risk [11,40]. The risk allele frequency of BTNL2 SNPs is similar among various ethnic populations. In addition, BTNL2 SNPs are associated with the risk of lung cancer in females with no history of smoking [16]. It is possible that polymorphisms in HLA-class II genes confer lung cancer susceptibility owing to inter-individual differences in the immune response against tumor cells. The 6p21 region is highly polymorphic and contains most major histocompatibility complex genes. The construction of HLA reference panels of Caucasians and Japanese [41,42] has enabled GWAS via HLA genotype imputation, and therefore will enable the identification of HLA alleles that confer a cancer risk.

To identify rare variants that have large effects on lung cancer risk (e.g., OR > 2.0), it may be particularly useful to conduct GWAS via genotype imputation using the 1000 Genomes reference panel [43] or whole-genome and/or whole-exome sequencing. By a study using genotype imputation, such variants in BRCA2 (p.Lys3326X: rs11571833) and CHEK2 (p.Ile157Thr: rs13314271) associated with SQC risk were found in populations of European ancestry [35]. Rare variants at 6p21.33 and 20q11.21 were found to be associated with lung cancer risk in the Chinese population [44]. Variants of PARK2 (p.Arg275Trp) and HER2 (p.Gly660Asp) have been detected in familial lung cancer by whole-exome sequencing [45,46]. These results indicate that genotype imputation and whole-genome and/or whole-exome sequencing may facilitate the identification of rare variants with large effects on lung cancer risk.

The limitations and applications of GWAS

Genetic factors affecting lung cancer risk have been identified by GWAS and other association studies. The results of these studies indicate that the effect sizes for risk variants differ among populations for a variety of reasons, including differences in allele frequency and interactive effects between variants and the genetic and environmental backgrounds. Additional studies are needed to clarify rare diseasecausing variants in a range of ethnic groups, particularly studies that use larger numbers of Asian and European subjects, i.e., via international and/or Intra Consortiums (ILCCO, the International Lung Cancer Consortium [47]; FLCCA, the Female Lung Cancer Consortium in Asia; and JLCS, Japanese Lung Cancer Collaborative Study) [11,16]. Another point to be clarified is interaction between genetic factors and exogenous/endogenous factors on lung cancer risk. Interaction of SNPs identified by GWASs with other cancer-related factors, such as smoking and age, is unclear. Recently, a GWAS focusing on gene-smoking interaction of cigarette smoke with lung cancer risk was performed in a Chinese population, but it did not yield a conclusive result [48]. Therefore, GWASs that consider such interactions should be further performed in several populations and/or in larger scales.

GWAS focusing on specific lung cancer types will be a powerful approach to identify additional genetic factors that are associated with the risk of specific lung cancer types, such as SCC (the deadest histological type of lung cancer), lung cancers of female neversmokers, those of patients of young ages, or those with specific oncogene mutations, such as EGFR and KRAS mutations. Understanding the underlying genetic factors will help greatly in clarifying the disease etiology and in identifying high-risk individuals for targeted screening and/or prevention based on a combination of genetic and environmental factors.


This work was supported in part by the Practical Research for Innovative Cancer Control from the Japan Agency for Medical Research and Development (15ck0106096h0002 and 15Ack0106168h0001), the Princess Takamatsu Cancer Research Fund, the Suzuken Memorial Foundation, and the National Cancer Center Research and Development Fund (26-A-8, 25-A-1 and NCC Biobank).


Citation: Shiraishi K, Honda T, Kohno T (2016) An Overview of Genetic Polymorphism and Lung Cancer Risk. Adv Cancer Prev 1: 106. Doi: 10.4172/2472-0429.1000106

Copyright: © 2016, Shiraishi K, et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Select your language of interest to view the total content in your interested language

Post Your Comment Citation
Share This Article
Relevant Topics
Recommended Conferences
Article Usage
  • Total views: 8607
  • [From(publication date): 5-2016 - Aug 20, 2018]
  • Breakdown by view type
  • HTML page views: 8480
  • PDF downloads: 127

Review summary

  1. Ritu
    Posted on Aug 30 2016 at 2:58 pm
    The review article discussed the various aspects of cancer describing the limitations and applications of genetic factors increasing the risk of lung cancer. Understanding the underlying genetic factors will help greatly in clarifying the disease etiology and in identifying high-risk individuals for targeted screening and/or prevention based on a combination of genetic and environmental factors.

Post your comment

captcha   Reload  Can't read the image? click here to refresh
Leave Your Message 24x7