alexa A Comparison of Protein and Phenolic Compounds in Seed from GMO and Non-GMO Soybean | Open Access Journals
ISSN: 2153-0602
Journal of Data Mining in Genomics & Proteomics
Make the best use of Scientific Research and information from our 700+ peer reviewed, Open Access Journals that operates with the help of 50,000+ Editorial Board Members and esteemed reviewers and 1000+ Scientific associations in Medical, Clinical, Pharmaceutical, Engineering, Technology and Management Fields.
Meet Inspiring Speakers and Experts at our 3000+ Global Conferenceseries Events with over 600+ Conferences, 1200+ Symposiums and 1200+ Workshops on
Medical, Pharma, Engineering, Science, Technology and Business

A Comparison of Protein and Phenolic Compounds in Seed from GMO and Non-GMO Soybean

Savithiry S Natarajan1*, Farooq H Khan1, Davanand L Luthria2, Mark L Tucker1, QijianSong1 and Wesley M Garrett3

1USDA-ARS, Soybean Genomics and Improvement Laboratory, Beltsville, MD 20705, USA

2USDA-ARS, Food Composition and Methods Development Laboratory, Beltsville, MD 20705, USA

3USDA-ARS, Animal Biosciences and Biotechnology Laboratory, Beltsville, MD 20705, USA

*Corresponding Author:
Savithiry S Natarajan
Soybean Genomics and Improvement Laboratory
ARS-USDA, 10300 Baltimore Avenue
Beltsville, USA
Tel: 301-504-5258
Fax: 301-504-5728
E-mail: [email protected]

Received Date: August 28, 2014; Accepted Date: September 11, 2014; Published Date: September 18, 2014

Citation: Natarajan SS, Khan FH, Luthria DL, Tucker ML, Song Q, et al. (2014) A Comparison of Protein and Phenolic Compounds in Seed from GMO and Non- GMO Soybean. J Data Mining Genomics Proteomics 5:161. doi: 10.4172/2153-0602.1000161

Copyright: 2014 Natarajan SS, et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited

Visit for more related articles at Journal of Data Mining in Genomics & Proteomics


Soybean protein is a valuable and important component in human and animal diets. Approximately 94% of the soybean planted in the US is genetically modified (GM) to enhance quality and productivity. Since value-added traits are continuously being developed by genetic modification, it is important to determine if any unintended changes occur in GM soybean seeds. In this investigation, we have selected three different transgenic lines, denoted event 1,2 and 3 with a single Agrobacterium tumefaciens T-DNA insert that included genes for a herbicide-resistance selectable gene (bar) and a β-glucuronidase (GUS) reporter gene expressed using a double 35S Cauliflower Mosaic Virus (CaMV) promoter and a soybean polygalacturonase (Glyma12g01480) promoter, respectively. The transgenic lines and nontransgenic
progenitor isoline (control) were used for both proteomic and phenolic compound analysis. Seed proteins
were separated by two-dimensional polyacrylamide gel electrophoresis (2D-PAGE). Out of approximately 1300 protein spots detected per protein extract, 30 spots were selected for further analysis based on software-determined differences (ANOVA) in their relative abundance in the protein gels for the control and three events. Subsequent statistical analysis after Bonferroni correction indicated that the abundance of only two of the thirty protein spots were significantly different at the 1% probability level. Two protein spots, an isoflavone reductase and a quinine oxidoreductase-like protein, in event 2 were significantly different from the control and the other two transgenic events. All thirty protein spots were analyzed and identified by mass spectrometry (MS) followed by a search of the NCBI databases using the Mascot
search engine. In addition to protein, two classes of phenolic compounds, isoflavonoids and phenolic acids, were analyzed by LC-MS. The results indicated no systematic differences in the amount or profile for either class of phenolic compounds in the control or three transgenic events.


Soybean; Transgenic; Proteomics; Two-dimensional gel electrophoresis; MALDI-TOF/TOF; Phenolic acids; Isoflavones


Soybean and soy derived products are consumed worldwide for benefits derived from their high protein and bioactive phenolic components, namely isoflavones and phenolic acids. Soybean proteins are used in human foods in a variety of forms. Consumption of soybeans reduced the risk of cancer, decreased risk factors for cardiovascular disease, and reduced chances of other chronic illnesses [1]. Soybean seed contains 2 major storage proteins: β-conglycinin and glycinin. Other proteins such as β-amylase, cytochrome c, lectin, lipoxygenase, urease, Bowman-Birk Inhibitor (BBI) and trypsin are also present [2]. Recently, Clemente et al. [3] reported that soybean trypsin and chymotrypsin inhibitor Bowman-Birk (BBI) were linked to the prevention and treatment of colorectal cancer. Soybean isoflavones also were reported to reduce the risk of breast, prostate, intestine, and stomach cancer [4,5].

Over the past few decades, genetically modified (GM) crops have played a significant role in increasing the productivity and nutritional value of crops, e.g., increasing tolerance to herbicides, improving resistance to pathogens and producing recombinant pharmaceutical molecules including human growth hormone and coagulation factor 1X [6-9].

Since 1996, GM crops have been commercially available in United States and many other countries have also approved the commercial use of GM crops. Clive [10] recently reported that approved GM crops are currently grown on approximately180 million hectares in more than 25 countries. Foods derived from GM crops are subjected to rigorous safety evaluation such as characterization of intended modification with specific tests for allergenicity and toxicity. In addition, potential unintended effects are evaluated on the basis of agronomic characteristics, compositional analysis, and evaluation of key nutrients [11].

A safety assessment of GMO crops is mandatory in US and other countries. Numerous international organizations have played vital roles in the formulation of universal safety/risk assessments of GMO crops (Codex Alimentarius Commission (CAC), 2003 and 2009; Food and Agriculture Organization of the United Nations (FAO), 2011; World Health Organization (WHO), 1993 and 2000; Organization of Economic Cooperation and Development (OECD), 1993). According to Millstone et al. [12], the safety assessment of new crops is based on the concept of “substantial equivalence”. If the chemical composition of a new crop is substantially similar to that of existing crop, it is not considered to pose a health risk [13]. Unintended effects may occur due to unforeseen interactions with other proteins or biochemical, or effects caused by random insertion into the genome, which can alter normal plant processes. It is therefore important that each genetic modification be examined on a case-by-case basis for unintended effects [14]. In 2010, the sequence of the soybean genome became available and greatly improved our ability to access unintended effects [15]. However, a clear understanding of unintended effects in regard to protein and metabolites due to transgenic modifications is still lacking [16-19].

Holistic profiling approaches such as genomic, proteomics, transcriptomics, and metabolomics have broadened the spectrum of compounds that can be identified and analyzed in contrast to earlier targeted analytical approaches [20-23]. In the present study, we utilize a combination of a proteomic and a more classical targeted approach. Proteins were separated using two-dimensional gel electrophoresis (2D-PAGE) and individual proteins identified by mass spectrometry. In addition, two classes of phenolic compounds, phenolic acids and isoflavones, were analyzed using high performance liquid chromatography (HPLC) with diode array (DAD) and mass spectral detections (MSD).

Materials and Methods

Plant material and generation of transgenic plants

A soybean phage genomic library was screened for clones with similarity to a PG11 cDNA. A 17 kb genomic insert was sequenced and when the soybean genomic sequence became available the two sequences were compared, with only a few differences found between the two sequences. Polymerase chain reaction (PCR) was used to fuse a 1951 bpGmPG11a gene (Glyma12g01480) promoter to a ß-glucuronidase (GUS) reporter gene immediately downstream from the ATGGUS open reading frame. The GUS open reading frame included an intron from the castor bean catalase gene 18 bp down from the start of translation [24]. The PG11a-GUS-NOS3 construct was cloned into the Bam H1- Eco R1 site of pTF101.1 in the opposite orientation to a herbicideresistance selectable marker gene (bialaphos resistance, bar) [25]. The herbicide-resistance gene was constitutively expressed using a double CaMV 35S promoter [26]. The Iowa State University Transformation Facility transferred this construct into A. tumefaciens (EHA101) and then transformed and regenerated transgenic soybean (Glycinel max Williams 82). Seven independent events displayed herbicide resistance and all seven tested positive for the GUS gene in a PCR genomic DNA assay. Five events displayed strong GUS staining in an abscission assay. Segregation analysis of second and third generation seed indicated that three events with strong GUS staining had a single copy of the transgene. Homozygous seed from third generation plants for each of the following three events were used for this study: ST-16-3-10, ST83- 28-1-15 and ST83-37-7-10. Seeds were collected from multiple plants for each event,mixed and several seeds were used for analysis

Protein extraction from transgenic soybean seeds

Protein was extracted from transgenic and isogenic control soybean seeds using a modified Trichloroacetic acid (TCA) / acetone method [27]. Soybean seeds were first pulverized to fine powder in liquid nitrogen using a mortar and pestle. Fifty mg of seed powder was homogenized in 1 mL of precipitation solution containing 10% (w/v) TCA in acetone with 0.07% (v/v) β-mercaptoethanol. The protein was allowed to precipitate for 24 hrs at -20°C. The precipitated protein then twice underwent 1 hr rinses with cold acetone containing 0.07% (v/v) β-mercaptoethanol, with each rinse followed by centrifugation at 20,800 g for 20 min at 4°C. The supernatant from each rinse was discarded. The protein precipitate was dried using a vacuum centrifuge for 30 minutes, and then resolubilized in 1.0 mL of lysis buffer [7M urea, 2M thiourea, 4% (w/v) CHAPS, 0.5% (w/v) DTT]. The mixture was homogenized with repetitive pipetting, followed by the use of a sonicator for 10 minutes, and centrifugation at 20,800 g for 20 minutes at 4°C. The supernatant containing the solubilized protein was used for each of the 2D-PAGE separations

2D-PAGE analysis

Protein (100 μg) was first separated by isoelectric focusing (IEF) on an IPGphor II apparatus using13 cm immobilized pH gradient (IPG) strips of pH ranges 4-7 and 6-11 (GE Healthcare, Piscataway, NJ).The dry IPG strips were hydrated 12 hrs in 250 μL rehydration buffer (7M urea, 2M thiourea, 4% CHAPS, 2% pharmalyte, 0.002% bromophenol blue) containing 100μg protein. The voltage settings for IEF were 500 V for 1 hr, 1000 V for 1 hr, 5000 V for 1 hr, and 8000 V to a total 24 kVhr. Following electrophoresis, the protein in the strips was reduced through incubation with equilibration buffer (50 mM Tris-HCl pH 8.8, 6 M urea, 30% glycerol, 2% SDS, 0.002% bromophenol blue, 1% DTT) and subsequently alkylated with the same buffer by substituting 2.5% iodoacetamide for DTT. Incubations for both reduction and alkylation were timed for 30 minute and took place on a shaker at room temperature. The second dimensional protein separation was achieved by polyacrylamide gel electrophoresis on a 12.5% polyacrylamide gel using a Hoefer SE 600 Ruby electrophoresis unit (GE Healthcare, Piscataway, NJ). The gels were then stained with colloidal CoomassieG-250. After de-staining with ddH20, gels were scanned using a GE ImageScanner III (GE Healthcare, Piscataway, NJ).

Protein visualization and image analysis

Protein expression analysis was conducted through the use of Progenesis SameSpots (TotalLab, Newcastle, England). Scanned images in Maya Embedded Language (MEL) file format were first uploaded and underwent a quality check for color saturation and ensured consistency in image resolution across all samples. The images were then aligned and spots were automatically detected. In addition to automated spot detection, a thorough visual inspection was used to eliminate nonspot fragments falsely reported by the software. Subsequent to the spot detection, the experimental design allowed the software to report differentially expressed spots across control and transgenic soybean gels. All of the differentially expressed spots with a p-value below 0.05 were chosen for subsequent analysis.

In-gel digestion of protein spots

Soybean protein spots differentially expressed across control and transgenic samples were excised with a 1.5 mm Spot Picker (The Gel Company, San Francisco, CA, USA). Protein digestion was performed using trypsin as described previously [27]. For further removal of the gel stain, the gel plugs were hydrated with 25mM ammonium bicarbonate on a shaker for 10 minutes, and then dried with acetonitrile for 10 minutes. The hydration and dehydration cycle was performed twice. The gel plugs were then thoroughly dried under vacuum and incubated overnight at 37ºC with 20 μL of 10 μg/mL porcine trypsin (Promega, sequencing grade, Madison, WI) in 20 mM ammonium bicarbonate for protein digestion. The resulting peptides were eluted from the gel in 50% acetonitrile and 5% trifluoroacetic acid. The extract was vacuum-dried and the dried peptides dissolved in 50% acetonitrile and 0.1% trifluoroacetic acid.

Mass spectrometry

Samples were spotted on a MALDI plate, co-crystallized with a 5 mg/ml concentration of α-cyanohydroxycinnamic acid (CHCA) matrix prepared in 70% acetonitrile containing 0.1% trifluoroacetic acid. Thirty fmol of a commercially prepared tryptic digest of bovine serum albumin (Michrom Bioresources, Inc. Auburn, CA, USA) was spotted onto the 13 calibration wells of the sample plate and 5 peptides with masses in the range of 927.493 m/z to 1881.905 m/z were used for the calibration. The mass spectrometer used was an AB SCIEX TOF/ TOF™ 5800 System (AB SCIEX, Framingham, MA, USA) operated in positive ion reflector mode to analyze tryptic peptides. Prior to analysis of unknowns, a plate model calibration was run to optimize mass accuracy and to update the instrument’s default calibration parameters. The instrument was operated in batch mode during peptide analysis, which entails first performing an MS survey scan on all spots of interest, followed by sequential MS/MS analysis of peaks detected in the MS scan. Acquisition of MS/MS data was controlled by an interpretation method that acquired MS/MS spectra on the strongest precursors first on up to 100 precursors detected in the MS scan. An exclusion mass list was prepared to prevent MS/MS analysis of common human keratin contaminant and minor porcine trypsin autolysis peaks. MS spectra for both standards and unknowns were acquired in positive ion reflector mode with 400 shots of a 349 nm Nd:YAG laser operating at 404 Hz. MS/MS spectra were also acquired in positive ion reflector mode with 250-1000 laser shots firing at a rate of 1010 Hz. Collision energy was set to 1kV and collision induced dissociation (CID) was enabled with air as the collision gas in the CID cell. When possible, known trypsin autolysis peaks at m/z 842.51 and 2,211.10 were used to internally calibrate the MS spectra.

Protein identification was performed using the Mascot search engine ( against the NCBI nonredundant database with the taxonomy filter Viridiplantae (green plans). The parameters for database searches included: monoisotopic mass, trypsin as the digestive enzyme with allowance for 1 missed cleavage, peptide tolerance of 50 parts per million (ppm), MS/MS tolerance of 0.6 Da, allowance of 1+ peptide charge, fixed modification for carbamidomethylation of cysteine residues, and variable modifications for oxidation of methionine residues as well as N-terminal pyroglutamic acid resulting from glutamic acid or glutamine. Positive identification of proteins by MS/MS analysis required a single peptide having a significant ion score. Samples identified as uncharacterized/unknown identity were subjected to sequence alignment match via BLAST against UniProt knowledgebase, where sequence similarity of 85% and above was used as a minimum criteria for inclusion.

Extraction and analysis of isoflavones from soybeans

Ground soybean samples (250 mg) were placed in 15 ml polypropylene conical tube with 5 ml of hexane. The mixture was placed in an ultrasonic bath for 15 min. The mixture was centrifuged and the hexane layer, which contained oil, was removed and discarded. The residue was extracted twice with 5 ml of with the previously optimized solvent mixture (EtOH:H2O:DMSO, 75:20:5, v/v/v). The supernatant from the two extraction cycles were pooled together in a volumetric flask and the volume of the combined extract was adjusted to 10 ml with extraction solvent. Appropriate aliquots of extracts were filtered through a 0.45 μm PVDF syringe filter for isoflavone analysis by LC-MS. Analysis of isoflavones was carried out using out using an Agilent 1290 Infinity LC system coupled to a diode array and a mass spectrometry detector from Agilent Technologies (Palo Alto, CA, USA). Separation of isoflavones was achieved using a reversed phase C18 Luna column (Phenomenex, Lorance, CA, USA, 150×4.6 mm; particle size 5 μm), preceded by a guard column (Phenomenex, 4×3.0 mm). The structures for all isoflavones were confirmed by comparison of retention time, UV spectra and mass spectral analysis. Peak areas were integrated for quantitation. Comparison of extraction efficiencies was achieved by comparing HPLC peak areas.

Extraction and analysis of phenolic acids from soybeans

Ground soybean samples (250 mg) were hydrolyzed in 2N NaOH containing 10 mM EDTA and 1% ascorbic acid for 30 min in an ultrasonic bath at 56°C as described previously [28]. After hydrolysis, the samples were cooled to an ambient temperature and the pH of the extract was adjusted to 2.5 with 6N HCl. Phenolic acids were isolated from the acidified extract with ethyl acetate (5 ml×2). The mixture was vortexed for 30 sec and centrifuged on a bench top centrifuge (Damon IEC HN-SII, Ramsey, Minnesota, USA) at 5000 rpm for 10 min. The upper organic ethyl acetate layer containing hydrolyzed phenolic acids was carefully transferred into a separate tube and evaporated under nitrogen gas. The dried residue was re-dissolved in 2 ml of 80% aqueous methanol filtered through a 0.45 μm PVDF syringe filters into HPLC vials for analysis. Four replicates of hydrolysis, extraction, and analysis were carried out with each sample. The structures for identified phenolic acids were confirmed by comparison of retention time, UV and mass spectral analysis as reported earlier [29]. Peak areas were integrated for quantitation. Comparison of phenolic acids was achieved by total peak area under the peak as detected by the UV-diode array detector.

Results and Discussion

Transgenic soybean seeds consist of single gene insertions containing both the bar herbicide gene and the β-glucuronidase (GUS) reporter gene controlled by the double CaMV 35S promoter and PG11a gene promoter, respectively. Although the 35S promoter is generally used as a constitutive promoter in plants, its expression level varies in different tissues. One study of soybean demonstrated that the CaMV 35S promoter was expressed strongly in leaves but very weakly in seed [30]. Similarly, we demonstrated using GUS staining that the PG11a promoter is expressed throughout the soybean seed with the highest expression in the vascular cylinder of the root radical (Figure 1). Seed morphology and coat color were similar between the 3 transgenic events and control soybean seeds tested in our current investigation


Figure 1: Histochemical GUS staining of non-transgenic and transgenic soybean seeds. Soybean seeds: event 1 (ST83-16-3-10), event 2 (ST83-28-1-15), event 3 (ST83-37-7-10), and control. Seeds were imbibed with water for 3 hours and then stained for GUS activated overnight at 37°C. Seeds were then split down the middle and pictures taken of the outside and inside of each half.

Savi # Spot # Control Event1 Event2 Event3 Bonferroni-corrected p-value
Abundance Abundance Abundance Abundance
1 2275 8.91E+05 2.05E+06 1.16E+06 1.08E+06 1
2 1402 1.54E+06 1.68E+06 1.69E+06 1.97E+06 1
3 584 7.63E+05 7.14E+05 1.10E+06 7.96E+05 0.3
4 412 1.68E+06 2.50E+06 1.34E+06 2.55E+06 0.1
5 415 2.19E+06 3.21E+06 2.06E+06 3.30E+06 0.93
6 912 6.98E+06 6.44E+06 9.40E+06 7.54E+06 0.35
7 873 3.82E+07 3.40E+07 4.90E+07 3.97E+07 0.06
8 958 4.98E+05 4.26E+05 6.99E+05 5.46E+05 0.18
9 862 8.15E+06 7.58E+06 1.28E+07 1.04E+07 0.07
10 1397 1.26E+06 1.18E+06 1.67E+06 1.59E+06 0.06
11 2295 1.30E+07 1.28E+07 2.05E+07 1.65E+07 0.02
12 905 4.73E+06 4.72E+06 7.32E+06 6.23E+06 0.16
13 856 3.24E+06 3.33E+06 5.33E+06 4.52E+06 0.14
14 904 2.08E+06 2.29E+06 3.58E+06 2.90E+06 0.27
15 2276 5.83E+05 5.10E+05 8.46E+05 7.20E+05 0.05
16 973 2.99E+05 3.35E+05 4.40E+05 4.13E+05 1
17 1159 2.52E+06 2.09E+06 3.19E+06 2.84E+06 0.11
18 1462 2.87E+06 3.00E+06 2.03E+06 2.56E+06 0.03
19 419 9.09E+06 7.14E+06 8.46E+06 9.22E+06 1
20 400 8.55E+07 8.31E+07 6.66E+07 7.22E+07 0.33
21 411 4.24E+06 3.25E+06 2.98E+06 3.49E+06 0.08
22 1124 3.21E+06 2.75E+06 4.05E+06 3.60E+06 0.01**
23 1118 3.02E+06 2.91E+06 3.92E+06 3.25E+06 0.11
24 1543 4.76E+05 1.12E+06 9.09E+05 1.02E+06 0.07
25 952 4.31E+05 5.94E+05 5.78E+05 5.45E+05 1
26 1060 8.28E+05 6.77E+05 1.15E+06 7.66E+05 0.01**
27 989 9.09E+06 7.66E+06 6.20E+06 6.39E+06 0.22
28 577 1.50E+06 2.05E+06 1.52E+06 1.64E+06 1
29 732 1.44E+06 1.87E+06 1.50E+06 1.29E+06 0.51
30 538 5.36E+06 7.42E+06 4.60E+06 5.56E+06 0.04

Table 1: Average protein levels in the seeds of control and transgenic soybeans and the Bonferroni-corrected p-value

Analysis of proteins

A modified TCA/acetone method was used to extract soybean proteins [27]. Three biological replicates of protein extracts from three transgenic events and control lines were separated by 2D-PAGE. Analysis was initially carried out with immobilized pH gradient (IPG) strips using a pH range of 3 to 10 (data not shown). However, over this broad pH range, storage proteins were poorly separated and the total numbers of resolved protein spots were insufficient. Improved separation and spot resolution were achieved using pH gradients between 4 to 7 and 6 to 11. Therefore, 100 μg of each protein extract was used for two separate 2D gels, one with a pH range from 4 to 7 and another with a pH range of 6 to 11. Representative 2D-PAGE gels for soybean seed protein are shown in (Figure 2) (Table 1). Comparison of 2D gel images for each of the transgenic events to that of control images identified 30 proteins spots that differed among treatments. Each of the 30 differentially expressed spots were excised manually from the gels and digested with trypsin. The tryptic digests were purified and analyzed by MALDI/TOF/TOF mass spectrometry. (Table 2) includes MASCOT information for each excised protein referenced by its spot number including: predicted protein identity, theoretical isoelectric point (pI), molecular weight (Mr), the original species that the protein was identified in, number of peptides matched, MOWSE score, and gene ID or accession number of the best match.


Figure 2: 2-D proteomic map of transgenic soybean. Proteins (200 μg) were separated in gradients of pH 4–7 for the first dimension, and resolved in the second dimension using 12.5% SDS-PAGE. The resulting gels were stained with Colloidal Coomassie Blue G-250. Numbered protein spots were excised and identified by MALDI/TOF/TOF. Differentially expressed proteins spots were numbered in the gel and protein information were listed in (Table 1).

Spot # Protein PI # MOWSE Score # of PM Mr (Da)
1 Beta-conglycinin, alpha chain gi|121281 669 7 121281
2 Beta-conglycinin, alpha chain-like [Glycine max] gi|356535993 67 1 68392
3 Alpha' subunit of beta-conglycinin [Glycine max] gi|9967361 277 2 65160
4 Alpha' subunit of beta-conglycinin [Glycine max] gi|9967361 566 6 65160
5 Alpha' subunit of beta-conglycinin [Glycine max] gi|9967361 533 6 65160
6 Beta-conglycinin alpha subunit [Glycine max] gi|335353923 943 11 70451
7 *Beta-conglycinin beta subunit [Glycine max] F7J077 1364 18 50442
8 *Beta-conglycinin beta subunit [Glycine max] F7J077 529 5 50442
9 *Beta-conglycinin beta subunit [Glycine max] F7J077 1462 20 50442
10 Beta-conglycinin beta subunit [Glycine max] gi|341603995 299 3 50010
11 Beta-conglycinin beta subunit [Glycine max] gi|63852207 1523 19 48358
12 Beta-conglycinin beta subunit [Glycine max] gi|341603995 906 11 50010
13 Beta-conglycinin beta subunit [Glycine max] gi|63852207 881 13 48358
14 Beta-conglycinin beta subunit [Glycine max] gi|341603995 432 3 50010
15 Beta-conglycinin beta subunit [Glycine max] gi|341603995 911 12 50010
16 Beta-conglycinin beta subunit [Glycine max] gi|341603995 172 1 50010
17 Beta-conglycinin beta subunit [Glycine max] gi|341603995 908 11 50010
18 Glycinin G1 precursor [Glycine max] gi|121276 100 1 56299
19 Glycinin A3B4 [Glycine max] gi|225440 521 6 27447
20 Glycinin G1 precursor [Glycine max] gi|121276 464 5 56299
21 Glycinin G1 precursor [Glycine max] gi|121276 405 5 56299
22 Isoflavone reductase homolog 2 [Glycine max] gi|351726399 811 9 33919
23 Isoflavone reductase homolog 2 [Glycine max] gi|351726399 826 11 33919
24 Cysteine synthase [Glycine max] gi|351727525 858 6 34362
25 *Pv42p [Phaseolus vulgaris] Q41108 119 1 41312
26 Quinone oxidoreductase-like protein At1g23740 gi|356571378 180 1 34660
27 *Alcohol dehydrogenase 1 [Glycine max] Q8LJR2 501 4 40007
28 Nucleoside diphosphate kinase [Glycine max] gi|26245395 252 3 16402
29 Peptidyl-prolyl cis-trans isomerase 1 / Cyclophilin 1 [Glycine max] gi|75304514 359 4 18395
30 Seed maturation protein PM30 [Glycine max] gi|351727184 228 2 15145
* Refers to identity based on BLASTp on resulting sequence from MASCOT

Table 2: Identification of differentially expressed protein between control and transgenic soybean seed variteies

Variation of storage proteins

Soybean seeds contain two major classes of storage proteins, the 7S and 11S proteins, which are normally referred to as β-conglycinin and glycinin, respectively. The β-conglycinins are trimeric glycoproteins consisting of three types of non-identical, but homologous polypeptide subunits: α, α′, and β. They form seven different combinations with the molecular weight of 180 kDa [31]. The second class of storage protein, glycinin, is hexameric with molecular weights of 360 kDa and consists of acidic (A) and basic (B) polypeptides. Glycinin is encoded by five non-allelic genes, Gy1, Gy2, Gy3, Gy4, and Gy5. It codes for five precursor protein molecules, G1, G2, G3, G4, and G5, respectively [32]. Based on physical properties, these five subunits are classified into two distinct major groups; group I consists of G1 (A1aBx), G2 (A2B1a), and G3 (A1aB1b) proteins and group II contains G4 (A5A4B3) and G5 (A3B4) subunits. Beillinson et al. [33] identified additional two genes, glycinin pseudogene (gy6) and functional gene (Gy7).

In this investigation a total of 30 differentially expressed protein spots were detected over pH ranges 4-7 and 6-11. Among the 30 spots, a total of 21 spots were identified as storage proteins. More specifically, 6 protein spots (spot#1-6) were identified as α subunits of β-conglycinin, 11 protein spots (spot#7-17) were identified as of β-subunits of β-conglycinin, and 4 spots (spot#18-21) were identified as glycinin G1 subunits. Based on the results, these protein spots are distributed over a wide range of pI and molecular weights and varied among the 3 transgenic events. Transgenic events 1, 2, and 3 had between 1.0 to 1.5 fold increase/decrease in protein abundance for the α-subunit and β-subunit of β-conglycinin. The significance of the protein level of each storage protein spot among the control and three events were tested using ANOVA procedure of SAS [34], as multiple comparisons were involved, a Bonferroni correction at 1% threshold was used to determine the significance of each protein spot. If a significant difference was observed for any spot, a comparison of the control vs. each of the three events was conducted using Dunnett’s test.

Table 1- includes results and statistics for the relative abundance of each protein spot for each of the 3 events compared with the nontransgenic control. There was a broad range of variability of both acidic and basic storage proteins components among the transgenic events. Gomes et al. [35] investigated alpha subunit of β-conglycinin spot variation in four conventional soybean genotypes namely BRS 257, 258, 267 and Embrapa 48, using 2D-PAGE analysis and also by ID-PAGE. They showed variation of total number of protein spots in BRS 257, BRS258, Embrapa 48 and BRS 267 is 102, 124, 113, and 99 respectively. They are also reported 46 differentially expressed proteins (storage, allergenic, maturation, agglutinin, and trypsin inhibitors) in 2D gels among 4 non-GM soybeans. Similarly, significant differences of both β-conglycinin and glycinin storage proteins using proteomics was reported in 14 Canadian commercial soybean varieties by Zarkada et al. [36] and in 90 Brazillian soybean cultivars by [37]. Houston et al. [38] quantified soybean allergens in 20 non-GM soybeans and observed 10-fold variation of glycinin G3 when comparing 2 different varieties. Multiple authors have reported variation in protein profiles in different cultivars of soybeans grown under different environmental conditions using different proteomics techniques [37,39, 40]. In addition, Hughes and Murphy [41] reported that glycinin content varies from 7.5% for soybean grown in a uniform environment to 10% for soybeans grown at different locations. The previous publications from our laboratory also showed variation of storage proteins, allergen, and anti-nutritional proteins among several wild and cultivated soybean genotypes [42,43]. In another study, Fehr et al. [44] reported environmental effects caused significant variation of glycinin and β-conglycinin with the exception of A3 subunit of glycinin observed in 14 genotypes grown in 8 locations for three years. In addition, Helm et al. [45] also reported variation of above proteins due to environment and genotype and the magnitude of the differences between the 2 cultivars for 11S/7S ratio varied among locations within years. Piper and Boote [46] reported that maturity group, location, and environmental variation affect characteristics of soybean seeds, which suggested temperature has a significant effect on protein expression [47]. Koo et al. [48] reported 19 differentially expressed protein spots of P34, storage, and seed maturation related proteins identified by immunoblot and MALDI-TOF/TOF analysis between PI 567476 and Clark accessions. The authors suggested that β-conglycinin subunits are products of a multigene family, and the variation in the distribution of protein spots could be due to post-translational modifications such as a sequence of glycosylation, deglycosylation, and proteolysis [49]. Several other transgenic studies in soybeans and other crops have been also reported in the literature, which concluded that the occurred variation is within the natural variation of conventional cultivars [13,22,50,51].

Variation of other proteins

Other differentially expressed proteins found in this investigation were involved in primary metabolic processes and synthesis of nucleotides and other secondary compounds including: 2 spots of isoflavone reductase (spot# 22, 23); one spot of cysteine synthase (spot#24), one spot of Pv42P (Phaseolus vulgaris) (spot# 25), one spot of quinine oxidoreductase-like protein (spot#26), one spot of alcohol dehydrogenase (spot# 27) ; one spot of nucleoside diphosphate kinase (spot# 28); one spot of peptidyl-prolylcis-trans Isomerase 1 (spot# 29); one spot of seed maturation proteins (spot# 30). Except for the spots 22 and 26, no significant difference of the protein level was observed among the control, and the three events at the 1% probability level. For the spots 22 and 26, only the difference between Control and event 2 was significant, the significance of Control vs. Event 3 and Control vs. Event 4 were not significant. The authors concluded that no significant changes were detected between transgenic and non-transgenic lines and the differences occurred in 2 spots fell in the range of natural variation

Variation in isoflavones and phenolic acids content

Isoflavonoids are the principal group of phenolic compounds present in soybeans. Lee et al. [52] extracted soybean protein and investigated the effects of year, location, and genotype on soybean isoflavones. The authors concluded that environmental and genotype effects were the most important sources of isoflavones content variation in soybeans. In another study, Wei et al. [53] investigated the differences in isoflavones content in locally grown soybeans and genetically modified imported soybeans. They reported that the isoflavone content of imported genetically modified soybeans were similar to the regular Taiwanese grown soybeans. Taie et al. [54] reported that varying levels of organic fertilization produced significant influence on the phenolic content and antioxidant activity of soybeans. Similarly, influence of postharvest and storage conditions on phenolic content was reviewed by Amarowicz et al. [55]. The authors revealed that the variation in polyphenol content was often negligible as compared to the differences in content between various plant varieties.

In the present study, we compared the isoflavone content from four soybean samples and the results are presented in (Table 3A). Identification of isoflavones was achieved by comparison of retention times, ultraviolet, and mass spectral data with authentic commercial standards or results published in the literature. The six isoflavones were identified as daidzein, glycetin, glycetein, daidzin, genistin, and genistein. Four additional peaks were tentatively identified as malonyl and acetyl conjugates of the three isoflavones at retention times 20.4, 20.9, 24.6, and 26.9 mins respectively. These compounds were present in all four soybean samples. Similar compounds have been identified and reported previously [56]. Quantification in the present study was achieved by comparing peak areas under the curve for the identified isoflavones. The relative standard deviation for four replicate analyses for most samples was less than 5%. There were minor differences in individual isoflavones between control and transgenic samples. The total quantity of isoflavones extracted from four different samples varied between 10-20%. Based on the statistical data no systemetic variation in different isoflavones content was observed between different events and control sample. For example, two isoflavonoids, genistein and daidzein, showed no significant differences between event 1 and control but showed difference among events 1, 2, 3 and control at the 1% probability level. We concluded that no consistent trend in genetic variation was observed in our study with isoflavones, but the results indicated that the variation is due to environmental effects since the seeds used in this investigation were harvested in different times. Berman et al. [57] reported that the second generations of transgenic soybeans were compositionally equivalent to the conventional soybeans. However comparative large variations in isoflavone content that ranged from 551-7584 μg/g with an average of 2973 μg/g were reported in soybeans grown in Northern and Southern China from maturity groups 0-6 [58]. Similar high variations in isoflavones content in different cultivars and growing conditions in Kansas grown soybeans were also reported [59].

Isoflavone Control Event 1 Event 2 Event 3 Bonferoni corrected p-value Trait significance
  Average (HPLC area) St. Dev %RSD Average (HPLC area) St. Dev %RSD Average (HPLC area) St. Dev %RSD Average (HPLC area) St. Dev %RSD    
Glycetin (Rt 12.8 min; n=4) 131.500 6.165 4.688 144.875 5.652 3.901 124.175 4.388 3.534 122.025 3.242 2.657 0.002  
Diadzein (Rt 30.2 min; n=4) 50.525 2.644 5.232 50.800 3.477 6.844 83.200 2.810 3.377 82.925 4.172 5.032 < 0.001 2,3 (1,C)*
Glycetein (Rt 31.6 min; n=4) 15.275 2.689 17.602 18.925 1.374 7.263 23.100 1.608 6.962 20.700 1.610 7.780 0.009  
Genistein (Rt 41.0 min; n=4) 64.425 3.075 4.773 65.750 1.686 2.565 117.650 2.983 2.535 78.500 4.511 5.747 < 0.001 2,3 (1,C)*
Phenolic acid Control Event 1 Event 2 Event 3 Bonferoni corrected p-value Trait significance
Vanilic acid (Rt 27.8 min; n=4) Average (mg/g) 0.062 0.065 0.063 0.068 0.002  
%RSD 4.602 3.663 4.731 2.546
Syringic acid (Rt 31.4 min; n=4) Average (mg/g) 0.242 0.276 0.266 0.275 0.236  
%RSD 3.254 3.707 2.053 1.349
p- Coumaric acid (Rt 38.5 min; n=4) Average (mg/g) 0.070 0.082 0.108 0.087 < 0.001 2,(3,1),C*
%RSD 1.964 2.815 1.854 1.629
Ferulic acid (Rt 42.2 min; n=4) Average (mg/g) 0.083 0.084 0.077 0.091 0.012  
%RSD 2.544 2.035 2.165 2.055
Sinapic acid (Rt 43.7 min; n=4) Average (mg/g) 0.089 0.093 0.098 0.101 0.014  
%RSD 2.255 2.537 3.431 2.286

Table 3: LC-MS analysis of phenolic compounds from control and genetically modified soybean. (A) HPLC area of four isoflavones was used for comparison and four separate extractions and analysis were carried out for each sample. (B) Quantification of five phenolic acids was achieved using authentic commercial standards and four separate extractions and analysis were carried out for each sample.

It is well documented in the literature that the phenolic acid content in seeds of several crops is influenced by cultivars, growing conditions, and the methodologies used for analysis [28, 60-62]. Table 3B includes information for the identity and quantification of phenolic acids from the four soybean samples. Identification of phenolic acids was achieved by comparison of retention times, ultraviolet and mass spectral data with authentic commercial standards. The five prominent phenolic acids were identified as vanillic, syringic, p-coumaric, ferulic, and sinapic acids. In addition, there were four minor peaks detected at retention times 14.4, 47.2, 50.6, and 56.4 mins respectively. These compounds were present in minor amounts in all four soybean samples. The total quanity of phenolic acids extracted from four different samples ranged between 0.55 mg/g to 0.62 mg/g. The relative standard deviation of four replicate analyses was below 5%. Similar phenolic acids profiles have been reported in soybean, flaxseed, and olives [63]. However, our results are different from the data published by Taie et al. [54], where authors observed chlorogenic acid in addition to the other phenolic acids in soybeans grown under organic growing conditions. Based on the statistical data, only one out of five acids, p-coumaric acid, showed significant difference among transgenic events 2, 3, 1 and control at the 1% probability level with no consistent trend. The variations in the phenolic phytochemicals content observed in the present study are within the natural range of variations observed in various conventional cultivars grown and processed under different environmental conditions or analyzed by different methods.


We separated soybean proteins using 2D-PAGE and identified 30 proteins that appeared to be differentially expressed in at least one of 3 transgenic events in comparison to non-transgenic control seeds. Each of these proteins (spots) was excised from the gels and their identity determined with MALDI-TOF/TOF tandem mass spectrometry. Phenolic acids and isoflavones were extracted and analyzed by LCMS analysis. The results indicated that minor variations in proteins, isoflavones, and phenolic acids profiles exist between control and transgenic soybean seeds. The variations observed in the present study are generally within the natural range of variations observed in conventional cultivars grown and processed under different environmental conditions. However, additional long term studies with different cultivars grown over multiple years with more transgenic insertions with their isogenic lines are needed to precisely evaluate the impact of genetic transformation. Detailed metabolomic profile analysis looking at multiple classes of phytochemicals in both control and transgenic lines are needed to confirm the current results


We thank Dr. Richard Sicher for critical review of this manuscript. Funding for this research was provided by ARS project 1245-21220-232-00D. Mention of trade name, proprietary product or vendor does not constitute a guarantee or warranty of the product by the U.S. Department of Agriculture or imply its approval to the exclusion of other products or vendors that also may be suitable.


Select your language of interest to view the total content in your interested language
Post your comment

Share This Article

Relevant Topics

Recommended Conferences

  • 3rd World Congress on Human Genetics
    August 14-15, 2017 Edinburgh, Scotland
  • 9th International Conference on Bioinformatics
    October 23-24, 2017 Paris, France
  • 9th International Conference and Expo on Proteomics
    October 23-25, 2017 Paris, France

Article Usage

  • Total views: 11659
  • [From(publication date):
    October-2014 - Jun 18, 2017]
  • Breakdown by view type
  • HTML page views : 7863
  • PDF downloads :3796

Post your comment

captcha   Reload  Can't read the image? click here to refresh

Peer Reviewed Journals
Make the best use of Scientific Research and information from our 700 + peer reviewed, Open Access Journals
International Conferences 2017-18
Meet Inspiring Speakers and Experts at our 3000+ Global Annual Meetings

Contact Us

© 2008-2017 OMICS International - Open Access Publisher. Best viewed in Mozilla Firefox | Google Chrome | Above IE 7.0 version