<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD 2.3 20070202//EN" "http://dtd.nlm.nih.gov/publishing/2.3/journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
		<journal-meta>
			<journal-id journal-id-type="nlm-ta">J Comput Sci Syst Biol</journal-id>
			<journal-id journal-id-type="publisher-id">opg</journal-id>						
			<journal-title>Journal of Computer Science &amp; Systems Biology</journal-title>			 
			<issn pub-type="epub">0974-7230</issn>
			<publisher>
				<publisher-name>OMICS Publishing Group</publisher-name>
				<publisher-loc>India, USA</publisher-loc>
			</publisher>
		</journal-meta>
		
		<article-meta>	
			<article-id pub-id-type="doi">10.4172/jcsb.1000037</article-id>		
			<article-id pub-id-type="publisher-id">000063</article-id>
			<article-categories>
				<subj-group subj-group-type="heading">
					<subject>Research Article</subject>
				</subj-group>
				<subj-group subj-group-type="Discipline">
					<subject>Biochemistry</subject>
				</subj-group>
				<subj-group subj-group-type="System Taxonomy">
					<subject>Proteomics</subject>
					<subject>Bioinformatics</subject>
					<subject>Genomics</subject>
					<subject>Transcriptomics</subject>
					<subject>Biomarkers</subject>
				</subj-group>
			</article-categories>
			<title-group>
			  <article-title>Optimizing Number of Inputs to Classify Breast Cancer Using Artificial Neural Network</article-title>
			  </title-group>
			  <contrib-group>
			    <contrib contrib-type="author">
				  <name>
				    <surname>Garg</surname>
					<given-names>Bindu</given-names>
				  </name>					  
				  <xref ref-type="aff" rid="a1">1</xref>
				  <xref ref-type="corresp" rid="cor1">&ast;</xref>	
				  </contrib>
				 <contrib contrib-type="author">
				  <name>
				    <surname>Beg</surname>
					<given-names>M.M. Sufian</given-names>
				  </name>	
				  <xref ref-type="aff" rid="a2">2</xref>
				  </contrib> 
				  <contrib contrib-type="author">
				  <name>
				    <surname>Ansari</surname>
					<given-names>A.Q.</given-names>
				  </name>	
				  <xref ref-type="aff" rid="a3">3</xref>
				  </contrib>  
				 </contrib-group> 
				<aff id="a1"><label>1</label>Department of Computer Science and Information Technology, Institute Of Technology and Management, Sec-23 A, Gurgaon-122017, India, <email>bindusingla@gmail.com</email></aff>
				<aff id="a2"><label>2</label>Department of Computer Engineering, Jamia Millia Islamia, Jamia Nagar, New Delhi-110025 India, <email>mmsbeg@hotmail.com</email></aff>
				<aff id="a3"><label>3</label>Department of Electrical Engineering, Jamia Millia Islamia, Jamia Nagar,
New Delhi-110025, India, <email>aqansari62@gmail.com</email></aff>
				<author-notes>
				<corresp id="cor1">&ast; To whom correspondence should be addressed: Bindu Garg, Department of Computer Science and Information Technology, Institute Of Technology and Management, Sec-23 A, Gurgaon-122017, India, E-mail: <email>bindusingla@gmail.com</email></corresp>
			</author-notes>
				<pub-date pub-type="collection">
				   <month>08</month>
				   <year>2009</year>
				</pub-date>
				<pub-date pub-type="epub">
				    <day>26</day>
					<month>08</month>
					<year>2009</year>
				</pub-date>
				<volume>2</volume>
				<issue>4</issue>
				<fpage>247</fpage>
				<lpage>254</lpage>  
				 <history>
				  <date date-type="received">
				     <day>07</day>
					 <month>07</month>
					 <year>2009</year>
					</date> 
				  <date date-type="accepted">
				    <day>25</day>
					<month>08</month>
					<year>2009</year>
				 </date>	
				 </history>
				 <permissions>
				 <copyright-statement><bold>Copyright</bold> &copy; 2009 Garg B, et al.</copyright-statement>
				 <copyright-year>2009</copyright-year>
				 <license license-type="open access">
				 <p>This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited</p>
				 </license>
				 </permissions>		
				 <abstract>
				<p>The Objective of this research work is to prove significant role of each attribute to decide breast cancer type
using Computer Aided Diagnosis. One of major challenges in medical domain is the extraction of intelligible knowledge from medical diagnostic data in minimum time and cost This research shows that out of these attributes stated, some attributes can be ignored to decide the type Breast Cancer as if the number of inputs are less then it reduces the time and cost in analyzing the breast cancer. In this paper, significant role of each attribute is proved by experiment in matlab.</p>
			</abstract>
			 <kwd-group>
				<kwd>ANN</kwd>
				<kwd>Back Propagation</kwd>
				<kwd>Perceptron</kwd>
				<kwd>Adaptive</kwd>
				<kwd>Commutative</kwd>									
			</kwd-group>		
   <custom-meta-wrap>
     <custom-meta>
	   <meta-name>Citation</meta-name>
	   <meta-value>Garg B, Sufian Beg MM, Ansari AQ (2009) Optimizing Number of Inputs to Classify Breast Cancer Using
Artificial Neural Network. J Comput Sci Syst Biol 2: 247-254. doi:<ext-link ext-link-type="doi" xlink:href="10.4172/jcsb.1000037">10.4172/jcsb.1000037</ext-link>
	   </meta-value>
	   </custom-meta>
	   </custom-meta-wrap>
	  </article-meta>    	 
  </front>
  <body>
     <sec>
	 	<title>Introduction</title>
			<p>Breast cancer is an uncontrolled growth of breast cells. Cancer occurs as a result of mutations, or abnormal 
changes, in the genes responsible for regulating the growth of cells and keeping them healthy. The genes are in each cell&rsquo;s nucleus, which acts as the &ldquo;control room&rdquo; of each cell. Normally, the cells in our bodies replace themselves through an orderly process of cell growth: healthy new cells take over as old ones die out. But over time, mutations can change cells by &ldquo;turning on&rdquo; certain genes and &ldquo;turning off&rdquo; others in a cell. Changed cell gains the ability to keep dividing without control, producing more cells just like it and forming a tumor (<xref ref-type="bibr" rid="r14">World Health Organization International Agency for Research on Cancer, June 2003</xref>; <xref ref-type="bibr" rid="r1">a b c World Health Organization, February 2006</xref>).</p> 
<p>A tumor can be benign (not dangerous to health) or malignant (dangerous to health). Benign tumors are not considered cancerous: their cells are close to normal in appearance, they grow slowly, and they do not invade nearby tissues or spread to other parts of the body. Malignant tumors are cancerous. Less commonly, breast cancer can begin in the stromal tissues (<xref ref-type="bibr" rid="r2">G.L Takkab, 2003</xref>) which include fatty and fibrous connective tissues of the breast. It is widely believed that the breast cancer is caused by a genetic abnormality (<ext-link ext-link-type="uri" xlink:href="http://www.merck.com/mmhe/sec23/ch266/ch266a.html">http://www.merck.com/mmhe/sec23/ch266/ch266a.html</ext-link>). However, only 5-10% of cancers are due to an abnormality inherited from mother or father. About 90% of breast cancers are due to genetic abnormalities that happen as a result of the aging process and the &ldquo;wear and tear&rdquo; of life in general. One of the outstanding problems in the present day is to diagnose, define and classify the type of breast cancer using minimum cost. Several Artificial Intelligence (AI) techniques including neural networks and fuzzy logic are successfully applied to a wide variety of decision making problem in the area of medical diagnosis.</p>
<p>This paper is organized as follows: Section 2 describes Artificial Neural Network. Section 3 describes Wisconsin Breast Cancer Database. Section 4 describes related work to this research. Section 5 describes Algorithm part. Section 6 describes Simulation and results. The final section provides some conclusion relating to the performance of the algorithm when applied to the breast cancer.</p>
	</sec>
	<sec>
		<title>Artificial Neural Network</title>
			<p>During the recent times, because of their discriminative training ability and easy implementation, the Artificial Neural Networks (ANN) find extensive use in classification of the type of tumor in breast cancer problem. It turns out that the selection of number of nodes for an ANN is an important criterion in breast cancer analysis. However, a large network means more computational expenses, resulting in more hardware and time related cost. Therefore a compact and optimum design of neural network is needed towards real time detection of tumor type in breast cancer analysis.</p>
			<p>An ANN can be defining as a highly connected array of elementary processors called neurons. A widely used model called the multi-layered perceptron (MLP) <xref ref-type="fig" rid="g1">Figure 1</xref>. The MLP type ANN consists of one input layer, one or more hidden layers and one output layer. Each layer employs several neurons and each neuron in a layer is connected to the neurons in the adjacent layer with different weights. Signals flow into the input layer, pass through the hidden layers, and arrive at the output layer. With the exception of the input layer, each neuron receives signals from the neurons of the previous layer linearly weighted by the interconnect values between neurons. The neuron then produces its output signal by passing the summed signal through a sigmoid function.</p>
 				<fig id="g1">
					<label>Figure 1</label>
					<caption>
						<title>Structure of a Three-Layered Perceptron Type ANN.</title>						
					</caption>
					<graphic xlink:href="JCSB-02-247-g001.tif"/>
				</fig>
	<sec>	
		<title>Training Algorithm</title>
			<p>Training may be supervised or unsupervised. Supervised learning refers to the training and synaptic modification of a neural network that is provided with a number of training samples or task examples that construct the training set. Each one training example consists of a unique input signal and the corresponding desired output (response). We can train a neural network to perform a particular function by adjusting the values of the connections (weights) between elements. There, the network is adjusted, based on a comparison of the output and the target, until the network output matches the target. Batch training of a network proceeds by making weight and bias changes based on an entire set (batch) of input vectors. Incremental training changes the weights and biases of a network as needed after presentation of each individual input vector. Incremental training is sometimes referred to as &ldquo;on line&rdquo; or &ldquo;adaptive&rdquo; training.</p>
			<p>The Back-Propagation Learning Algorithm is applied on multilayer feed forward networks, also referred as multilayer perceptrons (MLP). It is based on an error correction learning rule and specifically on the minimization of the mean squared error that is a measure of the difference between the actual and the desired output. Assuming that yj(n) is the actual output of the jth neuron of the output layer at the iteration n and dj(n) is the corresponding desired output, the error signal ej(n) is defined as:</p>
			<p>e j(n) = d j(n) - y j(n).</p>
			<p>As all multilayer feed forward networks, the Multilayer preceptrons are constructed of at least three layers (one input layer, one or more hidden layers and one output layer), each layer consisting of elementary processing units (artificial neurons), which incorporate a nonlinear activation function, commonly the logistic sigmoid function.</p>
			<p>There are generally four steps in the training process:</p>
			<list list-type="bullet" id="l1">
			<list-item>
				<p>Assemble the training data</p> 
			</list-item>
			<list-item>
				<p>Create the network object</p>
			</list-item>
			<list-item>
				<p>Train the network</p>
			</list-item>
			<list-item>
				<p>Simulate the network response to new inputs</p>
			</list-item>
		</list>
	</sec>
	</sec>
	<sec>
		<title>Wisconsin Diagnostic Breast Cancer Data</title>
			<p>The Wisconsin Breast Cancer dataset (<xref ref-type="bibr" rid="r7">Merz J, 1996</xref>) was obtained from a repository of a machine earning Database. The Wisconsin Prognostic Breast Cancer (WDBC) dataset contains 69 instances with 32 attributes each. There are no missing attribute in the dataset. This dataset can be use to predict if breast tumor is benign and malignant<xref ref-type="table" rid="t1">.</xref> </p>
<p>Several studies have been conducted based on this database. For example, Bennet and Mangasarian (<xref ref-type="bibr" rid="r9">Nett K.P, 1992</xref>) used linear programming techniques, obtaining a 99.6% classification rate on 487 cases (the reduced database available at the time) (<xref ref-type="bibr" rid="r11">Ravi Jain, 2003</xref>).</p>
<p>Have comparative study of fuzzy classification methods on Breast Cancer Data. There are 9 attributes which are primarily used to classify the breast cancer as either benign or malignant based on cell description gathered by microscopic examination. These are: 1. Clump Thickness 2. Uniformity of Cell Size 3. Uniformity of Cell Shape 4. Marginal Adhesion 5. Single Epithelial Cell Size 6. Bare Nuclei 7. Bland Chromatic 8. Normal Nucleoli and 9. Mitoses.</p>
<p>Breast cancer data has been divided into two categories: one dataset is used for training and other dataset is used for testing. Normalized Data is posted (<ext-link ext-link-type="uri" xlink:href="http://www.scribd.com/doc/14939420/Refrencing-Data.html">http://www.scribd.com/doc/14939420/Refrencing-Data.html</ext-link>). Multilayer neural network is trained and tested using 569 Data Samples (<ext-link ext-link-type="uri" xlink:href="http://www.scribd.com/doc/14939420/Refrencing-Data.html">http://www.scribd.com/doc/14939420/Refrencing-Data.html</ext-link>). In Training Data (<ext-link ext-link-type="uri" xlink:href="http://www.scribd.com/doc/14939420/Refrencing-Data.html">http://www.scribd.com/doc/14939420/Refrencing-Data.html</ext-link>) the first column is ID number, columns 2-10 represent characteristics of cancer cell, and the 11th field denotes the type of cancer (2 denotes malignant and 4 denotes benign).</p>
<p>The type of Breast cancer is decided by all 9 attributes (<ext-link ext-link-type="uri" xlink:href="http://www.scribd.com/doc/14939420/Refrencing-Data.html">http://www.scribd.com/doc/14939420/Refrencing-Data.html</ext-link>).</p>
	</sec>
	<sec>
		<title>Related Work</title>
			<p>The artificial neural network has become a very popular alternative in prediction and classification tasks due to
its associated memory characteristics and generalization capability (<xref ref-type="bibr" rid="r12">Shien-Ming Chon, 2004</xref>). Thirteen cytology of fine needle aspiration image (i.e. cellularity, background information, cohesiveness, significant stromal component, clump thickness, nuclear membrane, bare nuclei, normal nuclei, mitosis, nucleus stain, uniformity of cell, fragility and number of cells in cluster) are evaluated their possibility to be used as input data for artificial neural network in order to classify the breast precancerous cases into four stages, namely malignant, fibro adenoma, fibrocystic disease, and other benign diseases. An intelligent diagnostic system based on the hybrid multilayer perceptron (HMLP) network to determine the four stages of breast pre-cancerous, namely malignant, fibroadenoma, fibrocystic disease, and other benign diseases was proposed (<xref ref-type="bibr" rid="r10">Nor Ashidi MatIsa, 2007</xref>). An automated computerized classification method based on computer vision and artificial neural networks to estimate the likelihood that a mammographic lesion is malignant. The features selected to characterize the mass lesions as well as the classifier used to merge the features are crucial components of the overall classification method. We have shown that our computerized method is robust to case mix and digitization technique (<xref ref-type="bibr" rid="r6">Maryellen L. Giger, 2000</xref>). The three neural networks were trained and compared using 1300 data samples. The classification results are indicating that all the networks give good overall diagnostic performance. However, only Hybrid Multilayered Network that provides 100% accuracy, sensitivity and specificity (<xref ref-type="bibr" rid="r8">Mohd Yusoff Mashor, 1996</xref>). We have developed a computer vision system that can classify microcalcifications (<xref ref-type="bibr" rid="r15">Yuzheng C. Wu, 1995</xref>) objectively and consistently to aid radiologists in the diagnosis of breast cancer. A convolution neural network (CNN) was employed to classify benign and malignant microcalcifications in the radiographs of pathological specimen.</p>
	</sec>
	<sec>
		<title>Algorithm</title>
			<p>In this research MATLAB is used for implementation. The training data is stored in the file Train.txt which contains training patterns for patients. Firstly, feed forward neural network is trained with 2 to 10 columns as inputs and 11th column as output. Then the network is simulated with input and output. After that testing of network is done by taking Data from test.txt file (contains the testing data). The algorithm is as follows:</p>
			<p>dex= load('train.txt');</p>
			<p>p=x(:,2:10);</p>
			<p>p=p';</p>
			<p>t=x(:,11);</p>
			<p>t=t';</p>
			<p>network=newff(minmax(p),[9,6,1],</p>
			<p>{'logsig','logsig','logsig'},'traingdm');</p>
			<p>net.trainParam.lr=0.5; net.trainParam.mc=0.9;</p>
			<p>network=init(network);</p>
			<p>y=sim(network,p);</p>
			<p>figure,plot(p,t,p,y,'o'),title('Before Training');</p>
			<p>network.trainParam.epochs = 100;</p>
			<p>network=train(network,p,t);</p>
			<p>y=sim(network,p);</p>
			<p>figure,plot(p,t,p,y,'o'),title('after Training');</p>
			<p>display('Final weight vector and bias values : \n');</p>
			<p>Weights=network.iw{1}; Bias=network.b{1};</p>
			<p>Weights</p>
			<p>Bias</p>
			<p>y=load('test.txt'); p1=y(:,2:10);</p>
			<p>p1=p1'; t1=y(:,11); t1=t1';y1=sim(network,p1);</p>
			<p>figure,plot(p1,t1,p1,y1,'o'),title('after testing');</p>
			<sec>
				<title>Description of the Algorithm</title>
					<p>Training Data is stored in file Train.Txt. Train.Txt file contain 11 Column. First column ID no is omitted. It is not containing any valuable information. So input for the neural network is 2 to 10 column. Actual output is in 11<sup>th</sup> column. Feed-forward architecture of neural network is used. There are really two decisions that must be made regarding the hidden layers: how many hidden layers to actually have in the neural network and how many neurons will be in each of these layers.</p>
					<p>There are many rule-of-thumb methods for determining the correct number of neurons to use in the hidden layers, such as the following:</p>
		<list list-type="bullet" id="l2">
			<list-item>
				<p>The number of hidden neurons should be between the size of the input layer and the size of the output layer.</p> 
			</list-item>
			<list-item>
				<p>The number of hidden neurons should be 2/3 the size of the input layer, plus the size of the output layer.</p> 
			</list-item>
			<list-item>
				<p>The number of hidden neurons should be less than twice the size of the input layer.</p> 
			</list-item>
		</list>
		<p>So number of hidden neurons are 6</p>
		<p>Usually it is recommended to start with only one hidden layer, and if the results are not good, the number of hidden layers will grow up (<xref ref-type="bibr" rid="r5">Hornik K, 1990</xref>). The number of neurons of each hidden layers are being calculated automatically (<xref ref-type="bibr" rid="r13">Ward Systems Group Inc., 1999</xref>).</p>
		<p>Number of iterations has been taken 100. In standard backprop, too low a learning rate makes the network learn very slowly. Too high earning rate makes the weights and objective function diverge, so there is no learning at all. The larger the learning rate ethe larger the weight changes on each epoch, and the quicker the network learns. However, the size of the learning rate can also influence whether the network achieves a stable solution. If the learning rate gets too large, then the weight changes no longer approximate a gradient descent procedure. (True gradient descent requires infinitesimal steps). By default learning rate is .25 and momentum constant is 0.9 when default learning rate was taken then algorithm was not showing correct
Results. Value of Learning Rate is taken by heat and trial in matlab.</p>
<p>The objective of this algorithm is to optimize the number of inputs. Now each attribute is removed one by one and it is checked that which attribute is important to decide tumor type.</p>
			</sec>
	</sec>
	<sec>
		<title>Simulation and Results</title>
			<p>If the algorithm is executed for all attributes (inputs) in MATLAB, Training is given for all 9 parameters; it gives accurate results as shown in <xref ref-type="fig" rid="g2">Figure 2</xref> and <xref ref-type="fig" rid="g3">Figure 3</xref>. It means that type of breast cancer is decided accurately. Bottom X axis represents Actual Output. Top X axis Represents Target Output. Left and Right Y Axis Represents Input.</p>
			<p>If a particular attribute is removed then dotes (colored) coincides on the point 1 at the top horizontal than output is accurate and this means that particular attribute can be discarded. Sample Data in <xref ref-type="table" rid="t2">Table 2</xref> shows the mathematical deviation in accuracy of output after removing an attribute. For complete data refer to reference (<ext-link ext-link-type="uri" xlink:href="http://www.scribd.com/doc/14939420/Refrencing-Data.html">http://www.scribd.com/doc/14939420/Refrencing-Data.html</ext-link>).</p>
				<fig id="g2">
					<label>Figure 2</label>
					<caption>
						<title>All Parameters (After Training).</title>						
					</caption>
					<graphic xlink:href="JCSB-02-247-g002.tif"/>
				</fig>
				<fig id="g3">
					<label>Figure 3</label>
					<caption>
						<title>All parameters (After Testing).</title>						
					</caption>
					<graphic xlink:href="JCSB-02-247-g003.tif"/>
				</fig>
			<sec>
				<title>Removal of Clump Thickness (Without Clump thickness)</title>
					<p>If the first attributes clump thickness is removed, there are only 8 inputs to the network. Therefore feed forward neural network is trained with 3 to 10 columns as inputs and 11th column as output. To check whether this attribute plays important role in deciding type of tumor, network is first trained and tested with the sample training data. Subsequently network is tested with the testing data as shown in <xref ref-type="fig" rid="g4">Figure 4</xref> and <xref ref-type="fig" rid="g5">Figure 5</xref>.</p>
					<fig id="g4">
					<label>Figure 4</label>
					<caption>
						<title>After Training (Clump Thickness).</title>						
					</caption>
					<graphic xlink:href="JCSB-02-247-g004.tif"/>
				</fig>
				<fig id="g5">
					<label>Figure 5</label>
					<caption>
						<title>After Testing (Clump Thickness).</title>						
					</caption>
					<graphic xlink:href="JCSB-02-247-g005.tif"/>
				</fig>
					<p>It is proved that even if clump thickness is not considered then also accurate results are obtained. As from Figure 4 dotes are coinciding with the top horizontal. It means without this attribute, type of breast cancer can be decided. Refer Result Data (<ext-link ext-link-type="uri" xlink:href="http://www.scribd.com/doc/14939420/Refrencing-Data.html">http://www.scribd.com/doc/14939420/Refrencing-Data.html</ext-link>) for deviation in accuracy of output after removing Clump thickness attribute.</p>
			</sec>
			<sec>
				<title>Removal of Uniformity of Cell Size (without uniformity of cell size)</title>
					<p>If the second attribute, Uniformity of cell size, is removed, there are only 8 inputs to the network. Here feed forward neural network is trained with 2 and 4 to 10 columns as inputs and 11th column as output. To check whether this attribute plays important role in deciding type of tumor, network is first trained and tested with the sample training data as shown in the figures below.</p>
					<p><xref ref-type="fig" rid="g6">Figure 6</xref> and <xref ref-type="fig" rid="g7">figure 7</xref> shows that type of breast cancer cannot be decided without this attribute as the top horizontal is not coinciding. So this attribute cannot be discarded. Refer Result Data (<ext-link ext-link-type="uri" xlink:href="http://www.scribd.com/doc/14939420/Refrencing-Data.html">http://www.scribd.com/doc/14939420/Refrencing-Data.html</ext-link>) for deviation in accuracy of output after removing Uniformity of cell size attribute.</p>
					<fig id="g6">
					<label>Figure 6</label>
					<caption>
						<title>After Training (Uniformity of Cell Size).</title>						
					</caption>
					<graphic xlink:href="JCSB-02-247-g006.tif"/>
				</fig>
				<fig id="g7">
					<label>Figure 7</label>
					<caption>
						<title>After Testing (Uniformity of cell Size).</title>						
					</caption>
					<graphic xlink:href="JCSB-02-247-g007.tif"/>
				</fig>
					<p>Now applying the algorithm on the remaining attributes it is found that the top horizontal coincides on colored dotes on point 1 in case of Uniformity of Cell Shape, Marginal adhesion, Bare Nuclei, Bland Chromatic and Mitoses. Therefore theses attributes can be discarded one at a time only. And it does not coincide on Single Epithelial Cell Size, Normal Nucleoli so these can not be discarded.</p>
</sec>
	</sec>
	<sec>
		<title>Conclusion</title>
			<p>In this paper, a case study of attributes is done for real time detection of tumor type in breast cancer analysis.
Our study shows that uniformity of cell size, single epithelial cell size and Normal Nucleoli attributes are very important to decide tumor type. But clump thickness, uniformity of cell shape, bare nucleoli, Marginal Adhesion, bland chromatic and mitoses can be ignored. Effectively reducing the time and cost in analyzing tumor type. It can be seen that in previous researches, only the classification of cancer is being studied. In this research, the significant role of each attribute is discussed and studied using matlab. This can help in Hospitals.</p>
<p>The advantage of the proposed algorithm is that it is very simple to implement, neither the trial-and-error process nor prior knowledge about the parameters is required.</p>
	</sec>
	</body>	  
	<back>
	<ref-list>
	   <title>References</title>
	         <ref id="r1">
			 <citation citation-type="journal">			       
					 <collab>a b c World Health Organization</collab>
 					 <source>Fact sheet No. 297: Cancer</source>
  					 <year>2006</year>
  					 <month>February</month>
					 <ext-link ext-link-type="uri" xlink:href="http://www.who.int/mediacentre/factsheets/fs297/en/index.html">http://www.who.int/mediacentre/factsheets/fs297/en/index.html</ext-link>
			</citation>
			</ref>
			<ref id="r2">
			 <citation citation-type="journal">
			         <person-group>
					 <name>
					      <surname>Takkab</surname>
						  <given-names>GL</given-names>
					</name>
					 <name>
					      <surname>Kamboj</surname>
						  <given-names>VP</given-names>
					</name>	
					</person-group>
					<year>2003</year>
					<article-title>Effect of Altered Hormonal States on Histo-Chemical Distribution of Polysaccharides</article-title>
					<source>Lucknow: Central Drug Research Institute</source>
			</citation>
			</ref>
			<ref id="r3">
			 <citation citation-type="web">
			       <ext-link ext-link-type="uri" xlink:href="http://www.merck.com/mmhe/sec23/ch266/ch266a.html">http://www.merck.com/mmhe/sec23/ch266/ch266a.html</ext-link>
			</citation>
			</ref>
			<ref id="r4">
			 <citation citation-type="web">
			        <ext-link ext-link-type="uri" xlink:href="http://www.scribd.com/doc/14939420/Refrencing-Data.html">http://www.scribd.com/doc/14939420/Refrencing-Data.html</ext-link>	
			</citation>
			</ref>
			<ref id="r5">
			 <citation citation-type="journal">
			         <person-group>
					 <name>
					      <surname>Hornik</surname>
						  <given-names>K</given-names>
					</name>
					 <name>
					      <surname>Stinchombe</surname>
						  <given-names>M</given-names>
					</name>		
					 <name>
					      <surname>White</surname>
						  <given-names>H</given-names>
					</name>	
					</person-group>
					<year>1990</year>
					<article-title>Neural Networks</article-title>
					<volume>2</volume>			
					<fpage>359</fpage>
			</citation>
			</ref>
			<ref id="r6">
			 <citation citation-type="journal">
			         <person-group>
					 <name>
					      <surname>Giger</surname>
						  <given-names>ML</given-names>
					</name>
					<name>
					      <surname>Huo</surname>
						  <given-names>Z</given-names>
					</name>	
					</person-group>
					<year>2000</year>
					<article-title>Artificial Neural Networks in Breast Cancer Diagnosis, Merging of Computer- Extracted Features From Breast Images</article-title>
					<source>IEEE Xplore</source>
			</citation>
			</ref>
			<ref id="r7">
			 <citation citation-type="journal">
			         <person-group>
					 <name>
					      <surname>Merz</surname>
						  <given-names>J</given-names>
					</name>
					 <name>
					      <surname>Murphy</surname>
						  <given-names>PM</given-names>
					</name>		
					</person-group>
					<year>1996</year>
					<article-title>UCI repository of machine learning databases</article-title>
					<ext-link ext-link-type="uri" xlink:href="http://www.ics.uci.edu/-learn/MLRepository.html">http://www.ics.uci.edu/-learn/MLRepository.html</ext-link>	
			</citation>
			</ref>
			<ref id="r8">
			 <citation citation-type="journal">
			         <person-group>
					 <name>
					      <surname>Mashor</surname>
						  <given-names>MY</given-names>
					</name>
					<name>
					      <surname>Esugasini</surname>
						  <given-names>S</given-names>
					</name>
					 <name>
					      <surname>Mat-Isa</surname>
						  <given-names>NA</given-names>
					</name>
					<name>
					      <surname>Othman</surname>
						  <given-names>NH</given-names>
					</name>		
					</person-group>
					<year>1996</year>
					<article-title>Classification of Breast Lesions Using Artificial Neural Network</article-title>
					<source>Springer Berlin Heidelberg</source>	
					<volume>15</volume>	
			</citation>
			</ref>
			<ref id="r9">
			 <citation citation-type="journal">
			         <person-group>
					 <name>
					      <surname>Nett</surname>
						  <given-names>KP</given-names>
					</name>
					<name>
					      <surname>Mangasarian</surname>
						  <given-names>OL</given-names>
					</name>
					</person-group>
					<year>1992</year>
					<article-title>Neural Network Training via Linear Programming, Advances in Optimization and Parallel Computing</article-title>
					<source>Elsevier Science</source>
					<fpage>pp56</fpage>
					<lpage>pp57</lpage>
			</citation>
			</ref>
			<ref id="r10">
			 <citation citation-type="journal">
			         <person-group>
					 <name>
					      <surname>Mat-Isa</surname>
						  <given-names>NA</given-names>
					</name>
					 <name>
					      <surname>Subramaniam</surname>
						  <given-names>E</given-names>
					</name>
					<name>
					      <surname>Mashor</surname>
						  <given-names>MY</given-names>
					</name>		
					<name>
					      <surname>Othman</surname>
						  <given-names>NH</given-names>
					</name>				
					</person-group>
					<year>2007</year>
					<article-title>Fine Needle Aspiration Cytology Evaluation for Classifying Breast Cancer Using Artificial Neural</article-title>
					<source>American Journal of Applied Sciences</source>
					<volume>4</volume>					
					<fpage>999</fpage>
					<lpage>1008</lpage>
			</citation>
			</ref>
			<ref id="r11">
			 <citation citation-type="confproc">
			         <person-group>
					 <name>
					      <surname>Jain</surname>
						  <given-names>R</given-names>
					</name>
					<name>
					      <surname>Abraham</surname>
						  <given-names>A</given-names>		
					</name>		
					</person-group>
					<year>2003</year>
					<article-title>A Comparative Study of Fuzzy Classification Methods on Breast Cancer Data</article-title>
  					<conf-name>International Conference on Artificial Neural Network IWANN&rsquo;03</conf-name>
  					<conf-loc>Spain</conf-loc>
			</citation>
			</ref>
			<ref id="r12">
			 <citation citation-type="journal">
			         <person-group>
					 <name>
					      <surname>Shien</surname>
						  <given-names>MC</given-names>
					</name>	
					<name>
					      <surname>Lee</surname>
						  <given-names>TS</given-names>
					</name>	
					<name>
					      <surname>Shao</surname>
						  <given-names>YF</given-names>
					</name>	
					<name>
					      <surname>Chen</surname>
						  <given-names>IF</given-names>
					</name>										
					</person-group>
					<year>2004</year>
					<article-title>Mining the breast cancer pattern using artificial neural networks and multivariate adaptive regression spines</article-title>										
					<source>Expert System With Application</source> 
					<volume>27</volume>
					<fpage>pp133</fpage>
					<lpage>pp142</lpage>					
			</citation>
			</ref>
			<ref id="r13">
			 <citation citation-type="book">
			        <collab>Ward Systems Group Inc</collab>
  					<source>NeuroShel 2</source>
 				 	<year>1999</year>
  					<publisher-name>Ward Systems Group</publisher-name>
			</citation>
			</ref>
			<ref id="r14">
			  <citation citation-type="journal">			       
					 <collab>World Health Organization International Agency for Research on Cancer</collab>
 					 <source>World Cancer Report</source>
  					 <year>2003</year>
  					 <month>February</month>
					 <ext-link ext-link-type="uri" xlink:href="http://www.iarc.fr/IARCPress/pdfs/wcr/index.php">http://www.iarc.fr/IARCPress/pdfs/wcr/index.php</ext-link>
			</citation>
			</ref>
			<ref id="r15">
			 <citation citation-type="journal">
			         <person-group>
					 <name>
					      <surname>Wu</surname>
						  <given-names>YC</given-names>
					</name>						
					</person-group>
					<year>1995</year>
					<article-title>Classification of Microcalcification of the Diagnosis of Breast Cancer using Artificial Neural
Networks</article-title>										
					<source>Annual Report 1 Sep 94</source>
					<fpage>26</fpage>			
			</citation>
			</ref>
			</ref-list>
			</back>
			<floats-wrap>
			<table-wrap position="float" id="t1">
			<label>Table 1.</label>
  			<caption>
  				<title>Statistical Details of Data.</title>
  			</caption>
   <table frame="hsides" rules="groups">
      <thead>
		 <tr>
		 	<th align="left">Class</th>
			<th align="left">Frequency</th>
			<th align="left">Percent</th>
			<th align="left">Valid Percent</th>
			<th align="left">Cumulative Percent</th>			
		</tr>		 
      </thead>
      <tbody>
	     <tr>
			<td>1</td>	
			<td>357</td>
			<td>62.7</td>	
			<td>62.7</td>
			<td>62.7</td>		
         </tr>        
         <tr>
            <td>2</td>	
			<td>212</td>
			<td>37.3</td>
			<td>37.3</td>
			<td>100</td>													
         </tr>
		 <tr>
		 	<td>Total</td>
			<td>569</td>
			<td>100.0</td>
			<td>100.0</td>
			<td>&nbsp;</td>
		</tr>		
	</tbody>
	</table>
	</table-wrap>
	<table-wrap position="float" id="t2">
	<label>Table 2.</label>
  			<caption>
  				<title>Sample Data</title>
  			</caption>
   <table frame="hsides" rules="groups">
      <thead>
		 <tr style="background-color:#C0C0C0;">
		 	<th align="left">Desired Output</th>
			<th align="left">Actual Output</th>
			<th align="left">Error</th>
			<th align="left">Clump Thickness</th>
			<th align="left">Error</th>	
			<th align="left">Uniformity of Cell Size</th>
			<th align="left">Error</th>
			<th align="left">Uniformity of Cell Shape</th>
			<th align="left">Error</th>
			<th align="left">Marginal Adhesion</th>
			<th align="left">Error</th>
			<th align="left">Single Epithelial Cell Size</th>
			<th align="left">Error</th>
			<th align="left">Bare Nuceli</th>
			<th align="left">Error</th>		
		</tr>		 
      </thead>
      <tbody>
	     <tr>
			<td>0</td>	
			<td>0.0001</td>
			<td>0.0001</td>	
			<td>0</td>
			<td>0</td>
			<td>0</td>
			<td>0</td>
			<td>0</td>
			<td>0</td>
			<td>0.0024</td>
			<td>0.0024</td>
			<td>0.0072</td>
			<td>0.0072</td>
			<td>0</td>
			<td>0</td>		
         </tr>        
         <tr>
            <td>0</td>	
			<td>0.0001</td>
			<td>0.0001</td>
			<td>0</td>
			<td>0</td>
			<td>0</td>
			<td>0</td>
			<td>0</td>
			<td>0</td>
			<td>0.0024</td>		
			<td>0.0024</td>
			<td>0.0072</td>
			<td>0.0072</td>
			<td>0</td>
			<td>0</td>												
         </tr>
		 <tr>
		 	<td>0</td>
			<td>0.0001</td>
			<td>0.0001</td>
			<td>0</td>
			<td>0</td>
			<td>0</td>
			<td>0</td>
			<td>0</td>
			<td>0</td>
			<td>0.0024</td>
			<td>0.0024</td>
			<td>0.0072</td>
			<td>0.0072</td>
			<td>0</td>
			<td>0</td>
		</tr>		
		<tr>
			<td>0</td>
			<td>0.0001</td>
			<td>0.0001</td>
			<td>0</td>
			<td>0</td>
			<td>0</td>
			<td>0</td>
			<td>0</td>
			<td>0</td>
			<td>0.0024</td>
			<td>0.0024</td>
			<td>0.0072</td>
			<td>0.0072</td>
			<td>0</td>
			<td>0</td>
		</tr>
		<tr>
			<td>0</td>
			<td>0.0001</td>
			<td>0.0001</td>
			<td>0</td>
			<td>0</td>
			<td>0</td>
			<td>0</td>
			<td>0</td>
			<td>0</td>
			<td>0.0024</td>
			<td>0.0024</td>
			<td>0.0072</td>
			<td>0.0072</td>
			<td>0</td>
			<td>0</td>
		</tr>
		<tr>
			<td>0</td>
			<td>0.0001</td>
			<td>0.0001</td>
			<td>0</td>
			<td>0</td>
			<td>0</td>
			<td>0</td>
			<td>0</td>
			<td>0</td>
			<td>0.0024</td>
			<td>0.0024</td>
			<td>0.0072</td>
			<td>0.0072</td>
			<td>0</td>
			<td>0</td>
		</tr>
	</tbody>
	</table>
	</table-wrap>	
	</floats-wrap>
  </article>
