Classifying Seer Dataset for Breast Cancer Using C4.5 Algorithm
Main Article Content
Abstract
When it comes to cancer-related deaths among women, breast cancer is now the second largest cause. The chances of long-term survival for those with breast cancer are greatly improved if the illness is caught early. Patients in the SEER breast cancer dataset have been categorized using the C4.5 classification algorithm as having "Carcinoma in situ" (an early stage of cancer) or "Malignant potential." The raw dataset has been cleaned up by pre-processing methods, and the important attributes for classification have been extracted. Data that has already been cleaned and prepared for analysis has been randomly sampled for testing. The acquired rule set was put to the test on the remaining information. This section presents and discusses the obtained results.