Enhanced Data Mining Algorithm (B-knn) - (B-knn)

Hiba Mohamed sulieman saliem

Enhanced Data Mining Algorithm (B-knn) - (B-knn)

dc.contributor.author	Hiba Mohamed sulieman saliem
dc.date.accessioned	2022-09-19T10:34:04Z
dc.date.available	2022-09-19T10:34:04Z
dc.date.issued	2022-02
dc.description	A research Studies in Fulfillment of The Requirements for The Degree of Doctor Philosophy In Computer Science	en_US
dc.description.abstract	Abstract The enhanced algorithm B-kNN , it is a variation of kNN algorithm equipped with notion of minimum and maximum points (MMP) and boundary subsets (BS) which is a subset for each class consisted the boundary of classes for preprocessing the training dataset to reduce the size and improve the efﬁciency of kNN. The B-kNN algorithm evaluates test element belonging to the MMP of each class. If there exists a class whose MMP include the object, the object is predicted to be of the class. If not, BS are fed into B-kNN .In some cases the size of (BS) maybe big and the MMP of classes are overlapping which is made it useless and erodes the efficiency of B-kNN algorithm. To address this, we present the EB-kNN algorithm built upon the notion of readjusting boundary of classes to avoid the element that fall out of range in prediction phase and eliminate the overlap of classes to improve the efﬁciency of B-kNN algorithm. We conducted experiment applied EB- kNN algorithm using the iris dataset and measure its accuracy, recall, precision, and F1 score using confusion matrices to evaluate EB-kNN. We employment our algorithm using The Java programming language and the weka standard Java tool for performing both machine learning experiments and for embedding trained models in Java applications. The results of experiment show that EB-kNN improves an average of 97% in efﬁciency over B-kNN using the entire training dataset . خلاصة البحث خوارزمية B-kNN هى خوارزميه محسنه لزيادة فعالية اداء خوارزمية knn التقليديه إحدى خوارزميات المصنف التمهيدي الخاضعة للإشراف والتي تعالج مشكلات التعرف على الأنماط ، وأيضًا أفضل الخيارات لمعالجة بعض المهام المتعلقة بالتصنيف. خوارزمية B-kNN مجهزة بمفهوم الحد الأدنى والحد الأقصى للنقاط (MMP) والمجموعات الفرعية للحدود (BS) وهي عباره عن مجموعة فرعية لكل فئة تحتوي على واحده من حدود الفئات (MMP) لمجموعة بيانات التدريب وذلك لتقليل حجمها وتحسين كفاءة kNN . حيث تقوم خوارزمية B-kNN بتقييم عنصر الاختبار الذي نريد ان نحدد انه ينتمي الى اي مدى MMP من الفئات وفي حالة وجود فئة تشتمل على قيم الكائن فانه ينتمي لهذه الفئه ، وإذا وقعت قيم الكائن خارج جميع مدى الفئات يتم اللجؤ الى المجموعه الفرعية للحدود (BS) وقياس المسافه بين جميع قيم كائنات هذه المجموعه و كائن الاختبار لتحديد اقصر مسافه لتكون هى فئة كائن الاختبار. ولكن في بعض الحالات ، قد يكون حجم (BS) كبيرًا ومن المتوقع ايضاً تتداخل MMP للفئات مما يجعلها ذات فائدة اقل ويؤدي إلى تقليل كفاءة خوارزمية B-kNN. لمعالجة هذا الأمر ، قدمنا خوارزمية EB-kNN المبنية على فكرة إعادة ضبط حدود الفئات لتجنب العنصر الذي يقع خارج النطاق في مرحلة التنبؤ وايضاً تفادي تداخل الفئات مما يودي من التحسين في كفاءة خوارزمية B-kNN. أجرينا تجربة عمليه تم تطبيق خوارزمية EB- kNN باستخدام مجموعة البيانات iris dataset حيث قمنا بقياس دقتها , واسترجاعها , ودقتها , ودرجة F1 باستخدام مصفوفات الارتباك confusion matrices وذلك لتقييم اداء الخوارزميه . وتم التنفيذ باستخدام لغة البرمجة Java وأداة Java القياسية Weka وذلك لإجراء تجارب التعلم الآلي ولتضمين النماذج المدربة في تطبيقات Java. وقد اظهرت نتائج التجربة أن EB-kNN تحسنت بمتوسط 97٪ في الكفاءة على B-kNN باستخدام مجموعة بيانات التدريب بأكملها.	en_US
dc.description.sponsorship	Prof : Saif Eldin Fattoh Osman	en_US
dc.identifier.uri	http://hdl.handle.net/123456789/17480
dc.publisher	Al-Neelain University	en_US
dc.subject	algorithm B-kNN	en_US
dc.subject	Enhanced Data	en_US
dc.title	Enhanced Data Mining Algorithm (B-knn) - (B-knn)	en_US
dc.title.alternative	تحسين خوارزمية تنقيب البيانات	en_US
dc.type	Thesis	en_US

Files

Original bundle

Now showing 1 - 1 of 1

Name:: البحث كامل.pdf
Size:: 2.21 MB
Format:: Adobe Portable Document Format
Description:

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 1.71 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

PHD theses : Computer Science