Ensemble Partition Sampling (EPS) for Improved Multi-Class Classification

Article Subjects > Engineering Europe University of Atlantic > Research > Scientific Production
Fundación Universitaria Internacional de Colombia > Research > Scientific Production
Ibero-american International University > Research > Scientific Production
Ibero-american International University > Research > Scientific Production
Universidad Internacional do Cuanza > Research > Scientific Production
Abierto Inglés Classification is a commonly used technique in data mining and is applied in various fields such as sentiment analysis, fraud detection, and fault diagnosis. Multiclass classification, which involves more than two classes, is more complex than binary classification. There are mainly two ways to approach multiclass classification, one is to expand the binary classifier into a multiclass classifier through various strategies and the other is to divide the multiclass classification problem into multiple binary problems (binarization). Two popular approaches for binarization are One vs One (OvO) and One vs All (OvA). It is simpler to aggregate the outputs of all binary classifiers as the number of classifiers decreases. However, it causes an imbalance of positive and negative sample numbers, which affects the classification effect of each binary classifier. In this article, we contribute to the field of ensemble learning and multi-class classification by proposing a new method called Ensemble Partition Sampling (EPS). This article presents a new approach to multiclass classification using an "Ensemble Partition Sampling" method within the "one-vs-all" (OvA) framework. The primary goal of this method is to tackle the problem of data imbalance by incorporating ensemble learning and preprocessing techniques into each binary dataset. The study found that Ensemble Partition Sampling (EPS) is the most effective method for imbalanced and multiclass imbalanced classification, outperforming other methods including OvA, SMOTE, k-means-SMOTE, Bagging-RB, DES-MI, OvO-EASY, and OvO-SMB. The study used CART, Random Forest, and SVM as classifiers, and the results consistently showed that EPS outperformed all other algorithms. The findings suggest that EPS is a highly effective method for improving classification performance in imbalanced and multiclass imbalanced datasets. metadata Jabir, Brahim and Díez, Isabel De la Torre and Bautista Thompson, Ernesto and Ramírez-Vargas, Debora L. and Kuc Castilla, Ángel Gabriel mail UNSPECIFIED (2023) Ensemble Partition Sampling (EPS) for Improved Multi-Class Classification. IEEE Access. p. 1. ISSN 2169-3536

Full text not available from this repository.

Abstract

Classification is a commonly used technique in data mining and is applied in various fields such as sentiment analysis, fraud detection, and fault diagnosis. Multiclass classification, which involves more than two classes, is more complex than binary classification. There are mainly two ways to approach multiclass classification, one is to expand the binary classifier into a multiclass classifier through various strategies and the other is to divide the multiclass classification problem into multiple binary problems (binarization). Two popular approaches for binarization are One vs One (OvO) and One vs All (OvA). It is simpler to aggregate the outputs of all binary classifiers as the number of classifiers decreases. However, it causes an imbalance of positive and negative sample numbers, which affects the classification effect of each binary classifier. In this article, we contribute to the field of ensemble learning and multi-class classification by proposing a new method called Ensemble Partition Sampling (EPS). This article presents a new approach to multiclass classification using an "Ensemble Partition Sampling" method within the "one-vs-all" (OvA) framework. The primary goal of this method is to tackle the problem of data imbalance by incorporating ensemble learning and preprocessing techniques into each binary dataset. The study found that Ensemble Partition Sampling (EPS) is the most effective method for imbalanced and multiclass imbalanced classification, outperforming other methods including OvA, SMOTE, k-means-SMOTE, Bagging-RB, DES-MI, OvO-EASY, and OvO-SMB. The study used CART, Random Forest, and SVM as classifiers, and the results consistently showed that EPS outperformed all other algorithms. The findings suggest that EPS is a highly effective method for improving classification performance in imbalanced and multiclass imbalanced datasets.

Item Type: Article
Uncontrolled Keywords: Ensemble Partition Sampling (EPS); One vs One (OvO); One vs All (OvA); Multi-Class Classification; Imbalanced learning; multiclass imbalanced classification
Subjects: Subjects > Engineering
Divisions: Europe University of Atlantic > Research > Scientific Production
Fundación Universitaria Internacional de Colombia > Research > Scientific Production
Ibero-american International University > Research > Scientific Production
Ibero-american International University > Research > Scientific Production
Universidad Internacional do Cuanza > Research > Scientific Production
Date Deposited: 09 May 2023 23:30
Last Modified: 09 May 2023 23:30
URI: https://repositorio.unib.org/id/eprint/7028

Actions (login required)

View Item View Item

<a href="/10290/1/Influence%20of%20E-learning%20training%20on%20the%20acquisition%20of%20competences%20in%20basketball%20coaches%20in%20Cantabria.pdf" class="ep_document_link"><img class="ep_doc_icon" alt="[img]" src="/style/images/fileicons/text.png" border="0"/></a>

en

open

Influence of E-learning training on the acquisition of competences in basketball coaches in Cantabria

The main aim of this study was to analyse the influence of e-learning training on the acquisition of competences in basketball coaches in Cantabria. The current landscape of basketball coach training shows an increasing demand for innovative training models and emerging pedagogies, including e-learning-based methodologies. The study sample consisted of fifty students from these courses, all above 16 years of age (36 males, 14 females). Among them, 16% resided outside the autonomous community of Cantabria, 10% resided more than 50 km from the city of Santander, 36% between 10 and 50 km, 14% less than 10 km, and 24% resided within Santander city. Data were collected through a Google Forms survey distributed by the Cantabrian Basketball Federation to training course students. Participation was voluntary and anonymous. The survey, consisting of 56 questions, was validated by two sports and health doctors and two senior basketball coaches. The collected data were processed and analysed using Microsoft® Excel version 16.74, and the results were expressed in percentages. The analysis revealed that 24.60% of the students trained through the e-learning methodology considered themselves fully qualified as basketball coaches, contrasting with 10.98% of those trained via traditional face-to-face methodology. The results of the study provide insights into important characteristics that can be adjusted and improved within the investigated educational process. Moreover, the study concludes that e-learning training effectively qualifies basketball coaches in Cantabria.

Producción Científica

Josep Alemany Iturriaga mail josep.alemany@uneatlantico.es, Álvaro Velarde-Sotres mail alvaro.velarde@uneatlantico.es, Javier Jorge mail , Kamil Giglio mail ,

Alemany Iturriaga

<a class="ep_document_link" href="/11941/1/healthcare-12-00942.pdf"><img class="ep_doc_icon" alt="[img]" src="/style/images/fileicons/text.png" border="0"/></a>

en

open

Risk Factors for Eating Disorders in University Students: The RUNEAT Study

The purpose of the study is to assess the risk of developing general eating disorders (ED), anorexia nervosa (AN), and bulimia nervosa (BN), as well as to examine the effects of gender, academic year, place of residence, faculty, and diet quality on that risk. Over two academic years, 129 first- and fourth-year Uneatlántico students were included in an observational descriptive study. The self-administered tests SCOFF, EAT-26, and BITE were used to determine the participants’ risk of developing ED. The degree of adherence to the Mediterranean diet (MD) was used to evaluate the quality of the diet. Data were collected at the beginning (T1) and at the end (T2) of the academic year. The main results were that at T1, 34.9% of participants were at risk of developing general ED, AN 3.9%, and BN 16.3%. At T2, these percentages were 37.2%, 14.7%, and 8.5%, respectively. At T2, the frequency of general ED in the female group was 2.5 times higher (OR: 2.55, 95% CI: 1.22–5.32, p = 0.012). The low-moderate adherence to the MD students’ group was 0.92 times less frequent than general ED at T2 (OR: 0.921, 95%CI: 0.385–2.20, p < 0.001). The most significant risk factor for developing ED is being a female in the first year of university. Moreover, it appears that the likelihood of developing ED generally increases during the academic year.

Producción Científica

Imanol Eguren García mail imanol.eguren@uneatlantico.es, Sandra Sumalla Cano mail sandra.sumalla@uneatlantico.es, Sandra Conde González mail , Anna Vila-Martí mail , Mercedes Briones Urbano mail mercedes.briones@uneatlantico.es, Raquel Martínez Díaz mail raquel.martinez@uneatlantico.es, Iñaki Elío Pascual mail inaki.elio@uneatlantico.es,

Eguren García

<a class="ep_document_link" href="/11592/1/fchem-12-1361980.pdf"><img class="ep_doc_icon" alt="[img]" src="/style/images/fileicons/text.png" border="0"/></a>

en

open

Efficacy and classification of Sesamum indicum linn seeds with Rosa damascena mill oil in uncomplicated pelvic inflammatory disease using machine learning

Background and objectives: As microbes are developing resistance to antibiotics, natural, botanical drugs or traditional herbal medicine are presently being studied with an eye of great curiosity and hope. Hence, complementary and alternative treatments for uncomplicated pelvic inflammatory disease (uPID) are explored for their efficacy. Therefore, this study determined the therapeutic efficacy and safety of Sesamum indicum Linn seeds with Rosa damascena Mill Oil in uPID with standard control. Additionally, we analyzed the data with machine learning. Materials and methods: We included 60 participants in a double-blind, double-dummy, randomized standard-controlled study. Participants in the Sesame and Rose oil group (SR group) (n = 30) received 14 days course of black sesame powder (5 gm) mixed with rose oil (10 mL) per vaginum at bedtime once daily plus placebo capsules orally. The standard group (SC), received doxycycline 100 mg twice and metronidazole 400 mg thrice orally plus placebo per vaginum for the same duration. The primary outcome was a clinical cure at post-intervention for visual analogue scale (VAS) for lower abdominal pain (LAP), and McCormack pain scale (McPS) for abdominal-pelvic tenderness. The secondary outcome included white blood cells (WBC) cells in the vaginal wet mount test, safety profile, and health-related quality of life assessed by SF-12. In addition, we used AdaBoost (AB), Naïve Bayes (NB), and Decision Tree (DT) classifiers in this study to analyze the experimental data. Results: The clinical cure for LAP and McPS in the SR vs SC group was 82.85% vs 81.48% and 83.85% vs 81.60% on Day 15 respectively. On Day 15, pus cells less than 10 in the SR vs SC group were 86.6% vs 76.6% respectively. No adverse effects were reported in both groups. The improvement in total SF-12 score on Day 30 for the SR vs SC group was 82.79% vs 80.04% respectively. In addition, our Naive Bayes classifier based on the leave-one-out model achieved the maximum accuracy (68.30%) for the classification of both groups of uPID. Conclusion: We concluded that the SR group is cost-effective, safer, and efficacious for curing uPID. Proposed alternative treatment (test drug) could be a substitute of standard drug used for Female genital tract infections.

Producción Científica

X. Sumbul mail , Arshiya Sultana mail , Md Belal Bin Heyat mail , Khaleequr Rahman mail , Faijan Akhtar mail , Saba Parveen mail , Mercedes Briones Urbano mail mercedes.briones@uneatlantico.es, Vivian Lipari mail vivian.lipari@uneatlantico.es, Isabel De la Torre Díez mail , Azmat Ali Khan mail , Abdul Malik mail ,

Sumbul

<a class="ep_document_link" href="/11265/1/Food%20Frontiers%20-%202024%20-%20Cassotta%20-%20Human%E2%80%90based%20new%20approach%20methodologies%20to%20accelerate%20advances%20in%20nutrition%20research.pdf"><img class="ep_doc_icon" alt="[img]" src="/style/images/fileicons/text.png" border="0"/></a>

en

open

Human‐based new approach methodologies to accelerate advances in nutrition research

Much of nutrition research has been conventionally based on the use of simplistic in vitro systems or animal models, which have been extensively employed in an effort to better understand the relationships between diet and complex diseases as well as to evaluate food safety. Although these models have undeniably contributed to increase our mechanistic understanding of basic biological processes, they do not adequately model complex human physiopathological phenomena, creating concerns about the translatability to humans. During the last decade, extraordinary advancement in stem cell culturing, three-dimensional cell cultures, sequencing technologies, and computer science has occurred, which has originated a wealth of novel human-based and more physiologically relevant tools. These tools, also known as “new approach methodologies,” which comprise patient-derived organoids, organs-on-chip, multi-omics approach, along with computational models and analysis, represent innovative and exciting tools to forward nutrition research from a human-biology-oriented perspective. After considering some shortcomings of conventional in vitro and vivo approaches, here we describe the main novel available and emerging tools that are appropriate for designing a more human-relevant nutrition research. Our aim is to encourage discussion on the opportunity to explore innovative paths in nutrition research and to promote a paradigm-change toward a more human biology-focused approach to better understand human nutritional pathophysiology, to evaluate novel food products, and to develop more effective targeted preventive or therapeutic strategies while helping in reducing the number and replacing animals employed in nutrition research.

Producción Científica

Manuela Cassotta mail manucassotta@gmail.com, Danila Cianciosi mail , Maria Elexpuru Zabaleta mail maria.elexpuru@uneatlantico.es, Iñaki Elío Pascual mail inaki.elio@uneatlantico.es, Sandra Sumalla Cano mail sandra.sumalla@uneatlantico.es, Francesca Giampieri mail francesca.giampieri@uneatlantico.es, Maurizio Battino mail maurizio.battino@uneatlantico.es,

Cassotta

<a class="ep_document_link" href="/11322/1/journal.pone.0298582.pdf"><img class="ep_doc_icon" alt="[img]" src="/style/images/fileicons/text.png" border="0"/></a>

en

open

Design and development of patient health tracking, monitoring and big data storage using Internet of Things and real time cloud computing

With the outbreak of the COVID-19 pandemic, social isolation and quarantine have become commonplace across the world. IoT health monitoring solutions eliminate the need for regular doctor visits and interactions among patients and medical personnel. Many patients in wards or intensive care units require continuous monitoring of their health. Continuous patient monitoring is a hectic practice in hospitals with limited staff; in a pandemic situation like COVID-19, it becomes much more difficult practice when hospitals are working at full capacity and there is still a risk of medical workers being infected. In this study, we propose an Internet of Things (IoT)-based patient health monitoring system that collects real-time data on important health indicators such as pulse rate, blood oxygen saturation, and body temperature but can be expanded to include more parameters. Our system is comprised of a hardware component that collects and transmits data from sensors to a cloud-based storage system, where it can be accessed and analyzed by healthcare specialists. The ESP-32 microcontroller interfaces with the multiple sensors and wirelessly transmits the collected data to the cloud storage system. A pulse oximeter is utilized in our system to measure blood oxygen saturation and body temperature, as well as a heart rate monitor to measure pulse rate. A web-based interface is also implemented, allowing healthcare practitioners to access and visualize the collected data in real-time, making remote patient monitoring easier. Overall, our IoT-based patient health monitoring system represents a significant advancement in remote patient monitoring, allowing healthcare practitioners to access real-time data on important health metrics and detect potential health issues before they escalate.

Producción Científica

Md. Milon Islam mail , Imran Shafi mail , Sadia Din mail , Siddique Farooq mail , Isabel de la Torre Díez mail , Jose Breñosa mail josemanuel.brenosa@uneatlantico.es, Julio César Martínez Espinosa mail ulio.martinez@unini.edu.mx, Imran Ashraf mail ,

Islam