A research of word difficulty clustering based on improved k-means++ algorithm

Dongjie Wang; Lina Liu

doi:10.1088/1742-6596/2905/1/012012


JOURNAL ARTICLE


Year: 2024; Journal: Journal of Physics: Conference Series; Vol: 2905 (1); Pages: 012012; Publisher: IOP Publishing


Abstract

Wordle, a daily puzzle game offered by the New York Times, has taken the world by storm, and analyzing the difficulty of words has become a hot topic. However, word difficulty analysis is highly sensitive to how word features are extracted, which makes the task hard. To address this, this paper proposes an improved K-means++ algorithm for clustering words by difficulty and validates it by simulation on the data from Problem C of the U.S. Mathematical Contest in Modeling (MCM). The 359 words in the data are clustered into six classes, A, B, C, D, E, and F, and the word EERIE is assigned to class B based on the distance between its word attributes and those of the six cluster centers. The model's Davies-Bouldin (DB) index is 0.815, and the analysis of variance (ANOVA) of the clustering indicators is highly significant, which the authors take as evidence of the model's accuracy.
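The pipeline summarized in the abstract (seed the centers, cluster, assign a new word to the nearest center, score with the DB index) can be illustrated with a minimal sketch. It uses standard k-means++ seeding, not the authors' improved variant, which the abstract does not describe. The 2-D attribute vectors are synthetic placeholders rather than Wordle data, and the function names are hypothetical.

```python
import math
import random


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeanspp_init(points, k, rng):
    """Standard k-means++ seeding: each new center is sampled with probability
    proportional to its squared distance from the nearest chosen center."""
    centers = [rng.choice(points)]
    while len(centers) < k:
        weights = [min(sq_dist(p, c) for c in centers) for p in points]
        centers.append(rng.choices(points, weights=weights, k=1)[0])
    return centers


def nearest(point, centers):
    """Index of the center closest to `point`."""
    return min(range(len(centers)), key=lambda i: sq_dist(point, centers[i]))


def kmeans(points, k, iters=100, seed=0):
    """Lloyd iterations started from k-means++ seeds."""
    rng = random.Random(seed)
    centers = kmeanspp_init(points, k, rng)
    for _ in range(iters):
        labels = [nearest(p, centers) for p in points]
        new_centers = []
        for i in range(k):
            members = [p for p, lab in zip(points, labels) if lab == i]
            if members:
                new_centers.append(tuple(sum(col) / len(members) for col in zip(*members)))
            else:
                new_centers.append(centers[i])  # keep the center of an empty cluster
        if new_centers == centers:
            break
        centers = new_centers
    return centers, [nearest(p, centers) for p in points]


def davies_bouldin(points, labels, centers):
    """Davies-Bouldin index; lower values mean compact, well-separated clusters.
    Assumes every cluster has at least one member."""
    k = len(centers)
    scatter = []
    for i in range(k):
        members = [p for p, lab in zip(points, labels) if lab == i]
        scatter.append(sum(math.sqrt(sq_dist(p, centers[i])) for p in members) / len(members))
    worst = [
        max((scatter[i] + scatter[j]) / math.sqrt(sq_dist(centers[i], centers[j]))
            for j in range(k) if j != i)
        for i in range(k)
    ]
    return sum(worst) / k


# Synthetic attribute vectors (assumption): three well-separated groups of four.
points = [(x + dx, y + dy)
          for x, y in [(0, 0), (1000, 0), (0, 1000)]
          for dx in (0, 1) for dy in (0, 1)]

centers, labels = kmeans(points, k=3)
new_label = nearest((999.0, 2.0), centers)  # classify an unseen "word"
db = davies_bouldin(points, labels, centers)
print(labels, new_label, db)
```

Cluster indices are arbitrary, so mapping them to letters such as A to F would need an extra ordering step, for example sorting the centers by a difficulty attribute.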

Keywords:

Cluster analysis; Word (group theory); Computer science; Natural language processing; Algorithm; Artificial intelligence; Linguistics

Metrics

FWCI (Field Weighted Citation Impact): 0.00

Citation Normalized Percentile: 0.36

Topics

Educational Technology and Assessment

Physical Sciences → Computer Science → Information Systems

Advanced Decision-Making Techniques

Physical Sciences → Computer Science → Information Systems

Online Learning and Analytics

Physical Sciences → Computer Science → Computer Science Applications


Related Documents

Research on k-means Clustering Algorithm: An Improved k-means Clustering Algorithm

Research on Improved K-Means Clustering Algorithm

Research on K-Means Clustering Algorithm Based on Improved Genetic Algorithm

Research on text clustering algorithm based on improved K-means

Improved K-Means Clustering Algorithm