Adapting Soundex for Arabic Names with Preprocessing for Better Accuracy
DOI:
https://doi.org/10.59743/jbs.v37i2.295
Keywords:
Soundex algorithm, Natural Language Processing, Arabic names, Phonetic similarity, Name retrieval
Abstract
The Soundex algorithm retrieves names by phonetic similarity: it converts each name into a numerical code so that names can be compared by pronunciation rather than by exact spelling. Names that sound alike but are spelled differently therefore receive matching codes, which improves the accuracy of database searches. Although Soundex was originally used for languages such as English, it has been adapted to the characteristics of other languages, including Arabic, where it must handle language-specific variation in pronunciation and spelling. We developed a system that preprocesses Arabic names before encoding them, taking the unique characteristics of the Arabic language into account, with the aim of improving precision and recall. We compiled a list of Arabic names and divided them into groups according to their pronunciation. The proposed system was effective and accurate in retrieving and classifying names; overall performance was good, although certain specific cases still need improvement.
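As a rough illustration of the two ideas in the abstract, the Python sketch below shows the classic English Soundex encoding (one letter followed by three digits) and a preprocessing step for Arabic names. It is not the system described in the paper: `normalize_arabic` and its choice of rules (removing diacritics and tatweel, unifying alef forms, ta marbuta and alef maqsura) are assumptions based on common Arabic text normalization practice, and the paper's actual rules and Arabic letter-to-code mapping are not given here.

```python
import re

# Classic (American) Soundex digit groups for Latin letters.
_CODES = {}
for group, digit in (("BFPV", "1"), ("CGJKQSXZ", "2"), ("DT", "3"),
                     ("L", "4"), ("MN", "5"), ("R", "6")):
    for letter in group:
        _CODES[letter] = digit


def soundex(name: str) -> str:
    """Return the 4-character classic Soundex code of a Latin-script name."""
    letters = [c for c in name.upper() if "A" <= c <= "Z"]
    if not letters:
        return ""
    code = letters[0]
    prev = _CODES.get(letters[0], "")
    for ch in letters[1:]:
        digit = _CODES.get(ch, "")
        # Adjacent letters with the same digit are encoded only once.
        if digit and digit != prev:
            code += digit
        # H and W do not separate same-coded letters; vowels do.
        if ch not in "HW":
            prev = digit
    # Pad with zeros and truncate to one letter plus three digits.
    return (code + "000")[:4]


# Hypothetical preprocessing rules (an assumption, not taken from the paper).
_MARKS = re.compile("[\u064B-\u0652\u0640]")  # short-vowel marks and tatweel
_LETTER_MAP = str.maketrans({
    "\u0623": "\u0627",  # alef with hamza above -> bare alef
    "\u0625": "\u0627",  # alef with hamza below -> bare alef
    "\u0622": "\u0627",  # alef with madda -> bare alef
    "\u0629": "\u0647",  # ta marbuta -> ha
    "\u0649": "\u064A",  # alef maqsura -> ya
})


def normalize_arabic(name: str) -> str:
    """Reduce spelling variants of an Arabic name to one canonical form."""
    return _MARKS.sub("", name).translate(_LETTER_MAP).strip()


print(soundex("Robert"), soundex("Rupert"))  # both names map to the same code
```

In a pipeline of this kind, normalization would run before an Arabic-specific encoder, so that variant spellings of the same name are already identical, or nearly so, by the time they are coded.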
Copyright (c) 2024 Journal of Basic Sciences

This work is licensed under a Creative Commons Attribution 4.0 International License.