• 제목/요약/키워드: Arabic Named Entity Recognition (ANER)

검색결과 1건 처리시간 0.013초

A Machine Learning Approach for Named Entity Recognition in Classical Arabic Natural Language Processing

  • Ramzi Salah;Muaadh Mukred;Lailatul Qadri binti Zakaria;Fuad A. M. Al-Yarimi
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제18권10호
    • /
    • pp.2895-2919
    • /
    • 2024
  • A key element of many Natural Language Processing (NLP) applications is Named Entity Recognition (NER). It involves categorizing and identifying text into separate categories, such as identifying a location or an individual's name. Arabic NER (ANER) is also utilized in numerous other Arabic NLP (ANLP) tasks, such as Machine Translation (MT), Question Answering (QA), and Information Extraction (IE). ANER systems can often be classified into three major groups: rule-based, Machine Learning (ML), and hybrid. This study focuses on examining ML-based ANER developments, particularly in the context of Classical Arabic, which presents unique challenges due to its complex morphological structure and limited linguistic resources. We propose a supervised approach that integrates word-level, morphological, and knowledge-based features to improve NER performance for Classical Arabic. Our method was evaluated on the CANERCorpus, a specialized dataset containing annotated texts from Classical Arabic literature. The Naive Bayes (NB) approach achieved an F-measure of 80%, with precision and recall levels at 86% and 75%, respectively. These results indicate a significant improvement over traditional methods, particularly in dealing with the intricate structure of Classical Arabic. The study highlights the potential of ML in overcoming the challenges of ANER and provides directions for further research in this domain.