دراسة عن المسافة المعجمية للهجات العربية pdf

دراسة عن المسافة المعجمية للهجات العربية pdf
📄 بحث علمي
📋
النوع pdf
👤
المؤلف Abu Kwaik, Kathrein Saad, Motaz K Chatzikyriakidis, Stergios Dobnika, Simon
📅
التاريخ 2018
👁️
المشاهدات 260

📝 نبذة مختصرة

<strong>ملخص الدراسة:</strong>

Diglossia is a very common phenomenon in Arabic-speaking communities, where the spoken language is different from both Classical Arabic (CA) and Modern Standard Arabic (MSA). The spoken language is characterised as a number of dialects used in everyday communication as well as informal writing. In this paper, we highlight the lexical relation between the MSA and Dialectal Arabic (DA) in more than one Arabic region. We conduct a computational cross dialectal lexical distance study to measure the similarities and differences between dialects and the MSA. We exploit several methods from Natural Language Processing (NLP) and Information Retrieval (IR) like Vector Space Model (VSM), Latent Semantic Indexing (LSI) and Hellinger Distance (HD), and apply them on different Arabic dialectal corpora. We measure the overlap among all the dialects and compute the frequencies of the most frequent words in every dialect. The results are informative and indicate that Levantine dialects are very similar to each other and furthermore, that Palestinian appears to be the closest to MSA.

<strong>توثيق المرجعي (APA)</strong>

📄 محتوى البحث

ملخص الدراسة:

Diglossia is a very common phenomenon in Arabic-speaking communities, where the spoken language is different from both Classical Arabic (CA) and Modern Standard Arabic (MSA). The spoken language is characterised as a number of dialects used in everyday communication as well as informal writing. In this paper, we highlight the lexical relation between the MSA and Dialectal Arabic (DA) in more than one Arabic region. We conduct a computational cross dialectal lexical distance study to measure the similarities and differences between dialects and the MSA. We exploit several methods from Natural Language Processing (NLP) and Information Retrieval (IR) like Vector Space Model (VSM), Latent Semantic Indexing (LSI) and Hellinger Distance (HD), and apply them on different Arabic dialectal corpora. We measure the overlap among all the dialects and compute the frequencies of the most frequent words in every dialect. The results are informative and indicate that Levantine dialects are very similar to each other and furthermore, that Palestinian appears to be the closest to MSA.

توثيق المرجعي (APA)

🏷️ التصنيفات والكلمات المفتاحية

ℹ️ تفاصيل البحث

اللغة English
النص المتاح دراسة كاملة
البلد فلسطين

📤 مشاركة البحث

تم نسخ الرابط إلى الحافظة ✓