ArabLEX: Database of Arab Names (DAN)

Full Official Name: ArabLEX: Database of Arab Names (DAN)
Submission date: April 7, 2022, 2:48 p.m.

This database is part of the ArabLEX set of data which consists of the Database of Arabic General Vocabulary (DAG), Database of Arabic Place Names (DAP), Database of Foreign Names in Arabic (DAF) and Database of Arab Names (DAN) available from ELRA under references, respectively, ELRA-L0131, ELRA-M0105, ELRA-M0106 and ELRA-M0107. This full-form database covers Arab personal names (both given names and surnames) in both Arabic and English and contains a rich set of romanized name variants for each name with a variety of supplementary information such as gender, name type and frequency statistics. This comprehensive lexicon (over 6.4 million variants) contains precise phonemic transcriptions and vocalized Arabic for all inflected and cliticized forms for each name. This database is provided with 3 options: 1) proclitics, 2) phonetic information (CARS) and 3) orthographic variants. Subsets excluding some of the 3 proposed options may be provided upon demand. CARS is an accurate phonemic transcription. Optionally, phonetic transcriptions, IPA and/or SAMPA, can be provided, fine tuned to a customer's specifications. Quantity and size: 218,215,875 lines / 32,659 MB (31.9 GB) File format: flat TSV text files

Creator(s)
Distributor(s)
Right Holder(s)