Nazik Z. answered 01/17/20
I am a dentist with vast knowledge and a fun teacher!
Google N-gram Viewer is a graphing tool that charts time-based data to show the frequency of usage of words or phrases. The tool is based on a large dataset from books collected by Google from open sources. Although Arabic is the fifth spoken language in the world with speakers more than French, German, Russian and Italian languages, unfortunately, Arabic language is not included as one of the corpora indexed by the Google n-gram viewer. This paper illustrates the possibility of building a big Arabic corpus and indexing it to be included in Google N-gram viewer. A case study is presented to build a dataset to initiate the process of digitizing the Arabic content and prepare it to be incorporated in Google N-gram viewer. One of the major goals of including Arabic content in Google N-gram is to enrich Arabic public content which has been very limited in comparison with the size of Arabic speakers. We believe that adopting Arabic language by Google N-gram viewer can significantly benefit researchers in different fields related to Arabic language and its speakers’ history.