Blackboard Agents For Standard Arabic Language Tokenization And Parsing

Abstract

Processing the Arabic language is a difficult task compared with processing other languages. Arabic sentences are complex, tend to be longer than sentences in other languages, and have difficult structures with lattices. Parts of a sentence's syntactic structure may be missing, which affects the order of words and phrases. Arabic sentences can be parsed with several techniques, beginning with top-down and/or bottom-up parsing. The Recursive Transition Network (RTN) is one of the best-known techniques for parsing sentences; it is a transition automaton with a finite number of states. This paper presents a new method for the first stage of Arabic language processing, syntactic analysis, using transition network techniques. The technique also provides morphological analysis of the sentence tokens. Prolog version 2 was used to execute the parsing, since it is well suited to representing and processing natural languages.