TRANSFORMING NOUN PHRASE STRUCTURE FORM INTO RULES TO DETECT COMPOUND NOUNS IN MALAY SENTENCES

Authors

  • Suhaimi Abdul Rahman oftware Engineering Department, College of Information Technology Universiti Tenaga Nasional,Jalan IKRAM-UNITEN 43000 Kajang, Selangor, Malaysia
  • Nazlia Omar School of Computer Science, Faculty of Information Science and Technology Universiti Kebangsaan Malaysia, 43600 Bangi, Selangor, Malaysia

Keywords:

Noun phrase structure form, rules, compound noun, noun modifier category, parts of speech, tokenizer

Abstract

This paper addresses the process of transforming the noun phrase structure form into a list of rules to detect compound noun words in Malay sentences. Rules are collection of word syntax that are derived from a specific resource (as defined in our study). Comprehension of the concept rule used in a system is important (i.e. using rules to find a list of compound nouns that may exist in a sentence). The noun phrase frame structure is a form that contains a list of noun modifier categories. The list of noun modifier categories is then divided into several sub-categories such as numeral, numeral classifier, appellation, etc. All categories are arranged in sequence based on correct grammar. The noun phrase frame structure is then used to analyse the sentence. The words in the sentence will be arranged according to their suitable noun modifier category as defined by the noun phrase frame structure. In terms of data requirements, we will only focus on examples of sentences that combine two noun phrases.

 

Additional Files

Published

23-04-2013

How to Cite

Abdul Rahman, S., & Omar, N. (2013). TRANSFORMING NOUN PHRASE STRUCTURE FORM INTO RULES TO DETECT COMPOUND NOUNS IN MALAY SENTENCES. Journal of Information and Communication Technology, 12, 161–173. Retrieved from https://e-journal.uum.edu.my/index.php/jict/article/view/8142