THE ARCHITECTURE OF INFORMATION EXTRACTION FOR ONTOLOGY POPULATION IN CONTRACTOR SELECTION

Authors

  • Rosmayati Mohemad Software Technology Research Group (SofTech), School of Informatic and Applied Mathematics, Universiti Malaysia Terengganu, 21030, Kuala Terengganu, Terengganu, Malaysia
  • Abdul Razak Hamdan Faculty of Information Science and Technology, Universiti Kebangsaan Malaysia, Bandar Baru Bangi, 43600, Selangor, Malaysia
  • Zulaiha Ali Othamn Faculty of Information Science and Technology, Universiti Kebangsaan Malaysia, Bandar Baru Bangi, 43600, Selangor, Malaysia
  • Noor Maizura Mohamad Noor Software Technology Research Group (SofTech), School of Informatic and Applied Mathematics, Universiti Malaysia Terengganu, 21030, Kuala Terengganu, Terengganu, Malaysia

DOI:

https://doi.org/10.11113/jt.v78.9719

Keywords:

Decision-making, information extraction, ontology population, contractor selection

Abstract

The enormous amount of unstructured data presents the biggest challenge to decision makers in eliciting meaningful information to support business decision-making. This study explores the potential use of ontologies in extracting and populating the information from various combinations of unstructured and semi-structured data formats such as tabular, form-based and natural language-based text. The main objective of this study is to propose an architecture of information extraction for ontology population. Contractor selection is chosen as the domain of interest. Thus, this research focuses on the extraction of contractor profiles from tender documents in order to enrich ontological contractor profile by populating the relevant extracted information. The findings are significantly good in precision and recall, in which the performance measures have reached an accuracy of 100% precision and recall for extracting information in both tabular and form-based formats. However, the precision score of relevant information extracted in natural language text is average with a percentage of 42.86% due to the limitation of the linguistic approach for processing Malay texts. 

References

Gantz, J. F., and Reinsel, D. 2011. The 2011 Digital Universe Study: Extracting Value from Chaos. http://www.emc.com/collateral/analyst-reports/idc-extracting-value-from-chaos-ar.pdf.

Power, D. J., Burstein, F., and Sharda, R. 2011. Reflections on the Past and Future of Decision Support Systems: Perspective of Eleven Pioneers. Decision Support: An Examination of the DSS Discipline. Schuff, D., Paradice, D., Burstein, F., Power, D. J. and R. Sharda eds. Springer New York. 25-48.

Wanner, L., Rospocher, M., Vrochidis, S., Johansson, L., Bouayad-Agha, N., Casamayor, G., Karppinen, A., Kompatsiaris, I., Mille, S., Moumtzidou, A., and Serafini, L. 2015. Ontology-Centered Environmental Information Delivery for Personalized Decision Support. Expert Systems with Applications. 42(12): 5032-5046.

Hou, S., Li, H., and Rezgui, Y. 2015. Ontology-based Approach for Structural Design Considering Low Embodied Energy and Carbon. Energy and Buildings. 102(2015): 75-90.

Wang, X., Wong, T. N., and Fan, Z.-P. 2013. Ontology-based Supply Chain Decision Support for Steel Manufacturers in China. Expert Systems with Applications. 40(18): 7519-7533.

Chowdhuri, R., Yoon, V. Y., Redmond, R. T., and Etudo, U. O. 2014. Ontology Based Integration of XBRL Filings for Financial Decision Making. Decision Support Systems. 68 (2014): 64-76.

Shue, L.-Y., Chen, C.-W., and Shiue, W. 2009. The Development of an Ontology-Based Expert System for Corporate Financial Rating. Expert Systems with Applications. 36(2): 2130-2142.

Niaraki, A. S., and Kim, K. 2009. Ontology Based Personalized Route Planning System Using a Multi-Criteria Decision Making Approach. Expert Systems with Applications. 36(2): 2250-2259.

Bahulkar, A., and Reddy, S. 2013. Ontology Driven Information Extraction from Tables Using Connectivity Analysis. Lecture Notes in Computer Science. R. Meersman, H. Panetto, T. Dillon, J. Eder, Z. Bellahsene, N. Ritter, P. De Leenheer, and D. Dou eds. Springer Berlin Heidelberg. 642-658.

Guo, C., Ma, S., and Yuan, D. 2014. A Web Table Extraction Method Based on Structure and Ontology. Advanced Data Mining and Applications, Lecture Notes in Computer Science. Luo, X. Yu, J. and Li, Z. eds. Springer International Publishing. 695-704.

Younsi, Z., Quafafou, M., Ouzegane, R. and Tari, A. 2013. WebOMSIE: An Ontology-Based Multi Source Web Information Extraction. New Trends in Databases and Information Systems, Advances in Intelligent Systems and Computing. Pechenizkiy, M. and Wojciechowski, M. eds. Springer Berlin Heidelberg. 199-208.

Çelik, D. and Elçi, A. 2013. An Ontology-based Information Extraction Approach for Résumés. Pervasive Computing and the Networked World, Lecture Notes in Computer Science. Zu, Q., Hu, B. and Elçi, A. eds. Springer Berlin Heidelberg. 165-179.

Moreno, A. Isern, D. and López Fuentes, A. C. 2013. Ontology-based Information Extraction of Regulatory Networks from Scientific Articles with Case Studies for Escherichia Coli. Expert Systems with Applications. 40(8): 3266-3281.

Faria, C., Serra, I. and Girardi, R. 2014. A Domain-Independent Process for Automatic Ontology Population from Text. Science of Computer Programming. 95(Part 1): 26-43.

Lu, Y., Li, Q., Zhou, Z. and Deng, Y. 2015. Ontology-based Knowledge Modeling for Automated Construction Safety Checking. Safety Science. 79(2015): 11-18.

El-Diraby, T. E. and Osman, H. 2011. A Domain Ontology for Construction Concepts in Urban Infrastructure Products. Automation in Construction. 20(8): 1120-1132.

Chungoora, N., Young, R. I., Gunendran, G., Palmer, C., Usman, Z., Anjum, N. A., Cutting-Decelle, A.-F., Harding, J. A., and Case, K. 2013. A Model-driven Ontology Approach for Manufacturing System Interoperability and Knowledge Sharing. Computers in Industry. 64(4): 392-401.

Bright, T. J. 2009. Development and Evaluation of an Ontology for Guiding Appropriate Antibiotic Prescribing. PhD. School of Arts and Sciences, Columbia University, Columbia.

David, S., Arantza, A. and Clare, M. 2011. An Ontology of Diabetes Self Management. Proceedings of the First International Workshop on Managing Interoperability and Complexity in Health Systems. Glasgow, Scotland. 28 October 2011. 83-86.

Jia, H., Wang, M., Ran, W., Yang, S. J. H., Liao, J. and Chiu, D. K. W. 2011. Design of a Performance-Oriented Workplace e-Learning System using Ontology. Expert Systems with Applications. 38(4): 3372-3382.

Kuo-Kuang, C., Chien, I. L. and Rong-Shi, T. 2011. Ontology Technology to Assist Learners' Navigation in the Concept Map Learning System. Expert System Application. 38(9): 11293-11299.

Fernandez-Breis, J. T., Castellanos-Nieves, D., Hernandez-Franco, J., Soler-Segovia, C., Robles-Redondo, M. d. C., Gonzalez-Martinez, R., and Prendes-Espinosa, M. P. 2012. A Semantic Platform for the Management of the Educative Curriculum. Expert Systems with Applications. 39(5): 6011-6019.

Rosmayati, M., Abdul Razak, H., Zulaiha, A.O. and Noor Maizura, M. N. 2011. Modelling Ontology for Supporting Construction Tender Evaluation Process. International Conference on Semantic Technology and Information Retrieval (STAIR 11). Putrajaya, Malaysia. 27-29 June 2011. 282-288.

Ciribini, A. L. C., Bolpagni, M., and Oliveri, E. 2015. An Innovative Approach to e-Public Tendering Based on Model Checking. Procedia Economics and Finance. 21 (2015): 32-39.

Schaaffkamp, C. 2014. How Can Customer Focus be Strengthened in Competitive Tendering? Research in Transportation Economics. 48(2014): 305-314.

Small, S. and Medsker, L. 2014. Review of Information Extraction Technologies and Applications. Neural Computing and Applications. 25(3-4): 533-548.

Sen, S., Tao, J. and Deokar, A. 2015. On the Role of Ontologies in Information Extraction. Reshaping Society through Analytics, Collaboration, and Decision Support, Annals of Information Systems. Iyer, L.S. and Power, D. J. eds. Springer International Publishing. 115-133.

Daya, C. W. and Dejing, D. 2010. Ontology-based Information Extraction: An Introduction and a Survey of Current Approaches. Journal Information Science. 36(3): 306-323.

Li, C.-x., Su, Y.-r., Wang, R.-j., Wei, Y.-y., and Huang, H. 2012. Structured AJAX Data Extraction Based on Agricultural Ontology. Journal of Integrative Agriculture. 11(5): 784-791.

Nederstigt, L. J., Aanen, S. S., Vandic, D., and Frasincar, F. 2014. FLOPPIES: A Framework for Large-Scale Ontology Population of Product Information from Tabular Data in e-Commerce Stores. Decision Support Systems. 59(2014): 296-311.

Ali, F., Kim, E., and Kim, Y.-G. 2015. Type-2 Fuzzy Ontology-based Opinion Mining and Information Extraction: A Proposal to Automate the Hotel Reservation System. Applied Intelligence. 42(3): 481-500.

Gomez-Perez, A., Corcho-Garcia, O., and Fernandez-Lopez, M. 2004. Ontological Engineering. Springer-Verlag New York.

Fernandez, M., Gomez-Perez, A., and Juristo, N. 1997. METHONTOLOGY: From Ontological Art towards Ontological Engineering. Proceedings of the AAAI97 Spring Symposium Series on Ontological Engineering. Menlo Park, California. 24-26 March 1997. 33-40.

Oro, E. and Ruffolo, M. 2009. PDF-TREX: An Approach for Recognizing and Extracting Tables from PDF Documents. 10th International Conference on Document Analysis and Recognition. Barcelona, Spain. 26-29 July 2009. 906-910.

Rosmayati, M., Abdul Razak, H., Zulaiha, A.O., and Noor Maizura, M. N. 2011. Automatic Recognition of Document Structure from PDF Files. Software Engineering and Computer Systems, Communications in Computer and Information Science. Zain, J. M., Wan Mohd, W. M. b., and El-Qawasmeh, E. eds. Springer Berlin Heidelberg. 274-282.

Downloads

Published

2016-09-28

How to Cite

THE ARCHITECTURE OF INFORMATION EXTRACTION FOR ONTOLOGY POPULATION IN CONTRACTOR SELECTION. (2016). Jurnal Teknologi (Sciences & Engineering), 78(9-3). https://doi.org/10.11113/jt.v78.9719