Ontology-based Information Extraction from Tourism Web sites

Christina Feilmayr, Stefan Parzer, Birgit Pröll

Research output: Contribution to journalArticlepeer-review

Abstract

The enlarging amount of semistructured and unstructured data on heterogeneously designed tourism websites creates a need for information extraction (IE) mechanisms for semiautomatic data acquisition in order to build tourism recommender systems or tourism Web portals. In this article we analyze heterogeneity aspects of individually maintained accommodation websites and discuss the applicability of different IE types and techniques for this domain. We then develop a rule/ ontology-based IE approach and discuss the components of our prototype crawler. Finally, we discuss some relevant issues that emerged during the development and evaluation of the prototype.
Original languageEnglish
Pages (from-to)pp. 183-196
Number of pages15
JournalInformation Technology and Tourism
Volume11
Issue number3
DOIs
Publication statusPublished - 2009

Fields of science

  • 102001 Artificial intelligence
  • 102006 Computer supported cooperative work (CSCW)
  • 102010 Database systems
  • 102014 Information design
  • 102015 Information systems
  • 102016 IT security
  • 102028 Knowledge engineering
  • 102019 Machine learning
  • 102022 Software development
  • 102025 Distributed systems
  • 502007 E-commerce
  • 505002 Data protection
  • 506002 E-government
  • 509018 Knowledge management
  • 202007 Computer integrated manufacturing (CIM)
  • 102033 Data mining
  • 102035 Data science

Cite this