Please use this identifier to cite or link to this item:
https://repository.cihe.edu.hk/jspui/handle/cihe/226
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Xie, Haoran | - |
dc.contributor.author | Wang, Philips Fu Lee | - |
dc.contributor.author | Wong, Tak Lam | - |
dc.date.accessioned | 2021-03-16T05:45:22Z | - |
dc.date.available | 2021-03-16T05:45:22Z | - |
dc.date.issued | 2018 | - |
dc.identifier.uri | https://repository.cihe.edu.hk/jspui/handle/cihe/226 | - |
dc.description.abstract | Contrary to traditional Web information retrieval methods that can only return a ranked list of Web pages and only allow search terms in the query, we have developed a novel learning framework for retrieving precise information blocks from Web pages given a query, which may contain some search terms and prior information such as the layout format of the data. There are two challenging sub-tasks for this problem. One challenge is information block detection, where a Web page is automatically segmented into blocks. Another challenge is to find the information blocks relevant to the query. Existing page segmentation methods, which make use of only visual layout information or only content information, do not consider the query information, leading to a solution having conflict with the information need expressed by the query. Our framework aims at modeling the query and the block features to capture both keyword information and prior information via a probabilistic graphical model. Fisher Kernel, which can effectively incorporate the graphical model, is then employed to accomplish the two sub-tasks in a unified manner, optimizing the final goal of block retrieval performance. We have conducted experiments on benchmark datasets and read-world data. Comparisons between existing methods have been conducted to evaluate the effectiveness of our framework. | en_US |
dc.language.iso | en | en_US |
dc.publisher | Springer | en_US |
dc.relation.ispartof | International Journal of Machine Learning and Cybernetics | en_US |
dc.title | A learning framework for information block search based on probabilistic graphical models and Fisher Kernel | en_US |
dc.type | journal article | en_US |
dc.identifier.doi | 10.1007/s13042-017-0657-9 | - |
dc.contributor.affiliation | School of Computing and Information Sciences | - |
dc.relation.issn | 1868-808X | en_US |
dc.description.volume | 9 | en_US |
dc.description.issue | 9 | en_US |
dc.description.startpage | 1473 | en_US |
dc.description.endpage | 1487 | en_US |
dc.cihe.affiliated | Yes | - |
item.languageiso639-1 | en | - |
item.fulltext | No Fulltext | - |
item.openairetype | journal article | - |
item.grantfulltext | none | - |
item.openairecristype | http://purl.org/coar/resource_type/c_6501 | - |
item.cerifentitytype | Publications | - |
crisitem.author.dept | Yam Pak Charitable Foundation School of Computing and Information Sciences | - |
crisitem.author.dept | Rita Tong Liu School of Business and Hospitality Management | - |
crisitem.author.dept | Yam Pak Charitable Foundation School of Computing and Information Sciences | - |
Appears in Collections: | CIS Publication |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.