Journal of Guangdong University of Technology ›› 2017, Vol. 34 ›› Issue (03): 89-95.doi: 10.12052/gdutxb.170029

Previous Articles     Next Articles

A Research on Text Information Extraction from Annual Report Based on Domain Ontology

Liang Zhuo-qian1,3, Wang Dong2, Zhu Hui2, Pan Ding1   

  1. 1. School of Management, Jinan University, Guangzhou 510632, China;
    2. School of Business Administration, Guangzhou 510006, China;
    3. School of Information, Jinan University, Guangzhou 510632, China
  • Received:2017-02-17 Online:2017-05-09 Published:2017-05-09

Abstract:

Significant financial information can be retrieved from the vast amount of textual data provided in Chinese business accounting reports (annual reports). Nevertheless, due to the unstructured nature, this textual information usually is difficult to be obtained and analyzed via traditional computer and database techniques. To address this issue, a set of unified domain-specific ontology is presented, combined with Chinese Natural language processing (NLP), which transforms accounting reports in unstructured text into a structured XBRL-based form via three different dimensions, namely word attribute description, word relation organization, and related knowledge links respectively.

Key words: extensible business reporting language(XBRL), domain ontology, financial report

CLC Number: 

  • TP391

[1] LI H Q, ZHAI J. Literature review of XBRL semantic research[C]//. 2015 International Conference on Computer Science and Intelligent Communication. HK:Atlantis, 2015:316-320.
[2] LI M J, ZHOU Z H, DU M J. Detection and resolution of structural conflictions in heterogeneous XBRL taxonomies[C]//. The 5th International Conference on New Trends in Information Science and Service Science. HI:IEEE, 2011:312-317.
[3] LI M J, ZHOU Z H, DU M J. XBRL in the Chinese financial ecosystem[J]. IT professional, 2013, 15(6):36-42.
[4] 李吉梅, 杜美杰. 基于XBRL的异构财务信息集成算法[J]. 吉林大学学报(工学版), 42(S1):266-270. LI J M, DU M J. Information integration algorithm of heterogeneous XBRL financial reporting[J]. Journal of Jilin University:Engineering and Technology Edition, 2012, 42(S1):266-270.
[5] PAN D, PAN Y S. Incorprating XBRL into business intelligence applications based on formal semantics[C]//2011 China Academic Accounting Association Annual Meeting. XM:Elsevier, 2011:1758-1765.
[6] 冯志伟. 现代术语学引论(增订本)[M]. 北京:商务印书馆, 2011. 12-195.
[7] 杨周南, 朱建国, 刘锋. XBRL分类标准认证的理论基础和方法学体系研究[J]. 会计研究, 2010, 1(11):10-15 YANG Z N, ZHU J G, LIU F. Research on the theory basis and methodology system of xbrl taxonomy recognition[J]. Accounting Research. 2010, 1(11):10-15.
[8] DEBRECENY R, FELDEN C, OCHOCKI B, et al. XBRL for interactive data[M]. NY:Springer, 2009. 189-211.
[9] LARA R, CANTADOR I, CASTELLS P. XBRL taxonomies and OWL ontologies for investment funds[C]//In the 1st International Workshop on Ontologizing Industrial Standards at the 25th International Conference on Conceptual Modelling. AZ:Springer, 2006:271-280.
[10] BAO J, RONG G, LI X, et al. Representing financial reports on the semantic web:a faithful translation from XBRL to OWL[C]//International Workshop on Rules and Rule Markup Languages for the Semantic Web. DC:Springer, 2010:144-152.
[11] HUANG M, WANG D, WANG K. Ontology-based semantic retrieval of XBRL data[C]//2011 International Conference on Business Computing and Global Informatization, SH:IEEE, 2011:363-366.
[12] ZHU H. Semantic integration approach to efficient business data supply chain:integration approach to interoperable XBRL[EB/OL]. (2007-10-01)[2016-04-01]. http://web.mit.edu/smadnick/www/wp/2007-10.pdf
[13] ROMILLA C, YOON VY, REDMOND RT, et al. Ontology based integration of XBRL filings for financial decision making[J]. Decision Support Systems, 2014, 1(68):64-76.
[14] GARCIA R, GIL R. Publishing XBRL as linked open data[C]//In Proceedings of World Wide Web Workshop:Linked Data on the Web, Madrid:CEUR-WS, 2009:538
[15] KAMPGEN B, WELLER T, O'RIAIN S. Accepting the XBRL challenge with linked data for financial data integration[J]. Lecture Notes in Computer Science, 2014, 1(8465):595-610
[16] 吴忠生, 张天西, 陈志德. 基于领域本体的XBRL财务报告转换研究[J]. 计算机应用研究, 2013, 1(30):3643-3646 WU Z S, ZHANG T X, CHEN Z D. Research on conversion between XBRL financial reports based on domain ontology[J]. Application Research of Computers. 2013, 1(30):3643-3646.
[17] ANTONINA K, CAMILLA M, BARBRO B. Mining textual contents of financial reports[J]. The International Journal of Digital Accounting Research, 2004, 4(7):1-29
[18] MENDEZ NUNEZ S, TRIVIO G. Combining semantic web technologies and computational theory of perceptions for text generation in financial analysis[C]//2010 IEEE International Conference on Fuzzy Systems. Barcelona:IEEE, 2012:1-8.
[19] GRUBER T R. Toward principles for the design of ontologies used for knowledge sharing[J]. International journal of human-computer studies, 1995, 1(43):907-928.
[20] 李群. 非寿险业务的会计核算[J]. 财务与会计. 2009, 1(5):20-26. LI Q. Accounting for non-life insurance business[J]. Financial and Accounting. 2009, 1(5):20-26.
[21] 黄蓉, 徐璐璐. 公司关联交易文献评述[J]. 广东工业大学学报, 2016, 33(06):102-106. HUANG RONG, XU LU-LU. Summarization of Related Party Transactions in Listed Company. JOURNAL OF GUANGDONG UNIVERSITY OF TECHNOLOGY, 2016, 33(06):102-106.

[1] Xie Guo-bo, Lin Li, Lin Zhi-yi, He Di-xuan, Wen Gang. An Insulator Burst Defect Detection Method Based on YOLOv4-MP [J]. Journal of Guangdong University of Technology, 2023, 40(02): 15-21.
[2] Chen Jing-yu, Lyu Yi. Frost Detection Method of Cold Chain Refrigerating Machine Based on Spiking Neural Network [J]. Journal of Guangdong University of Technology, 2023, 40(01): 29-38.
[3] Ye Wen-quan, Li Si, Ling Jie. Sparse-view SPECT Image Reconstruction Based on Multilevel-residual U-Net [J]. Journal of Guangdong University of Technology, 2023, 40(01): 61-67.
[4] Zou Heng, Gao Jun-li, Zhang Shu-wen, Song Hai-tao. Design and Implementation of a Dropping Guidance Device for Go Robot [J]. Journal of Guangdong University of Technology, 2023, 40(01): 77-82,91.
[5] Xie Guang-qiang, Xu Hao-ran, Li Yang, Chen Guang-fu. Consensus Opinion Enhancement in Social Network with Multi-agent Reinforcement Learning [J]. Journal of Guangdong University of Technology, 2022, 39(06): 36-43.
[6] Liu Xin-hong, Su Cheng-yue, Chen Jing, Xu Sheng, Luo Wen-jun, Li Yi-hong, Liu Ba. Real Time Detection of High Resolution Bridge Crack Image [J]. Journal of Guangdong University of Technology, 2022, 39(06): 73-79.
[7] Xiong Wu, Liu Yi. Application of Particle Filter Algorithm in Static Deformation Monitoring of BDS High-Speed Rail [J]. Journal of Guangdong University of Technology, 2022, 39(04): 66-72.
[8] Yi Min-qi, Liu Hong-wei, Gao Hong-ming. Research on the Factors Influencing the Co-purchase Network of Products on E-commerce Platforms [J]. Journal of Guangdong University of Technology, 2022, 39(03): 16-24.
[9] Qiu Zhan-chun, Fei Lun-ke, Teng Shao-hua, Zhang Wei. Palmprint Recognition Based on Cosine Similarity [J]. Journal of Guangdong University of Technology, 2022, 39(03): 55-62.
[10] Zheng Jia-bi, Yang Zhen-guo, Liu Wen-yin. Marketing-Effect Estimation Based on Fine-grained Confounder Balancing [J]. Journal of Guangdong University of Technology, 2022, 39(02): 55-61.
[11] Gary Yen, Li Bo, Xie Sheng-li. An Evolutionary Optimization of LSTM for Model Recovery of Geophysical Fluid Dynamics [J]. Journal of Guangdong University of Technology, 2021, 38(06): 1-8.
[12] Li Guang-cheng, Zhao Qing-lin, Xie Kan. A Design of Decentralized Data Processing Scheme [J]. Journal of Guangdong University of Technology, 2021, 38(06): 77-83.
[13] Xie Guang-qiang, Zhao Jun-wei, Li Yang, Xu Hao-ran. Cooperative Lane-changing Based on Multi-cluster System [J]. Journal of Guangdong University of Technology, 2021, 38(05): 1-9.
[14] Zhang Wei, Zhang Zhen-bin. Joint Graph Embedding and Feature Weighting for Unsupervised Feature Selection [J]. Journal of Guangdong University of Technology, 2021, 38(05): 16-23.
[15] Deng Jie-hang, Yuan Zhong-ming, Lin Hao-run, Gu Guo-sheng. Superpixel and Visual Saliency Synergetic Image Quality Assessment [J]. Journal of Guangdong University of Technology, 2021, 38(05): 33-39.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!