Web Document Analysis for Companies Listed in Bursa Malaysia
Mohd Shahizan Othman, Lizawati Mi Yusuf, Juhana Salim

TL;DR
This study analyzes web documents of Bursa Malaysia-listed companies, revealing minimal website usage and highlighting the predominant use of image files in their online presence.
Contribution
It introduces a Web Resources Extraction System and provides insights into the web usage patterns of Malaysian companies.
Findings
Minimal website usage among Bursa Malaysia companies
Image files account for 60.02% of the files used in company websites, the most of any file type
Web Resources Extraction System effectively extracts web document information
Abstract
This paper discusses research on web document analysis for companies listed on Bursa Malaysia, which is the forerunner of financial and investment centers in Malaysia. The data set consists of the web documents of companies listed on the Main Board and Second Board of Bursa Malaysia. The research used the Web Resources Extraction System, which was developed by the research group, mainly to extract information from the web documents involved. The findings show that the level of website usage among the companies on Bursa Malaysia is still minimal. Furthermore, image files account for 60.02 percent of the files used, making them the most used type of file in creating websites.
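The abstract does not describe how the Web Resources Extraction System works internally, so the following is only a minimal sketch of the kind of measurement it reports: collecting the resources a page references and computing each file type's share. The names `ResourceCollector`, `file_type_shares`, and the `CATEGORIES` map are hypothetical and are not taken from the paper.

```python
from collections import Counter
from html.parser import HTMLParser
from os.path import splitext
from urllib.parse import urlparse

# Hypothetical extension-to-category map; the paper does not list its categories.
CATEGORIES = {
    ".jpg": "image", ".jpeg": "image", ".gif": "image", ".png": "image",
    ".html": "document", ".htm": "document", ".pdf": "document",
    ".js": "script", ".css": "stylesheet",
}


class ResourceCollector(HTMLParser):
    """Collects every URL found in a src or href attribute."""

    def __init__(self):
        super().__init__()
        self.urls = []

    def handle_starttag(self, tag, attrs):
        for name, value in attrs:
            if name in ("src", "href") and value:
                self.urls.append(value)


def file_type_shares(html):
    """Return the percentage share of each file category referenced in html."""
    collector = ResourceCollector()
    collector.feed(html)
    counts = Counter()
    for url in collector.urls:
        # Classify by the extension of the URL path, ignoring query strings.
        ext = splitext(urlparse(url).path)[1].lower()
        counts[CATEGORIES.get(ext, "other")] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {kind: 100.0 * n / total for kind, n in counts.items()}
```

Calling `file_type_shares` on the HTML of one page gives per-page shares; a figure such as the 60.02 percent reported above would presumably come from aggregating counts across all company sites, which this sketch does not attempt.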
Taxonomy
Topics: Web Data Mining and Analysis · Algorithms and Data Compression · Advanced Text Analysis Techniques
