An Empirical Study on How the Developers Discussed about Pandas Topics
Sajib Kumar Saha Joy, Farzad Ahmed, Al Hasib Mahamud, and Nibir, Chandra Mandal

TL;DR
This paper analyzes Stack Overflow discussions to identify popular and challenging pandas topics among developers, categorizing 26 topics into five groups and tracking their discussion trends over time.
Contribution
It provides an empirical analysis of pandas-related discussions on Stack Overflow, categorizing topics and revealing trends and difficulties faced by developers.
Findings
26 pandas topics identified and categorized
Developers frequently discuss error handling, visualization, and optimization
Trend analysis shows evolving focus on specific pandas topics over time
Abstract
Pandas is defined as a software library which is used for data analysis in Python programming language. As pandas is a fast, easy and open source data analysis tool, it is rapidly used in different software engineering projects like software development, machine learning, computer vision, natural language processing, robotics, and others. So a huge interests are shown in software developers regarding pandas and a huge number of discussions are now becoming dominant in online developer forums, like Stack Overflow (SO). Such discussions can help to understand the popularity of pandas library and also can help to understand the importance, prevalence, difficulties of pandas topics. The main aim of this research paper is to find the popularity and difficulty of pandas topics. For this regard, SO posts are collected which are related to pandas topic discussions. Topic modeling are done on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Online Learning and Analytics · Software System Performance and Reliability
MethodsLib
