The Performance of the LSTM-based Code Generated by Large Language Models (LLMs) in Forecasting Time Series Data
Saroj Gopali, Sima Siami-Namini, Faranak Abri, Akbar Siami Namin

TL;DR
This paper evaluates the effectiveness of various large language models in automatically generating deep learning models for time series forecasting, comparing their performance to manually crafted models across different prompt sensitivities.
Contribution
It provides a systematic comparison of LLMs like ChatGPT, PaLM, LLama, and Falcon in generating time series forecasting models, highlighting ChatGPT's superior performance.
Findings
Generated models perform comparably to manual LSTM models.
ChatGPT outperforms other LLMs in accuracy.
Model quality varies with the temperature setting.
Abstract
As an intriguing case is the goodness of the machine and deep learning models generated by these LLMs in conducting automated scientific data analysis, where a data analyst may not have enough expertise in manually coding and optimizing complex deep learning models and codes and thus may opt to leverage LLMs to generate the required models. This paper investigates and compares the performance of the mainstream LLMs, such as ChatGPT, PaLM, LLama, and Falcon, in generating deep learning models for analyzing time series data, an important and popular data type with its prevalent applications in many application domains including financial and stock market. This research conducts a set of controlled experiments where the prompts for generating deep learning-based models are controlled with respect to sensitivity levels of four criteria including 1) Clarify and Specificity, 2) Objective and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEnergy Load and Power Forecasting · Topic Modeling · Computational Physics and Python Applications
MethodsSparse Evolutionary Training · Pathways Language Model · Tanh Activation · OPT · Sigmoid Activation · Long Short-Term Memory
