Prediction of tractor drawbar pull under different tillage tools using machine learning and low-cost sensors

So-Yun Gong; Si-Eon Lee; Yi-Seo Min; Seung-Min Baek; Seung-Yun Baek; Yong-Joo Kim; Wan-Soo Kim

PMC · DOI:10.1038/s41598-025-24974-w·November 20, 2025

Prediction of tractor drawbar pull under different tillage tools using machine learning and low-cost sensors

So-Yun Gong, Si-Eon Lee, Yi-Seo Min, Seung-Min Baek, Seung-Yun Baek, Yong-Joo Kim, Wan-Soo Kim

PDF

Open Access

TL;DR

This study uses machine learning and low-cost sensors to accurately predict tractor drawbar pull for different plows in various soil conditions.

Contribution

The study introduces a cost-effective approach by comparing input variable combinations for different plows using machine learning models.

Findings

01

RF in Model E achieved the highest performance for moldboard plow prediction (R2 = 0.977).

02

ANN in Models B and C provided strong accuracy for chisel plow prediction (R2 = 0.953).

03

ANN in Models B and E performed well for subsoiler plow prediction (R2 = 0.953).

Abstract

Machine-learning models were developed to predict the drawbar pull of a 78-kW-class tractor for moldboard, chisel, and subsoiler plows. Four models were tested: random forest (RF), extreme gradient boosting (XGB), artificial neural network (ANN), and support vector machine (SVM). The training variables included engine speed (ES), engine torque (ET), travel speed (TS), tillage depth (TD), and slip ratio (SR). Unlike prior studies that focused mainly on engine parameters, this study incorporated nonlinear variables to improve both accuracy and practical applicability. Data were collected from three Korean paddy fields with different soil conditions, and the dataset was divided into 70% for training and 30% for testing. Five input variable combinations were used: Model A (ES, ET), Model B (ES, ET, TD), Model C (ES, ET, TS, SR), Model D (TD, TS), and Model E (ES, TD, TS). The results showed…

Linked entities

Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.

Chemicals1

chisel

Figures17

Click any figure to enlarge with its caption.

Drawbar pull for different tillage tools: comparison between this study and ASABE standards.

Critical performance data for tractor operations with plow (a) moldboard plow, (b) chisel plow, and (c) subsoiler: Results of correlation analysis.

Scatter plots of the measured and predicted drawbar pull for the moldboard plow: (a) RF, (b) XGB, (c) ANN, and (d) SVM.Table 11Performance evaluation of machine-learning models for predicting drawbar pull in moldboard-plow operations.ModelR^2^RMSE (kN)MAPE (%)RD (%)TrainTestR^2^ GapTrainTestTestTrainTestTrainTestARF0.6330.4950.1380.6450.7461.3371.822.142.613.02XGB0.4600.4290.0310.7830.7941.0282.442.533.173.21ANN0.6180.6010.0160.3050.3301.1660.910.970.010.01SVM0.4820.3900.0920.7720.8091.0990.020.030.030.03BRF0.8890.7930.0960.3540.4831.8560.891.170.011.95XGB0.8010.7310.0710.4790.5361.2551.391.48

Scatter plots of the measured and predicted drawbar pull for the chisel plow: (a) RF, (b) XGB, (c) ANN, and (d) SVM.Table 12Performance evaluation of machine learning models for predicting drawbar pull in chisel-plow operations.ModelR^2^RMSE (kN)MAPE (%)RD (%)TrainTestR^2^ GapTrainTestTTLRTrainTestTrainTestARF0.7560.7330.0230.9080.9511.0986.046.267.798.16XGB0.8970.8840.0120.5920.6261.1214.134.375.085.37ANN0.9230.9160.0070.5100.5351.0993.323.560.040.05SVM0.9230.9080.0150.5110.5591.1990.030.040.040.05BRF0.9360.9120.0230.4660.5471.3793.163.704.004.69XGB0.8000.7860.0140.8180.8631.1125.586.007.027.

Scatter plots of the measured and predicted drawbar pull for the subsoiler: (a) RF, (b) XGB, (c) ANN, and (d) SVM.Table 13Performance evaluation of machine learning models for predicting drawbar pull in subsoiler operations.ModelR^2^RMSE (kN)MAPE (%)RD (%)TrainTestR^2^ GapTrainTestTTLRTrainTestTrainTestARF0.9670.9160.0510.4460.5381.4520.931.403.832.20XGB0.8220.7830.0390.7960.8631.1752.592.833.263.54ANN0.5600.5520.0081.2531.2400.9793.953.560.050.05SVM0.9870.8810.1050.2190.6388.5060.010.020.010.03BRF0.8760.7990.0770.7110.8481.4231.982.556.103.48XGB0.6980.6630.0361.0281.0981.1393.353.554.224.50AN

Traction-force prediction for different tillage tools: (a) moldboard plow, (b) chisel plow, and (c) subsoiler. Comparison of R^2^ among machine-learning Models (A–E): Model A = drawbar pull as a function of engine speed and engine torque; Model B = drawbar pull as a function of engine speed, engine torque, and tillage depth; Model C = drawbar pull as a function of engine speed, engine torque, travel speed, and slip ratio; Model D = drawbar pull as a function of tillage depth and travel speed; Model E = drawbar pull as a function of engine speed, tillage depth, and travel speed.

Traction-force prediction for different tillage tools: (a) moldboard plow, (b) chisel plow, and (c) subsoiler. Comparison of RMSE among machine-learning Models (A–E): Model A = drawbar pull as a function of engine speed and engine torque; Model B = drawbar pull as a function of engine speed, engine torque, and tillage depth; Model C = drawbar pull as a function of engine speed, engine torque, travel speed, and slip ratio; Model D = drawbar pull as a function of tillage depth and travel speed; Model E = drawbar pull as a function of engine speed, tillage depth, and travel speed.

Robustness test using scatter plots of the measured and predicted values obtained with the moldboard plow: (a) RF, (b) XGB, (c) ANN, and (d) SVM.

Diagram of the tractor field data acquisition system.

Field experiment sites for tractor-performance measurement.

Field experiments with different implements: (a) moldboard plow, (b) chisel plow, and (c) subsoiler.

Overview of machine-learning models: (a) Random Forest (RF)—ensemble of decision trees with majority voting, (b) Extreme Gradient Boosting (XGB)—sequential boosting process, (c) Artificial Neural Network (ANN)—multi-layer perceptron with hidden layers, and (d) Support Vector Machine (SVM)—optimal hyperplane with support vectors.

Framework for developing and validating machine learning-based drawbar pull prediction model.

Engine-load characteristics with time for different tillage tools: comparison of engine torque, engine speed, and engine power.

Tractor working-performance metrics with time for different tillage tools: comparison of travel speed, slip ratio, and tillage depth.

Traction performance factors with time for different tillage tools: comparison of drawbar pull.

Funding1

—Ministry of Trade, industry & Energy (MOTIE) of Korea

Keywords

EngineeringMathematics and computing

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoil Mechanics and Vehicle Dynamics · Agricultural Engineering and Mechanization · Agriculture and Farm Safety

Full text

Introduction

The global agricultural-machinery market is projected to grow from $155.1 billion in 2021 to$ 212.6 billion in 2026, with a compound annual growth rate (CAGR) of 6.5%. The tractor market alone is estimated to grow from $55.5 billion in 2021 to$ 77.8 billion in 2026 at a CAGR of 7.0%^1^. Tractors are widely used in agricultural fields owing to their versatility in plowing, mowing, harvesting, and transportation operations^2^. Tractors have the advantages of a wide transmission range and adaptability to diverse road conditions. A tractor performs agricultural tasks by attaching implements to its front or rear^3^ and primarily executes agricultural tasks by towing attached implements or supplying power through a power take-off (PTO). Tractors are typically operated on soil, which has diverse and irregular characteristics owing to varying physical and chemical properties^4^.

The plowing operation of tractors requires a substantial drawbar pull owing to high soil resistance; consequently, the traction performance of a tractor is a critical aspect of its design. The evaluation of the traction performance of a tractor is currently based on the Organization for Economic Co-operation and Development (OECD) testing method, which involves towing a drawbar equipped with various sensors on asphalt road surfaces^5^. Although evaluation under concrete-road conditions is useful for comparing performances across various models and manufacturers, it fails to reflect actual soil conditions, resulting in significant discrepancies with agricultural performance^6^. Tractor power-transmission efficiency, traction efficiency, and work performance are significantly influenced by losses caused by slip and other factors resulting from the complex interactions between tractor driving wheels and the soil^7^. Numerous studies have used load and traction measurement systems to evaluate the traction performance of tractors. Battiato and Diserens (2017) conducted simulations and experimental validations of tractor traction performance across various soil textures to enhance the practical applicability in agricultural operations^8^. Their analysis considered factors such as soil–tire interaction, tractor configuration, slip ratio, and tire pressure. Kumari and Raheman (2019) developed an embedded advisory system to evaluate the traction performance of tractors by optimizing the gear and throttle settings^9^. The system measures the wheel slip, velocity ratio, and engine load in real time, providing operators with guidance to maintain optimal ranges, thereby improving both traction performance and fuel efficiency. However, evaluating tractor traction performance remains cumbersome because it requires complex measurement systems using various sensors, followed by data collection and analysis through experimentation^10^. Thus, a technology that can predict tractor traction performance based on key tractor component data is required.

Various attempts have been made to predict traction performance. Before the introduction of machine learning, most studies employed statistical approaches with widely used regression models. Askari et al. (2021) utilized a response surface methodology approach to predict the tractive performance of agricultural tractors^11^. Ranjbarian et al. (2017) used the American Society of Agricultural and Biological Engineers (ASABE) equations to predict the tractor slip rate, drawbar pull, and traction efficiency. However, with the rapid advancement of artificial intelligence technologies and high-performance computing techniques, machine-learning methods are increasingly being applied in various areas of agricultural machinery for performance prediction^12^.

Factors influencing tractor traction performance include soil properties, soil-tractor interactions, and the type of attached implement^13^. Of these, the type of attached implement plays a critical role in determining the traction performance. In particular, plows, commonly used for tillage, significantly affect tractor traction performance, with variations depending on the type of plow^14^. For example, the type of plow influences factors such as the tractor speed and tillage depth, which ultimately have a direct impact on traction performance. Although studies have been conducted on the traction performance of tractors equipped with various plows, further research is required to address the remaining gaps. Arvidsson et al. (2004) measured the specific drawbar pull and energy consumption of three plowing tools (moldboard plow, chisel plow, and disc plow) under various soil conditions and demonstrated that the required drawbar pull varied depending on the plow type under identical soil conditions^15^. Kim et al. (2022) analyzed the effects of tillage depth and soil properties on the working performance of a tractor-moldboard plow system and confirmed that the engine load and drawbar pull varied considerably depending on the soil type and tillage depth^16^. Upadhyay et al. (2024) employed ANN and regression models to predict the power requirements of agricultural machinery and applied the ANN–PSO (Particle Swarm Optimization) technique to determine the optimal operating conditions^17^. Nataraj et al. (2021) developed ANN and regression models using soil properties and operating conditions as input variables to predict the draft and power requirements of a rotary tiller, and compared their performance^18^. Upadhyay et al. (2025) developed a regression model to predict the draft and torque requirements of an active–passive disc harrow (APDH) and derived the optimal operating conditions through multi-objective optimization based on a genetic algorithm^19^. These studies concluded that the type of plow significantly affects traction performance. Furthermore, studies have been conducted to predict the tractor performance under specific working conditions. However, previous studies mainly employed expensive high-cost sensors, did not sufficiently consider diverse input variables, and mainly focused on evaluating tractor traction performance with a single type of plow. Considering different plows exhibit distinct performance characteristics in terms of the drawbar pull, slip ratio, and soil disturbance, a comprehensive approach that accounts for these variations is required; research comparing and predicting the traction performance of tractors equipped with multiple types of plows under various field conditions remains insufficient.

This study aims to analyze and predict the traction performance of tractors equipped with different types of plows, focusing on their impact on key traction parameters. The specific objectives of this study were as follows: (1) Evaluate the traction performance of tractors equipped with three distinct types of plows (moldboard, chisel, and subsoiler) under varying soil conditions, using a load-measurement system in field experiments. (2) Assess the impact on the overall drawbar pull by systematically measuring key traction-related variables, including the engine load (torque, speed, and power), travel speed, slip ratio, tillage depth. (3) Develop and validate a predictive model that predicts the tractor traction performance for different plow types based on machine-learning techniques, incorporating the real-world operational data collected from field experiments.

Methods

Measurement system

In this study, the performance of a 78-kW tractor (S07, TYM Co., Ltd., Korea) was measured. The dimensions of the tractor are 4225 mm (length) × 2140 mm (width) × 2,830 mm (height), with a weight of 3985 kg. The rated engine power is 78 kW (at 2300 rpm), and the maximum torque is 430 Nm (at 1400 rpm). The transmission system provides 32 forward and 32 reverse gear stages, achieved through a four-speed main transmission, a four-range transmission, and a two-speed power-shift function for high- and low-range switching. In addition, a four-stage sub-transmission is available, providing a variety of gear-selection options. The maximum recorded drawbar pull was 29.25 kN, with a traction power of 11.55 kW, achieved at a speed of 1.41 km/h. When operating in the gear closest to 7.5 km/h, the traction performance was measured at a traction power of 48.23 kW, a drawbar pull of 25.74 kN, and a travel speed of 6.75 km/h. The maximum recorded traction power was 56.59 kW, with a drawbar pull of 16.68 kN, at a speed of 12.23 km/h.

A measurement system was designed to measure the engine, axle, drawbar, and other key operating variables of a tractor in real-time, as shown in Fig. 1. The engine torque and rotational speed were measured via controller-area-network (CAN) communication and transmitted to the data-acquisition device through CAN channels. The rotational speed was measured using a proximity sensor (PRDCMT30-25DO; Autonics, Korea). The torque and rotational speed of the wheel axles were measured using analog input (four channels) and digital input (four channels). A GPS device (GPS 18x, Garmin, USA) was mounted on top of the tractor cabin to measure the driving speed, and the data were transmitted to the data-acquisition device through CAN channels. The plowing depth was measured by attaching a potentiometer to the three-point hitch, and data were collected through an analog input channel (1 channel). All collected data were stored in real time using a DAQ system (CRONOS Compact CRC-400-11, IMC, Germany).To measure the drawbar pull, a six-axis force meter was attached between the tractor and rear-mounted implement. The six-axis force meter consisted of six load cells (DACELL, UU-T2, Korea), and equations were used to calculate the drawbar pull using the data collected by three of these load cells.Fig. 1. Diagram of the tractor field data acquisition system.

Table 1 presents the measurement accuracy and error specifications of the sensors used in this study. The DACELL UU-T2 showed an accuracy within ±0.1% R.O., while the potentiometer exhibited a linearity of ±1% and an end-point sensitivity of ±1% of the supply voltage. The Autonics PRDCMT30-25DO had a response frequency of up to 100 Hz with a detection distance tolerance within ±10%, and the Garmin GPS 18x provided a position accuracy of less than 3 m and a velocity error of less than 0.1 m/s.Table 1. Specifications of measurement accuracy and error for the sensors used in this study.SensorMeasurement itemAccuracy / Error specificationDACELL UU-T2Drawbar pullNonlinearity, hysteresis, and repeatability within ± 0.1% R.OPotentiometerTillage depthLinearity: ± 1% Sensitivity of the end points: ± 1% of supply voltageAutonics PRDCMT30-25DORotational speedResponse frequency up to 100 Hz, detection distance tolerance within ± 10%Garmin GPS 18xTravel speedPosition accuracy < 3 m (WAAS enabled), velocity error typically < 0.1 m/s

Field experiment

Types of agricultural implements Various types of plows, such as moldboards, chisels, and subsoiler, are commonly used in agriculture^20^. Three types of implements were used in this study, as shown in Fig. 2. These include the moldboard plow, which is used to effectively cut soil and create furrows^21^ the chisel plow, which tills deep soil with its long, sharp blades^22^ and the subsoiler, which breaks through compacted hardpan layers^23^.Fig. 2. Tillage implements used in this study.

Specifications of plows The working widths of the moldboard and chisel plow were 2.8 m, whereas that of the subsoiler was 2.1 m. The maximum tillage depths are 0.2 m, 0.4 m, and 0.7 m for the moldboard plow, chisel plow, and subsoiler, respectively.

Site selection Field experiments were conducted in rice paddy fields located in Geumam-ri, Songsan-myeon, Dangjin-si, Chungcheongnam-do, as shown in Fig. 3. Field A, measuring 6000 m^2^ (60 × 100 m), was used for the moldboard-plow operation. Field B, measuring 9800 m^2^ (120 × 80 m), and Field C, measuring 12,000 m^2^ (120 × 100 m), were used for both chisel-plow and subsoiler operations.Fig. 3. Field experiment sites for tractor-performance measurement.

Soil-environment analysis The cone index was measured using a soil penetrometer (SC 900, Spectrum Technologies, Aurora, IL, USA) in accordance with ASABE standards^24^, and the cone index for each depth was presented as the average value for the 0–150 mm range. Soil-moisture content (SMC) was measured using a soil-moisture sensor (TDR 350, Spectrum Technologies, Aurora, IL, USA). Soil samples were collected from 10 uniform locations at each site, based on the plowing depth. Soil texture was analyzed based on the particle distribution of sand, silt, and clay as defined by the USDA^25^. The soil properties were analyzed for each field: the soils in Fields A, B, and C were loam, with similar results observed for both soil strength and moisture (Table 2).Table 2. Soil environment analysis results by field experiment site.SiteCone index (kPa) for depths (0–15 cm)Soil moisture content (%)Soil textureField A315–368437.63LoamField B270–374332.30LoamField C209–310035.77Loam

Plowing-depth and gear settings Field experiments were conducted for data measurement, as shown in Fig. 4. The plowing depth was set to 11–18 cm for the moldboard plow, 21–30 cm for the chisel plow, and 36–45 cm for the subsoiler. The operating gear was set to low speed M3 (7.1 km/h) for the moldboard plow and chisel plow, and low-speed L3 (2.4 km/h) for the subsoiler.Fig. 4. Field experiments with different implements: (a) moldboard plow, (b) chisel plow, and (c) subsoiler.

Data analysis

The engine torque measured and rotational speed were used to calculate engine power required (Eq. (1)).

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$P_{a} = \frac{2\pi TN}{{60,000}}$$\end{document}

where $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${P}_{a}$$\end{document}$ is the engine-power requirement (kW), T is the torque (Nm), and N is the engine rotational speed (rpm).

The drawbar pull was calculated by summing the tensile forces from the two lower load cells and the compressive force from the upper load cell^26^ (Eq. (2)).

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$F = F_{TC} + F_{LL} + F_{LR}$$\end{document}

where F is the drawbar pull (kN), $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${F}_{TC}$$\end{document}$ is the compressive force on the top link (kN), $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${F}_{LL}$$\end{document}$ is the tensile force on the lower-left link (kN), and $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${F}_{LR}$$\end{document}$ is the tensile force on the lower-right link (kN).

The traction-force equation presented by the ASAE is shown in Eq. (3)^20^. Using this equation, the theoretical drawbar pull was calculated and compared with the drawbar pull obtained from experiments.

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$D = F_{i} \left[ {A + B\left( S \right) + C\left( S \right)^{2} } \right]WT$$\end{document}

where D is the draft force from ASABE (kN), $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${F}_{i}$$\end{document}$ denotes the soil-resistance factor, A, B, and C are empirical constants, S is the working speed (m/s), W is the width of the implement (m), and T is the tillage depth (m).

Table 3 presents the predicted draft force for the various tillage and seeding implements using the model parameters. This table provides the parameter values used in the calculation for each implement based on the draft force equation presented in the ASAE standard D497.4^20^. A, B, and C are the coefficients that reflect the characteristics of each implement and play a critical role in predicting the draft force. Fi represents the drawbar pull and W denotes the width of the implement. For the medium-textured soils, the value of i = 2, specifically, F_2_, was used.Table 3. Key parameters for estimating draft force of different tillage implements.ItemABCFi (F_2_)W (m)Moldboard plow65205.10.72.8Chisel plow1237.300.852.8Subsoiler29402.40.72.095

The coefficient of variation (CV) was calculated using Equation (4) to analyze the relative variability of the data. To examine the effects of different plow conditions on tractor performance, one-way analysis of variance (ANOVA) and post-hoc analysis using the least significant difference (LSD) method were conducted using IBM SPSS Statistics (SPSS 25, SPSS Inc., New York, USA). A statistically significant difference was determined at p < 0.05, which was satisfied by comparisons between multiple groups.

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$CV = \frac{SD}{{Average}}$$\end{document}

where CV denotes the coefficient of variation and SD is the standard deviation.

Machine learning models for predicting drawbar pull

The machine learning techniques used in this study include random forest (RF), extreme gradient boosting (XGB), artificial neural networks (ANN), and support vector machines (SVM). A structural overview of each model is presented in Fig. 5. Of the measured data, 70% were allocated to the training set, whereas the remaining 30% were used for the testing set.Fig. 5. Overview of machine-learning models: (a) Random Forest (RF)—ensemble of decision trees with majority voting, (b) Extreme Gradient Boosting (XGB)—sequential boosting process, (c) Artificial Neural Network (ANN)—multi-layer perceptron with hidden layers, and (d) Support Vector Machine (SVM)—optimal hyperplane with support vectors.

Table 4 presents the designs of the five models for predicting the drawbar pull by combining different input variables. Scenario A refers to a situation in which the drawbar pull can be predicted using only engine data. When using a tractor equipped with an electronically controlled engine that supports CAN communication, engine data can be collected without the need for additional sensors^27^. Scenario B represents a situation in which the tillage depth is used as an additional input variable to improve the accuracy of the drawbar pull prediction. As the tillage depth is an important factor affecting the drawbar pull, incorporating it into the prediction model can enhance the performance^28^. Scenario C describes a case in which a speed sensor is used to measure the driving speed for drawbar pull prediction. The prediction performance can be improved by measuring the driving speed and using it to calculate the slip rate^29^. Scenario D refers to a situation in which the drawbar pull is predicted using only the tillage depth and travel speed without using engine data. In this case, the drawbar pull prediction was conducted without relying on engine-related data. Scenario E represents a situation in which the drawbar pull prediction is performed using all available data, including the engine, tillage depth, and speed sensor data. This approach assumes that increasing the number of input variables leads to improved prediction accuracy^30^.Table 4. Combination of variables used for model development.ModelABCDESourceEngine speed + Engine torqueEngine speed + Engine torque + Tillage depthEngine speed + Engine torque + Travel speed + Slip ratioTillage depth + Travel speedEngine speed + Tillage depth + Travel speed

Table 5 compares the input variables, advantages, and disadvantages of different drawbar pull prediction methods. The conventional dynamometer-based approach ensures high accuracy. however, it is limited by the high cost of equipment. In contrast, ML-based models can effectively predict drawbar pull by integrating engine and soil variables that can be collected at relatively low cost.Table 5. Comparison of measurement, input, strengths, and limitations of drawbar pull prediction methods.CategoryMeasurement/InputStrengthsLimitationsConventionalDirect measurement with a dynamometerAccurate calculation of drawbar pullRequires additional equipment between tractor and implement, higher costModel AEngine speed + Engine torqueData acquisition from ECU without additional sensorsLow resolution and weak direct correlation with drawbar pullModel BEngine speed + Engine torque + Tillage depthConsiders soil resistanceIn addition to engine-based data acquisition, the installation of additional sensors and data synchronization are requiredModel CEngine speed + Engine torque + Travel speed + Slip ratioReflects soil–tractor interaction by incorporating slip ratioModel DTillage depth + Travel speedReflects soil resistance and speed effectModel EEngine speed + Tillage depth + Travel speedComprehensive reflection of engine, soil, and operating conditions with high accuracy

Table 6 compares the costs of the low-cost sensors used in this study with those of commonly employed standard sensors. While the price of a conventional load cell amounts to approximately 5,500 USD, the total cost of the potentiometer, proximity sensor, and GPS receiver utilized in this study is only about 220 USD. This finding not only demonstrates that low-cost sensors can serve as a highly cost-effective alternative, but also indicates that a system capable of predicting drawbar pull can be constructed at roughly 4% of the cost of conventional systems. Furthermore, even when accounting for potential fluctuations in sensor prices, the system can still be implemented at less than 5% of the conventional cost.Table 6. Comparison of low-cost sensors used in this study with commonly used standard sensors.ItemsConventional sensorLow-cost Sensor used in this studyPrice (USD)ModelPrice (USD)Drawbar pull sensor5500––Potentiometer–PotentiometerApprox. 80Proximity sensor–Autonics PRDCMT30-25DOApprox. 40GPS–Garmin GPS 18xApprox. 100Total5500–Approx. 220

The optimal hyperparameter values derived for each technique are listed in Table 7. In this study, the Optuna optimization method was used to determine the optimal hyperparameters. Specifically, a Bayesian optimization approach based on a tree-structured parzen estimator (TPE) was employed to efficiently search for optimal hyperparameters within the exploration space. The selection of hyperparameters varies by the model, making it essential to focus solely on the most significant hyperparameters. Removing unnecessary hyperparameters reduces computational complexity and shortens the training time, leading to more efficient resource utilization^31^.Table 7. Hyperparameter combinations used to train a machine-learning model for predicting drawbar pull.RFXGBANNSVMn_estimators: 135max_depth: 10num_leaves: 43learning rate: 0.09n_estimators: 150max_depth: 29learning rate: 0.06colsample_bytree: 0.82n_layers: 4units: 189dropout_rate: 0.105learning_rate: 0.001c: 1.0kernel: ‘rbf’gamma: 0.02

RF The RF technique is a decision-tree-based ensemble learning algorithm that constructs multiple decision trees using bootstrap sampling and aggregates the results to develop a final predictive model. This ensemble method helps prevent overfitting by aggregating the predictions from multiple decision trees. The training process is controlled by hyperparameters such as the number of trees, maximum depth, and criteria for node splitting. Optimizing the hyperparameters is essential for developing an effective RF model, as it ensures high predictive performance while minimizing the risk of overfitting^32^. To optimize the model performance, the number of trees was set to 135, with a maximum depth of 10 and 43 leaf nodes, ensuring a balance between predictive accuracy and model complexity.

XGB The XGB is a gradient-boosting algorithm that sequentially trains weak learners by minimizing residual errors from previous iterations, thereby enhancing the predictive performance^33^. Unlike RF, XGB can be sensitive to the data scale because of its use of L1 and L2 regularization in the training process. Therefore, the traction data were normalized to a range of 0–1 before the model training. The number of trees was set to 150 and the maximum depth was adjusted to 29 to ensure sufficient model complexity. The learning rate was set to 0.06 to prevent overfitting while promoting fast convergence.

ANN An ANN is a computational model consisting of multiple neurons arranged in hidden layers, where the network adjusts weights during training to minimize prediction errors^34^. To optimize the network depth, four hidden layers were applied, with each hidden layer consisting of 189 neurons. In addition, a dropout rate of 0.105 was applied to improve the generalization performance of the network and prevent overfitting.

SVM The SVM is a supervised learning algorithm that determines the optimal hyperplane to maximize the margin between classes, making it effective for classification and regression tasks. In this study, the radial-basis-function kernel was applied with a regularization parameter (C) of 1.0 and a gamma value of 0.02, to balance the model complexity and generalization ability.

Performance evaluation parameters

The metrics selected for evaluating the developed prediction models were the coefficient of determination (R^2^), root mean square error (RMSE), mean absolute percentage error (MAPE), and relative deviation (RD), as shown in Eq. (5–8). These metrics are widely used in regression analysis and model evaluation, making them suitable for assessing the predictive accuracy of the proposed models. R^2^ represents the proportion of variance in the dependent variable that is predictable from the independent variables, indicating how well the model captures variations in the actual data. RMSE is the square root of the mean of the squared differences between actual and predicted values, making it more sensitive to large errors than the mean absolute error (MAE), which simply averages absolute differences. The MAPE represents the mean of all absolute percentage errors normalized by the mean of the actual values, allowing a relative comparison of errors across different datasets. RD is a metric that normalizes the mean prediction error of the model using the mean measured value, providing a relative measure of model accuracy in predicting drawbar pull. A lower RD value indicates that the predicted values are closer to the actual values.

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$R^{2} = \frac{{\mathop \sum \nolimits_{i = 1}^{i = N} \left( {y_{i} - y_{a} } \right) - \mathop \sum \nolimits_{i = 1}^{i = N} \left( {y_{i} - \hat{y}_{i} } \right)}}{{\mathop \sum \nolimits_{i = 1}^{i = N} \left( {y_{i} - y_{a} } \right)}}$$\end{document}

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$RMSE = \sqrt {\frac{1}{N}\mathop \sum \limits_{i = 1}^{i = N} \left( {\hat{y}_{i} - y_{i} } \right)^{2} }$$\end{document}

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$MAPE = \frac{1}{N}\mathop \sum \limits_{i = 1}^{i = N} \left| {\frac{1}{{y_{i} }}\left( {y_{i} - \hat{y}_{i} } \right)} \right| \times 100\left( \% \right)$$\end{document}

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$RD = \frac{RMSE}{{Mean }} \times 100\left( \% \right)$$\end{document}

where $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${y}_{a}$$\end{document}$ is the mean measured drawbar pull, $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${y}_{i}$$\end{document}$ is the ith measured drawbar pull, and $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\widehat{{y}_{i}}$$\end{document}$ is the ith predicted drawbar pull.

Using the test data, the model was evaluated using the above evaluation metrics: R^2^, RMSE, MAPE, and RD. During the development of the prediction model, it was crucial to identify potential overfitting, which occurs when a model becomes overly tailored to the training data, leading to poor generalization. To assess overfitting, the test-to-train loss ratio (TTLR) (the ratio of MSE between the training and test models) was calculated; a TTLR < 1.5 indicates overfitting and a TTLR > 1.5 indicates underfitting. Furthermore, the R^2^ gap, which indicates the difference in the coefficient of determination between the training and test models, was used to evaluate the changes in the model accuracy on the test dataset. A large R^2^ gap indicates potential overfitting owing to better model performance on training than on the test data.

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$MSE = \frac{1}{N} \mathop \sum \limits_{i = 1}^{i = N} \left( {y_{i} - \hat{y}_{i} } \right)^{2}$$\end{document}

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$TTLR = \frac{{MSE_{test} }}{{MSE_{train} }}$$\end{document}

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$R^{2} Gap = \left| { R^{2}_{train} - R^{2}_{test} } \right|$$\end{document}

where $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${y}_{i}$$\end{document}$ denotes the ith measured and $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\widehat{{y}_{i}}$$\end{document}$ denotes the ith predicted drawbar pulls, respectively.

Robustness test

To validate the robustness of the developed model, additional data were collected and utilized. For this validation, robustness testing was performed only with the moldboard plow, as field operation data with other implements were not available from additional sites. This was because the moldboard plow is widely recognized as a standard implement for primary tillage, making it an appropriate reference tool to assess the generalizability of the proposed model. Specifically, Field D was located in Seosan-si, Chungcheongnam-do, and Field E in Anseong-si, Gyeonggi-do. Soil analysis revealed that Field D was classified as loamy sand, while Field E was classified as clay loam. Furthermore, the physical properties of the soil indicated that the cone index (0–15 cm depth) of Field D was 483 kPa with a soil moisture content of 33.79%, whereas Field E had a cone index of 1034 kPa and a soil moisture content of 33.60%^35^. These differences highlight the distinct soil conditions between the two test fields and served as an important basis for validating the robustness of the proposed model.

Research framework and workflow

Figure 6 provides a comprehensive overview of the research framework applied in this study, which consists of three main stages: data acquisition, model development, and validation. In the data acquisition stage, drawbar pull data were collected from three experimental fields in Dangjinsi, Chungcheongnam-do, using different implements: a moldboard plow (Field A), a chisel plow (Field B), and a subsoiler (Field C). All three sites were classified as loam. The dataset was divided into training (70%) and testing (30%) sets. In addition, for robustness validation, independent datasets were collected in Seosansi, Chungcheongnam-do (Field D, loamy sand) and Anseongsi, Gyeonggi-do (Field E, clay loam) using a moldboard plow. These datasets were used exclusively as the validation set. In the model development stage, five models (A–E) were constructed based on different combinations of input variables, including engine speed, engine torque, tillage depth, travel speed, and slip ratio. Hyperparameter optimization was performed using Optuna, and four machine learning methods—RF, ANN, XGB, and SVM—were applied. The validation stage consisted of three procedures. First, model performance was evaluated using the test set. Second, the prediction results of Model D were compared with the ASABE drawbar pull equation to examine differences between the empirical formula and the machine learning approach. Third, the models were tested with independent datasets from Seosan and Anseong to verify whether they could operate reliably under soil conditions not represented in the training data, thereby demonstrating their robustness and generalizability.Fig. 6. Framework for developing and validating machine learning-based drawbar pull prediction model.

Results

Engine characteristic profile

The values of the engine-load measurements are shown in Figure 7. The engine torque and power were the highest for the moldboard plow, followed by the chisel plow and the subsoiler, whereas the engine rotational speed was the highest for the subsoiler, followed by the chisel plow and the moldboard plow. It is believed that, at an equal operating speed, the subsoiler would impose the highest engine load owing to its deeper and wider soil penetration. However, since its operating speed was set lower to avoid excessive draft resistance and slippage, the moldboard plow imposed the highest engine load in the present results.Fig. 7. Engine-load characteristics with time for different tillage tools: comparison of engine torque, engine speed, and engine power.

The results of the statistical analysis of the tractor-engine performance are shown in Table 8. The chisel-plow operation exhibited high CV values for the engine torque and power, indicating significant variations in engine performance during operation. This result suggests that the engine load may fluctuate irregularly, depending on the working conditions or soil resistance. Furthermore, the ANOVA results revealed statistically significant differences in all the engine-performance indicators, clearly demonstrating distinct engine performance characteristics based on the type of plow used.Table 8. Statistical analysis results of engine torque, speed, and power for different types of plows.ItemsEngine torque (Nm)Engine speed (rpm)Engine power (kW)MCSMCSMCSMax359.1316.9216.82,3612,4462,45378.4477.0854.98Min299.5136.8128.92,0082,3222,42373.9934.9633.07Avg330.1^A^254.0^B^167.5^C^2,243^C^2,404^B^2,440^A^77.44^A^63.82^B^42.77^C^Std10.135.714.4702250.768.473.59CV0.0310.1400.0860.0310.0090.0020.0100.1330.084Means with different superscripts (A, B, and C) in each row are significantly different (p < 0.05) according to LSD multiple range tests. * M: Moldboard plow; C: Chisel plow; S: Subsoiler.

Assessment of tractor work performance

The results of the tractor-performance measurements are shown in Figure 8. The subsoiler operation exhibited a high and variable slip ratio, because of the high drawbar resistance relative to the axle power. The travel speed was highest for the chisel plow, followed by the moldboard plow and subsoiler, whereas the slip ratio showed the opposite trend. These differences appear to result from variations in the soil interactions based on the working mechanisms of each plow. The axle power was the highest for the moldboard plow, followed by the chisel plow and subsoiler. These findings indicate that the work performance of a tractor varies depending on the type of plow, highlighting the need for tractor settings that are optimized for specific work characteristics.Fig. 8. Tractor working-performance metrics with time for different tillage tools: comparison of travel speed, slip ratio, and tillage depth.

The results of the statistical analysis of the tractor performance are presented in Table 9. The CV of the slip ratio was high for both chisel plow and subsoiler operations. Notably, the average slip ratio for the subsoiler is 29.8%, indicating a high level. Additionally, all work-performance metrics exhibited statistically significant differences.Table 9. Statistical analysis results of tractor work performance by plow types.ItemsTravel speed (km/h)Slip ratio (%)Tillage depth (cm)MCSMCSMCSMax6.8528.1122.79732.414.663.517.8729.2544.98Min4.4266.2781.0003.70.00.011.0821.2036.54Avg5.795^B^6.918^A^1.770^C^15.8^B^6.1^C^29.8^A^15.03^C^25.78^B^40.67^A^Std0.3570.2400.3463.93.013.90.891.301.71CV0.0620.0350.1950.2450.4970.4650.0590.0500.042Means with different superscripts (A, B, and C) in each row are significantly different (p < 0.05) according to LSD multiple range tests. * M, Moldboard plow; C, Chisel plow; S, Subsoiler.

Analysis of traction performance for various tillage implements

The results of the tractor traction performance measurements are shown in Figure 9. High levels of drawbar pull were observed during the moldboard-plow and subsoiler operations, whereas the chisel-plow operation exhibited a highly variable drawbar pull. The drawbar power was the highest for the moldboard plow, followed by the chisel plow and subsoiler, attributed to the lower travel speed of the subsoiler. The traction ratio showed results similar to those of the drawbar pull because no significant differences were observed in the total weight of the tractor. The traction efficiency was highest for the moldboard plow, followed by the chisel plow and subsoiler.Fig. 9. Traction performance factors with time for different tillage tools: comparison of drawbar pull.

The results of the statistical analysis of the tractor drawbar pull are listed in Table 10. Among all the drawbar pull indicators, the moldboard-plow operation demonstrated the highest drawbar pull. The chisel-plow operation exhibited high CV values across all the drawbar pull indicators. In addition, all drawbar pull metrics showed statistically significant differences.Table 10. Statistical analysis results of drawbar pull by plow types.ItemsDrawbar pull (kN)Moldboard plowChisel plowSubsoilerMax30.8418.4131.35Min19.823.8818.26Avg24.73^A^11.60^C^24.39^B^Std1.612.252.08CV0.0650.1940.085Means with different superscripts (A, B, and C) in each row are significantly different (p < 0.05) according to LSD multiple range tests.

A comparison between the ASAE standard and the measured drawbar pull results is shown in Figure 10. The differences between the moldboard plow, chisel plow, and subsoiler were 1.90, 8.92, and 35.5%, respectively. The larger discrepancy in the subsoiler soil may be attributed to increased slip and other errors caused by deeper tillage. The error margins suggested by the ASAE standards were 40, 50, and 50% for the moldboard plow, chisel plow, and subsoiler, respectively, indicating that the measured data were reliable^20^.Fig. 10. Drawbar pull for different tillage tools: comparison between this study and ASABE standards.

Results of correlation analysis

Figure 11(a) shows the results of the correlation analysis among the major variables during moldboard plow operations. The correlation coefficients (r) of the drawbar pull ranged from -0.74 to 0.69 with key tractor variables. The variables that had the greatest influence on the drawbar pull were travel speed (r = − 0.74) and slip ratio (r = 0.69). Fig. 11(b) shows the results of the correlation analysis among the major variables during chisel plow operations. During chisel plow operations, the correlation coefficients of the drawbar pull ranged from − 0.72 to 0.66 with key tractor variables. The variables that had the greatest influence on the drawbar pull were travel speed (r = − 0.72) and slip ratio (r = 0.66). Fig. 11(c) shows the results of the correlation analysis among the major variables during subsoiler operations. During subsoiler operations, the correlation coefficients of the drawbar pull ranged from − 0.61 to 0.59 with key tractor variables. The variables that had the greatest influence on the drawbar pull were travel speed (r = -0.61), and slip ratio (r = 0.59).Fig. 11. Critical performance data for tractor operations with plow (a) moldboard plow, (b) chisel plow, and (c) subsoiler: Results of correlation analysis.

ET: Engine torque (Nm); ES: Engine speed (rpm); EP: Engine power (kW); TS: Travel speed (km/h); DP: Drawbar pull (kN); SR: Slip ratio (%); TD: Tillage depth (cm).

Machine-learning-based traction-force prediction

Moldboard plow Figure 12 shows the results of the traction-force prediction model for the moldboard plow, comparing the measured and predicted drawbar pulls across the five variable combinations. The model development results indicated that, except for Model A of the XGB and SVM, the predicted drawbar pull was closely aligned with the measured drawbar pull along the 1:1 reference line. Table 11 presents the performance metrics of the various machine-learning algorithms, including RF, XGB, ANN, and SVM. These algorithms were used to predict the drawbar pulls of a moldboard plow. The model performance was evaluated using the R^2^, RMSE, MAPE, and RD metrics for both the training and test datasets. Additionally, model overfitting was assessed based on R^2^ Gap and TTLR values. Among the five models, the performance of Model A was lower than that of the other four models. However, except for Model A, the R^2^ values of the remaining four models exceeded 0.68, with MAPE within 2.6%, RMSE below 0.81 kN, and RD under 3.21%, demonstrating stable predictive performance. Model E, which employed RF, achieved the highest predictive accuracy, with an R^2^ value of 0.977, RMSE of 0.118 kN, MAPE of 0.44%, and RD of 0.47%. The overfitting-assessment results show that the TTLR values for the RF model range from 1.266–1.856, indicating a slight risk of overfitting. However, as the R^2^ Gap values remained within an acceptable range (0.013–0.138), the generalization performance of the model can be considered reliable.Fig. 12. Scatter plots of the measured and predicted drawbar pull for the moldboard plow: (a) RF, (b) XGB, (c) ANN, and (d) SVM.Table 11. Performance evaluation of machine-learning models for predicting drawbar pull in moldboard-plow operations.ModelR^2^RMSE (kN)MAPE (%)RD (%)TrainTestR^2^ GapTrainTestTestTrainTestTrainTestARF0.6330.4950.1380.6450.7461.3371.822.142.613.02XGB0.4600.4290.0310.7830.7941.0282.442.533.173.21ANN0.6180.6010.0160.3050.3301.1660.910.970.010.01SVM0.4820.3900.0920.7720.8091.0990.020.030.030.03BRF0.8890.7930.0960.3540.4831.8560.891.170.011.95XGB0.8010.7310.0710.4790.5361.2551.391.481.942.17ANN0.7820.7530.0300.4970.5231.1041.401.450.020.02SVM0.8160.7700.0460.4520.5161.3051.291.431.832.09CRF0.9680.9440.0240.1900.2271.4320.520.690.010.92XGB0.8980.8770.0200.3380.3761.2370.010.010.010.02ANN0.9020.8870.0150.3330.3531.1241.041.100.010.01SVM0.8150.8000.0150.4590.4691.0440.010.020.020.02DRF0.8790.8460.0320.3680.4201.2991.061.200.011.70XGB0.7040.718− 0.0140.5740.5720.9911.841.832.322.31ANN0.9180.9010.0160.3050.3301.1660.910.970.010.01SVM0.7370.741− 0.0040.5380.5561.0711.711.752.172.25ERF0.9900.9770.0130.1050.1181.2660.280.440.000.48XGB0.8810.8580.0230.3680.3951.1571.131.201.491.60ANN0.7720.7540.0180.5090.5221.0491.611.650.020.02SVM0.6840.699− 0.0160.5860.6071.0721.821.892.372.45

Chisel plow Fig. 13 shows the results of the traction-force prediction model for the chisel plow, comparing the measured and predicted drawbar pulls across the five variable combinations. The Model A development results indicated that, except for RF Model A, the predicted drawbar pull was closely aligned with the measured drawbar pull along the 1:1 reference line. Additionally, the agreement with the 1:1 reference line improved as the number of variables increased. Table 12 summarizes the performance metrics of the chisel plow. The analysis results indicated that all models achieved an R^2^ value of at least 0.73. Model A, which used only the tractor-engine variables (engine torque and engine speed) as input variables, had an average R^2^ value of 0.860. In contrast, Model B, which incorporated the tillage depth in addition to the tractor engine variables (engine torque and engine speed), showed an improved average R^2^ value of 0.874. Furthermore, Model C, which included travel speed along with the tractor-engine variables (engine torque and engine speed), achieved the highest predictive performance with an average R^2^ value of 0.911. These findings confirm that including additional variables, such as tillage depth or travel speed, improves the model’s predictive performance compared to using only tractor-engine variables. The overfitting-assessment results showed that the R^2^ gap remained below 0.049 and the TTLR remained under 1.429, demonstrating an overall appropriate level of model generalization.Fig. 13. Scatter plots of the measured and predicted drawbar pull for the chisel plow: (a) RF, (b) XGB, (c) ANN, and (d) SVM.Table 12. Performance evaluation of machine learning models for predicting drawbar pull in chisel-plow operations.ModelR^2^RMSE (kN)MAPE (%)RD (%)TrainTestR^2^ GapTrainTestTTLRTrainTestTrainTestARF0.7560.7330.0230.9080.9511.0986.046.267.798.16XGB0.8970.8840.0120.5920.6261.1214.134.375.085.37ANN0.9230.9160.0070.5100.5351.0993.323.560.040.05SVM0.9230.9080.0150.5110.5591.1990.030.040.040.05BRF0.9360.9120.0230.4660.5471.3793.163.704.004.69XGB0.8000.7860.0140.8180.8631.1125.586.007.027.40ANN0.9590.9530.0060.2150.2271.1160.610.640.020.02SVM0.8750.8460.0290.6470.7321.2814.495.045.556.28CRF0.9580.9290.0290.3730.4451.4292.433.263.203.82XGB0.9150.9000.0150.5360.5801.1700.040.040.050.05ANN0.9180.9010.0160.3050.3301.1660.910.970.030.03SVM0.9380.9130.0250.4570.5411.4010.030.040.040.05DRF0.9110.8610.0490.6520.6841.1023.544.515.595.87XGB0.8100.8050.0060.8020.8101.0205.565.546.886.95ANN0.9590.9500.0090.3730.4121.2232.402.640.030.04SVM0.8090.8050.0050.8050.8101.0155.605.576.906.95ERF0.9690.9470.0230.3740.4241.2852.062.683.213.64XGB0.8730.876− 0.0030.6550.6520.9904.494.545.625.59ANN0.7720.7540.0180.5090.5221.0491.611.650.040.04SVM0.9250.9200.0050.5030.5221.0753.393.494.324.48

Subsoiler Fig. 14 shows the results of the traction-force prediction model for the subsoiler, comparing the measured and predicted drawbar pulls across the five variable combinations. The model development results indicated that in all cases, the predicted drawbar pull was closely aligned with the measured drawbar pull along the 1:1 reference line. Model E, which included engine performance, draft depth, and speed sensor data, exhibited the highest agreement with the 1:1 reference line. Table 13 summarizes the performance metrics of the subsoiler. The analysis results indicated that Model D, which used tillage depth and travel speed as input variables while excluding the tractor-engine variables, had the lowest average R^2^ value of 0.671. By contrast, Model E, which incorporated tractor-engine variables along with tillage depth and travel speed, achieved the highest average R^2^ value of 0.908. These findings suggest that an appropriate combination of engine- and tractor-related variables is necessary to optimize the performance of the traction-force prediction model. The overfitting-assessment results showed that Model A of the SVM exhibited a high TTLR value of 8.506, suggesting a potential overfitting problem in the model with respect to the training data. However, owing to the nature of SVM, the TTLR values can be relatively high. Considering SVM operates by separating data in a high-dimensional space, it may react sensitively to training data, leading to an increase in TTLR values for certain datasets.Fig. 14. Scatter plots of the measured and predicted drawbar pull for the subsoiler: (a) RF, (b) XGB, (c) ANN, and (d) SVM.Table 13. Performance evaluation of machine learning models for predicting drawbar pull in subsoiler operations.ModelR^2^RMSE (kN)MAPE (%)RD (%)TrainTestR^2^ GapTrainTestTTLRTrainTestTrainTestARF0.9670.9160.0510.4460.5381.4520.931.403.832.20XGB0.8220.7830.0390.7960.8631.1752.592.833.263.54ANN0.5600.5520.0081.2531.2400.9793.953.560.050.05SVM0.9870.8810.1050.2190.6388.5060.010.020.010.03BRF0.8760.7990.0770.7110.8481.4231.982.556.103.48XGB0.6980.6630.0361.0281.0981.1393.353.554.224.50ANN0.9590.9530.0060.2150.2271.1160.610.640.010.01SVM0.8180.7680.0500.7990.9101.2992.552.903.273.73CRF0.9020.8270.0750.6380.7851.5161.742.405.473.22XGB0.8390.7910.0470.7520.8621.3130.020.030.030.04ANN0.9180.9010.0160.3050.3301.1660.910.970.010.01SVM0.8880.8170.0710.6280.8091.6570.020.030.030.03DRF0.8450.7490.0960.7370.9481.6542.222.796.323.88XGB0.6180.5900.0281.1571.2111.0963.894.104.744.96ANN0.7720.7540.0180.5090.5221.0491.611.650.020.02SVM0.6140.5920.0221.1631.2091.0803.944.124.774.96ERF0.9660.9340.0320.3950.4791.4690.951.373.391.96XGB0.9120.8880.0240.5590.6271.2581.761.972.292.57ANN0.9590.9530.0060.2150.2271.1160.610.640.010.01SVM0.8670.8550.0120.6850.7121.0802.172.252.812.92

Comparison of the ASABE traction equation and machine learning based prediction models

Table 14 presents the results of comparing the performance of the ASABE drawbar pull equation and machine learning models across different plow types. Among the machine learning models, Model D was constructed using the same input variables as the ASABE equation, namely tillage depth and travel speed. Based on the results, the ASABE drawbar pull equation showed very low coefficients of determination (R^2^ = 0.029–0.158) across all plow types, indicating its limited ability to explain variations in drawbar pull. According to ASABE Standard (D497.4), an error of approximately ±40% has been reported for moldboard plows, and about ±50% for chisel plows and subsoilers^20^. However, in this study the equation performed even worse than the error ranges reported in the ASABE standard. One possible explanation is that the empirical formula was developed using data that are thought to reflect soil and tillage conditions in the United States, while the datasets applied here came from Korean fields with different soil properties and management practices. These regional differences may have widened the gap between the actual drawbar pull observed in the experiments and the simplified predictors included in the ASABE equation. Moreover, this is considered to be due to the fact that it does not account for key powertrain variables such as engine torque, engine speed, and slip ratio^36^. Instead, it is considered to primarily rely on implement-related factors such as working speed, implement width, and tillage depth. In contrast, the machine learning models achieved substantially higher R^2^ values, with ANN exhibiting the best performance for the chisel (0.950) and moldboard (0.901) plows. For the subsoiler, RF (0.749) and ANN (0.754) outperformed the other models.Table 14. Comparison R^2^ of ASABE drawbar pull equation and machine learning models across different plow types.Plow typeASABE StandardMachine learning (Model D)RFXGBANNSVMMoldboard0.1580.8460.7180.9010.741Chisel0.1550.8610.8050.9500.805Subsoiler0.0290.7490.5900.7540.592

Comparison and evaluation of machine-learning-based predictive models

Figure 15 shows the traction-force prediction performance of the machine-learning models, presenting the R^2^ values for each type of plow. For the moldboard plow, Model A exhibited the lowest R^2^ values, with XGB and SVM showing the lowest performances of 0.429 and 0.390, respectively. In Model C, the R^2^ values improved significantly across all models, with RF and ANN achieving high performances of 0.944 and 0.887, respectively. Overall, Models C and E demonstrated superior predictive performances, with RF showing the highest accuracy. For the chisel plow, Model A showed relatively high R^2^ values, and all models maintained a certain level of predictive performance. The R^2^ values ranged from 0.733–0.953, with RF and XGB achieving the highest predictive accuracies. For the subsoiler, the ANN exhibited the highest predictive performance in Models B and E, with R^2^ values of 0.953. RF consistently maintained high performance across all models, particularly in Models A (0.916) and E (0.934). In contrast, SVM and XGB had relatively lower predictive performances than RF and ANN.Fig. 15. Traction-force prediction for different tillage tools: (a) moldboard plow, (b) chisel plow, and (c) subsoiler. Comparison of R^2^ among machine-learning Models (A–E): Model A = drawbar pull as a function of engine speed and engine torque; Model B = drawbar pull as a function of engine speed, engine torque, and tillage depth; Model C = drawbar pull as a function of engine speed, engine torque, travel speed, and slip ratio; Model D = drawbar pull as a function of tillage depth and travel speed; Model E = drawbar pull as a function of engine speed, tillage depth, and travel speed.

Figure 16 presents the RMSE values for each type of plow, comparing the traction-force prediction performance of the machine-learning models. Across all plow types, the ANN and RF recorded the lowest RMSE values. Additionally, Model E, which incorporated tractor-engine variables (engine speed) along with tillage depth and travel speed, achieved lower RMSE values than Model A, which only used the tractor-engine variables (engine torque and engine speed).Fig. 16. Traction-force prediction for different tillage tools: (a) moldboard plow, (b) chisel plow, and (c) subsoiler. Comparison of RMSE among machine-learning Models (A–E): Model A = drawbar pull as a function of engine speed and engine torque; Model B = drawbar pull as a function of engine speed, engine torque, and tillage depth; Model C = drawbar pull as a function of engine speed, engine torque, travel speed, and slip ratio; Model D = drawbar pull as a function of tillage depth and travel speed; Model E = drawbar pull as a function of engine speed, tillage depth, and travel speed.

Results of robustness test

Figure 17 presents the results of the robustness test based on scatter plots of the measured and predicted values obtained using the moldboard plow. In the model development process, the R^2^ values of the test set ranged from 0.390 to 0.977, with values of 0.390 to 0.777 in loamy sand soils and 0.405 to 0.775 in clay loam soils. These results suggest that the model, which had been trained on loam soils, still produced meaningful predictions when applied to other soil types, although its accuracy declined. The lower R^2^ values observed during external validation can be explained by the strong effect of soil physical properties on drawbar pull. Earlier studies have reported that, in robustness checks using external datasets, R^2^ close to 0.7 are often considered an acceptable result^37^. These results show that the model keeps a practical level of robustness with diverse soil data. At the same time, adding training data from a wider range of soils will be important for improving the model’s generalizability.Fig. 17. Robustness test using scatter plots of the measured and predicted values obtained with the moldboard plow: (a) RF, (b) XGB, (c) ANN, and (d) SVM.

Discussion

In this study, models for predicting the tractor drawbar pull were developed based on RF, XGB, ANN, and SVM. The performance of these models was compared across five variable combinations. The analysis of the results showed that the traction-force predictions were dependent on the type of plow and combination of input variables. As the number of input variables increased, model performance improved; incorporating tillage depth and travel speed in addition to tractor-engine parameters led to improved prediction accuracy.

For the moldboard plow, the prediction performance of Model A, which applied XGB and SVM, was relatively low, with R^2^ values of 0.429 and 0.390, respectively, indicating a lower accuracy than the other models. However, in Model C, the prediction performances of all models significantly improved, with RF and ANN achieving R^2^ values of 0.944 and 0.887, respectively, demonstrating superior performance. These results suggest that using engine speed, engine torque, travel speed, and slip ratio as input variables and applying the RF are the most effective approaches for predicting the action force of a moldboard plow. For the chisel plow, all the models maintained R^2^ values above 0.733, indicating a relatively stable performance. RF and XGB demonstrate the highest prediction accuracies. However, for the subsoiler, the ANN achieved the highest performance in Models B and E, with R^2^ = 0.953, whereas the RF model also achieved the same R^2^ = 0.953 in Model E, demonstrating stable performance. These results indicate that when predicting the action force of a subsoiler, setting engine speed, tillage depth, and travel speed as input variables is advantageous for maximizing model performance.

Overall, the RF model achieved a superior performance across all three plow types. However, the TTLR value for the RF model ranges from 1.266–1.856, suggesting a potential risk of overfitting. This result indicates that while the RF-based model performed well on the training data, additional adjustments are necessary to ensure the generalization of the test data.

The robustness test results showed that the R^2^ values ranged from 0.390 to 0.777 for loamy sand and from 0.405 to 0.775 for clay loam, which were lower than the performance metrics obtained during model development. Nevertheless, these values can be considered reasonable compared to previous studies. However, further research is needed to incorporate more diverse soil conditions and improve the generalization performance of the model.

Conclusions

This paper presents a machine-learning-based approach for estimating tractor drawbar pull, providing a cost-effective alternative to expensive load cells. The primary objective of this study was to identify the optimal machine learning technique and input variable combinations for each plow type (moldboard plow, chisel plow, and subsoiler). The proposed methodology can enhance precision agriculture by enabling real-time performance monitoring and optimization.

We developed a tractor traction-force prediction model using four machine-learning techniques, incorporating key tractor parameters as input variables. The R^2^ values for each model (RF, XGB, ANN, and SVM) were 0.495, 0.977, 0.429, 0.888, 0.552, 0.953, and 0.390, 0.920, respectively. However, the model performance varies significantly depending on the type of plow and input variables used. A comparison of the four models indicates that increasing the number of input variables is a key factor in improving the R2 values. Models incorporating travel speed, tillage depth, and slip ratio, in addition to engine data, exhibited superior performance compared to those using only engine parameters.

However, this study has several limitations. First, the data were collected under restricted conditions in specific agricultural fields, which limits the generalizability of the findings, particularly due to the failure to account for diverse soil conditions such as sandy or clay soils. Secondly, the dataset was not sufficiently large, necessitating additional data collection for a more reliable analysis. Third, various influencing factors were not considered simultaneously as the analysis was conducted on individual variables.

These limitations will be addressed in future studies using field experiments to collect data under diverse working conditions. Future research should focus on improving model reliability by applying hyperparameter optimization and regularization techniques to mitigate overfitting.

Finally, this paper proposes a methodology for applying various machine-learning techniques to tractor traction-force prediction. This study demonstrated that machine learning techniques utilizing relatively low-cost sensors can serve as effective alternatives to conventional complex methods for measuring drawbar pulls. With the advancements in agricultural technology, the models developed in this study are expected to have various practical applications.

Bibliography2

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1ASABE standards, Procedures for Using and Reporting Data Obtained with the Soil Cone Penetrometer. ASABE EP 542. St. Joseph, Michigan (2018).
2Ngo, Q.H., Le-Khac, N.A. & Kechadi, T. Predicting soil p H by using nearest fields. Preprint at http://arxiv.org/abs/ar Xiv:1912.01303 (2019).