Discrete Choice Models and Artificial Intelligence Techniques for Predicting the Determinants of Transport Mode Choice—A Systematic Review

Mujahid Ali

doi:10.32604/cmc.2024.058888

Open Access

REVIEW

Mujahid Ali^*

Department of Transport Systems, Traffic Engineering and Logistics, Faculty of Transport and Aviation Engineering, Silesian University of Technology, Katowice, 40019, Poland

* Corresponding Author: Mujahid Ali. Email: email

Computers, Materials & Continua 2024, 81(2), 2161-2194. https://doi.org/10.32604/cmc.2024.058888

Received 17 September 2024; Accepted 17 October 2024; Issue published 18 November 2024

Abstract

Forecasting travel demand requires an understanding of individual decision-making behavior. However, transport mode choice (TMC) is determined by personal and contextual factors that vary from person to person. Numerous characteristics have a substantial impact on travel behavior (TB), so they must be taken into account when studying transport options. Traditional statistical techniques frequently presume linear relationships, but real-world data rarely satisfy this assumption, which makes complex interactions harder to capture. To overcome these challenges, a thorough systematic review was conducted to examine how machine learning (ML) approaches can capture nonlinear relationships that conventional methods may miss. The present study analyzes in depth the discrete choice models (DCM), ML algorithms, datasets, model validation strategies, and tuning techniques employed in previous research. The review also summarizes the DCM and ML models used to predict TMC and to identify the determinants of TB in urban areas for different transport modes. The two primary goals of the study are to establish the present conceptual frameworks for the factors influencing TMC for daily activities and to pinpoint methodological issues and limitations in previous research. Based on a total of 39 studies, the findings highlight the significance of considering the factors that influence TMC. Adjusted-kernel algorithms and hyperparameter-optimized ML algorithms outperform typical ML algorithms. Random forest (RF), support vector machine (SVM), artificial neural network (ANN), and interpretable ML algorithms are the most widely used ML algorithms for predicting TMC. RF achieved an R² of 0.95 and SVM achieved an accuracy of 93.18%; however, the adjusted kernel raised the accuracy of SVM to 99.81%, which shows that the interpretable algorithms outperformed the typical algorithms. The sensitivity analysis indicates that the most significant parameters influencing TMC are age, total trip time, and the number of drivers.

Keywords

Machine learning techniques; AI; transport mode choice; discrete choice model; sustainable transportation

Glossary/Nomenclature/Abbreviations

ML	Machine Learning
TMC	Transport Mode Choice
DCM	Discrete Choice Model
BE	Built Environment
TB	Travel Behavior
GBT	Gradient Boosting Trees
XGB	Extreme Gradient Boosting
k-NN	k-Nearest Neighbors
RF	Random Forest
NB	Naïve Bayes
BN	Bayesian Network
NN	Neural Network
DT	Decision Tree
SVM	Support Vector Machine
FSVM	Fuzzy Support Vector Machine
GE	Gene Expression
GEP	Gene Expression Program
SEM	Structural Equation Modeling
PT	Public Transport

1 Introduction

The term “transport mode choice (TMC)” refers to the selection among different transport options, which may include a private vehicle, a public vehicle, walking, cycling, or other modes of transportation. TMC is frequently modeled with a discrete choice model whose alternatives correspond to the available trip modes. It describes an individual’s choice of a specific transport mode for participating in an activity at a different place [1]. The choice of travel mode for a certain journey is determined by many factors, both personal and contextual [2]. These factors can differ from one person to another and from one location to another, but they typically include infrastructure and accessibility [3], time [4], cost [5], and purpose of travel, as well as factors like health and physical ability [6], demographics, personal preference [7], built environment (BE) concerns, weather conditions [8], information and technology (IT), traffic congestion, safety, parking availability, and governmental policies. The terms “connectivity” and “accessibility” are closely related. Connectivity describes the link between areas and hubs of activity, whereas accessibility describes a person’s or a product’s ability to travel by different modes of transportation [9].

The factors influencing people’s choice of transport mode can be strongly affected by changes to urban infrastructure and policies, which alter how people choose among private vehicles, walking, cycling, and public transportation. Because these factors change over time, it is critical to understand how machine learning (ML) models can capture the changing dynamics. Abulibdeh studied the introduction of new metro lines using ML algorithms and concluded that urban infrastructure significantly affects TMC [10]. After comparing the urban infrastructure of Germany and America, Buehler concluded that TMC is greatly affected by living in lower-density neighborhoods, farther from public transportation, and with a more restricted mix of land uses [11]. Policy changes such as parking pricing, higher fuel taxes, subsidies for public transport, and employee-paid parking significantly encouraged the modal shift from private cars to public transport, walking, and cycling [12–14]. Beckx et al. concluded that 64% of car trips were shorter than 8 km and could be replaced by walking and cycling, which would save 3% of fuel consumption [15].

Numerous traditional discrete choice models and statistical analyses, including the nested logit model, linear and non-linear regression analysis, the mixed logit model, binary logistic regression [16], the multinomial logistic (MNL) model [17], bivariate statistical analysis [18], and Structural Equation Modeling (SEM) [11,19], have been employed in recent and earlier studies to examine the relationships among independent variables, mediating variables, and TMC (PT, private vehicle, and active transport) as the dependent variable [20]. The mutual information (MI) [21] test method, which calculates the relevance of the inputs, was used by various researchers to determine the most influential component and its impact on TMC [22]. Even though some regression models can estimate interaction and quadratic effects, they are sensitive to outliers and have trouble reflecting the complex interactions among variables. Furthermore, because fewer diagnostic tools are available, spotting anomalies in nonlinear regression is more difficult than in linear regression [23]. Researchers therefore turn to ML approaches, which rarely rely on assumptions, can handle enormous datasets, outliers, and missing values, and thus have advantages over traditional statistical methods [24].
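To make the MNL formulation mentioned above concrete, the following minimal sketch computes choice probabilities as a softmax over linear utilities. The coefficient values, the attribute names (`time_min`, `cost`, `asc`), and the three modes are hypothetical and chosen only for illustration; they are not estimates from any of the reviewed studies.

```python
import math

# Hypothetical taste coefficients (assumed for illustration only).
BETA_TIME = -0.05   # utility per minute of travel time
BETA_COST = -0.30   # utility per unit of travel cost

def mnl_probabilities(alternatives):
    """Return MNL choice probabilities for a dict of mode -> attributes."""
    utilities = {
        mode: attrs["asc"] + BETA_TIME * attrs["time_min"] + BETA_COST * attrs["cost"]
        for mode, attrs in alternatives.items()
    }
    # Subtract the largest utility before exponentiating for numerical stability.
    u_max = max(utilities.values())
    exp_u = {mode: math.exp(u - u_max) for mode, u in utilities.items()}
    total = sum(exp_u.values())
    return {mode: value / total for mode, value in exp_u.items()}

# Hypothetical trip: "asc" is the alternative-specific constant of each mode.
modes = {
    "car": {"asc": 0.0, "time_min": 20, "cost": 4.0},
    "public_transport": {"asc": -0.5, "time_min": 35, "cost": 1.5},
    "bicycle": {"asc": -1.0, "time_min": 40, "cost": 0.0},
}
probs = mnl_probabilities(modes)
for mode, p in probs.items():
    print(f"{mode}: {p:.3f}")
```

Because utilities enter linearly, a model of this form cannot represent nonlinear effects or interactions unless they are specified by hand, which is the limitation that motivates the ML approaches discussed next.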

ML models are emerging as a compelling alternative to multinomial logit (MNL) models in TB research, where tree-based ensemble models such as gradient boosting and random forest (RF) have proven effective. MNL [25], k-nearest neighbors (k-NN), neural networks (NNs), RF, decision trees (DT), gradient boosting trees (GBT) [26], support vector machines (SVM) [22], and Naïve Bayes (NB) [27] are the techniques most often used in recent studies. The majority of these models outperformed more conventional statistical methods [24,28,29]. It should be noted, however, that most of these researchers applied the ML models with default settings, which can lead to suboptimal results, partly because some ML algorithms are natively designed for binary classification. Qian et al. applied an adjusted kernel function to SVM to map complex datasets into a high-dimensional space in which the data points are easier to separate, and concluded that the adjusted-kernel SVM (SVMAK) gives higher accuracy than the typical SVM [22]. Most researchers who tuned the hyperparameters did so using random search or grid search, both of which have drawbacks of their own [30]. As a tool for policy analysis, the optimized GBT is used to investigate and assess ways to increase the use of more environmentally friendly transportation options while decreasing the use of private vehicles.
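The difference between the two tuning strategies named above can be sketched in a few lines. This is only an illustration under stated assumptions: `validation_score` is a toy stand-in for a cross-validated accuracy, and the hyperparameter names `c` and `gamma` and their ranges are hypothetical rather than taken from any reviewed study.

```python
import itertools
import random

def validation_score(c, gamma):
    """Toy stand-in for a cross-validated accuracy; it peaks at c=10, gamma=0.1."""
    return 1.0 - ((c - 10.0) / 100.0) ** 2 - (gamma - 0.1) ** 2

def grid_search(c_values, gamma_values):
    """Evaluate every combination on a fixed grid and keep the best one."""
    candidates = itertools.product(c_values, gamma_values)
    return max(candidates, key=lambda params: validation_score(*params))

def random_search(n_trials, seed=0):
    """Sample hyperparameters uniformly from assumed ranges and keep the best one."""
    rng = random.Random(seed)
    candidates = [
        (rng.uniform(0.1, 100.0), rng.uniform(0.001, 1.0)) for _ in range(n_trials)
    ]
    return max(candidates, key=lambda params: validation_score(*params))

best_grid = grid_search([0.1, 1.0, 10.0, 100.0], [0.001, 0.01, 0.1, 1.0])
best_random = random_search(n_trials=16)
print("grid search:", best_grid)
print("random search:", best_random)
```

Both runs use 16 evaluations. The grid only finds the optimum if it happens to lie on a grid point, and its cost grows multiplicatively with every added hyperparameter, whereas random search covers the ranges more freely but offers no guarantee of landing near the optimum.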

To overcome the limitations of individual ML tools and techniques, the latest studies use interpretable ML techniques, in which several ML techniques are combined to better understand TMC decisions. Kashifi et al. predicted TMC using five different interpretable ML models (LR, RF, DT, Multilayer Perceptron, LightGBDT) [31]. Because the black-box nature of ML makes it challenging to adequately describe the link between the input and output variables, Kim suggested an interpretable ML strategy to increase the interpretability of ML in TMC modeling [32]. Zhao et al. used an interpretable ML approach to explore the heterogeneity in mode-switching behavior and concluded that a machine-learning classifier combined with model-independent interpretation tools offers useful insights into the mode-switching behavior of travelers [33].

Based on sensitivity analysis and feature importance metrics, which identify the most predictive features for the target variable, some researchers have claimed that factors such as the reason for not walking [22], household drivers, total travel time [25], household vehicles [31], and the purpose of the trip [34] are the most influential. Conversely, other researchers have concluded that household income, socio-demographic factors (such as age and gender) [35], the number of stops, road infrastructure availability [17], and accessibility (the distance between the last stop and the resident location) [36] are the most significant contributors.
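As a rough illustration of how feature relevance can be scored, the sketch below computes the mutual information (MI) between a discrete feature and the chosen mode. The toy sample and the feature names (`has_licence`, `weekday`) are hypothetical; the reviewed studies may use different estimators, especially for continuous inputs.

```python
import math
from collections import Counter

def mutual_information(xs, ys):
    """Mutual information (in bits) between two discrete sequences of equal length."""
    n = len(xs)
    count_x = Counter(xs)
    count_y = Counter(ys)
    count_xy = Counter(zip(xs, ys))
    mi = 0.0
    for (x, y), n_xy in count_xy.items():
        p_xy = n_xy / n
        # p_xy / (p_x * p_y) simplifies to n_xy * n / (n_x * n_y).
        mi += p_xy * math.log2(n_xy * n / (count_x[x] * count_y[y]))
    return mi

# Hypothetical toy sample: chosen mode and two candidate features.
mode = ["car", "car", "pt", "pt", "car", "pt", "car", "pt"]
has_licence = [1, 1, 0, 0, 1, 0, 1, 0]   # perfectly aligned with mode
weekday = [0, 1, 0, 1, 1, 0, 0, 1]       # unrelated to mode in this sample

mi_licence = mutual_information(has_licence, mode)
mi_weekday = mutual_information(weekday, mode)
print(f"MI(has_licence; mode) = {mi_licence:.3f} bits")
print(f"MI(weekday; mode) = {mi_weekday:.3f} bits")
```

A feature that fully determines the mode reaches the entropy of the mode variable (1 bit for two equally frequent modes), while a feature that is independent of the mode scores zero. Ranking features by such a score is one way to arrive at lists of influential factors like those above.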

Numerous characteristics have a substantial impact on TB, so they must be considered when studying transportation decisions. Traditional statistical techniques frequently presume linear relationships, but real-world data rarely satisfy this assumption, which makes complex interactions harder to capture. To get around these constraints, we conducted a thorough systematic review to examine how ML approaches can capture nonlinear relationships that conventional methods may miss. The current review analyzes in depth the ML algorithms, datasets, model validation strategies, and tuning techniques employed in previous research. It also aims to systematically review the limitations and findings of the recent literature on predicting TMC and its determinants using DCMs, ML algorithms, and interpretable ML techniques, and to suggest the best predictive model for TMC. Moreover, based on the sensitivity analysis of the ML models, the most influential factors for TMC are investigated to help policymakers in planning and forecasting TMC demands. The two primary goals of our study are to establish the present conceptual frameworks for the factors influencing the TMC chosen for daily activities and to pinpoint methodological issues and limitations in previous research. Our findings highlight the significance of considering the factors that influence TMC. Understanding this complexity is essential for accurate analysis and for efficient policy creation that promotes sustainable transportation systems.

The review is organized as follows: Section 2 provides an overview of studies from the latest five years on modern techniques used for TMC; Section 3 discusses the methodology, namely the review methods based on the PRISMA rules and the Kitchenham and Charters approach; Section 4 presents the results and discussion of the 39 selected studies; Section 5 presents the conclusion; and Section 6 outlines future directions.

2 Literature Review

The use of different transport modes has a substantial effect on individual health outcomes, subjective well-being, and the global environment [37–39]. Conversely, van Wee and Ettema and Zhang reported that health is a capability constraint that influences transport options [40,41]. In addition, past studies concluded that planes, ships, cars, and heavy-duty vehicles are the main contributors to CO2 emissions from the transportation sector [42]. According to a 2023 survey, 73% of American respondents chose the car, underscoring the car’s crucial importance in daily life in the country, which contributes to GHG emissions [43]. Households are reported to be responsible for 72% of global GHG emissions, with car and plane mobility as the dominant component. The European Union and the US aim to neutralize CO2 emissions from the transportation sector by 2050. To achieve this aim, Zhang et al. concluded that active and public transport should be encouraged in urban areas to reduce GHG emissions [34], whereas Xu et al. claimed that electric vehicles (EVs) in Europe significantly reduce GHG emissions [44]. Moreover, Aijaz et al. studied environmental sustainability through EVs and concluded that EVs have the potential to drastically decrease emissions from the transport sector and enhance sustainability [39]. Nevertheless, GHG emissions from the transportation sector remain a concern. Therefore, it is vital to predict the TMC used for daily activities in order to promote green and sustainable transportation systems and a healthier society.

People are more inclined to switch modes if they are well informed, and mode choice behavior is an important subject when it comes to improving the overall sustainability of transportation networks [18]. Therefore, it is crucial to identify the variables that most strongly affect TMC in order to develop a more sustainable transport system [45]. Many researchers describe factors that influence TMC, such as the BE, socio-demographic and economic variables, travel behaviors, availability, accessibility, connectivity, time, weather conditions, purpose of travel, information and technology (IT), traffic congestion, safety, parking availability, and governmental policies. Studies from around the globe examine travel behavior, the BE, and TMC for work, school, and daily commuting. In addition, most researchers compared different countries and interpreted their transportation systems, TMC or travel behaviors, and the determinants of TMC.

The factors influencing the TMC are greatly impacted by changes in urban infrastructure and policies [46]. These dynamics can be well captured by ML models, particularly when temporal, geographical, and policy data are included [47]. Through the incorporation of dynamic elements like travel duration, expenses, and ease of use into flexible models, ML techniques can assist city planners in forecasting how transportation patterns will change in reaction to upcoming policy changes and infrastructure enhancements [29]. Insightful, data-driven transport planning is made possible by the combination of sophisticated ML techniques with solid datasets, despite obstacles like data availability and model interpretability [24,48].

Buehler conducted a comparative analysis of Germany and the USA for the determinants of TMC and concluded that the USA is more car-dependent than Germany, whereas Germans are more likely to cycle, walk, and use PT [11]. Bresson et al. performed a comparative analysis of England and France and studied the determinants of demand for PT. They concluded that PT demand is relatively sensitive to fares, which are the main determinant for individuals choosing PT over a car. Reducing (subsidizing) fares played a substantial role in encouraging individuals to choose PT, thus reducing the use of private cars [49]. Papaioannou et al. investigated how connectivity and accessibility affected PT. They concluded that while a system’s accessibility might stimulate PT use, a particular trip’s lack of connectivity could discourage it. Using PT instead of a private vehicle therefore appears to require both greater accessibility and trip-specific connectivity [50]. In addition, Wolday studied the effect of BE attributes on active transport in small cities and concluded that the frequency of walk/bike trips is significantly influenced by accessibility and attitude toward active travel [51].

Moreover, Harbering et al. studied the determinants of TMC for Mexico and concluded that slow modes such as cycling and walking are affected by distance from the city center, whereas mass rapid transit is affected by infrastructure. In terms of socioeconomic characteristics, women and younger people are more inclined to use PT instead of a private vehicle. Individuals with higher education are more dependent on cars, and the availability of cars is negatively associated with all other transport modes [17]. Using binary logistic regression models, Wójcik studied possible factors affecting the decisions made by citizens of Łódź, Poland, about the transport mode they use for everyday travel. He concluded that respondents’ sociodemographic traits and household car ownership had the greatest impact on TMC. Furthermore, a statistically significant correlation was found between geographic distances and subjective evaluations of PT. The factors influencing the choice of private transport differed from those influencing the choice of PT [35].

Convery and Williams studied the determinants of TMC for non-commuters by considering land use, the role of transport, and socio-demographic characteristics using bivariate statistical analysis. They concluded that vehicle ownership and income are key influences on TB patterns. Additionally, the comparatively low use of cars outside of the inner city core suggests scope for initiatives that offer alternatives to driving [18]. Using data envelopment analysis and Tobit regression for efficiency, Matulova and Tomes investigated the factors influencing urban PT efficiency in the Czech Republic. They concluded that certain factors, such as the average vehicle age, total vehicle kilometers, the existence of tramlines in the city, the percentage of drivers, and population density, increase efficiency, while other factors, such as the percentage of revenue subsidies, ticket prices, and the existence of a two-city system, decrease efficiency [16]. Sultana investigated the variables influencing parents’ selection of active transportation options for their children’s school-related commutes. The study used 13 explanatory variables and concluded from the binary logistic regression model that gender, age, the distance between school and home, household size, home ownership status, household drivers, household vehicle ownership, and population density all play a significant role in parents’ decisions to send their children to school on foot [52].

Owing to technological advances, ML techniques are increasingly preferred by researchers over conventional techniques such as MNL models for predicting work-related TMC, accurately forecasting travel demand, and achieving sustainability goals. For instance, Aghaabbasi et al. searched for the ideal setting of the hyperparameters (which has an immediate impact on the model’s performance) and investigated ML methods tuned with a Bayesian Optimization (BO) algorithm to forecast the TMC for work. These methods include SVM, k-NN, single DT, ensemble DT, and NB. They concluded that BO was most effective at enhancing the performance of the k-NN model [30]. With a focus on GBT, RF, and MNL models, Pineda-Jaramillo et al. analyzed various logit and ML models to forecast TMC and identify the factors that influence TB in an urban setting. They concluded that GBT models outperformed the other models compared and that the factors explaining TMC include age, gender, travel time, household motorized vehicles (cars and motorcycles), and the type of parking available at the destination [26]. Wang et al.’s research on TMC performance shows that when the dataset is imbalanced, the XGB model outperforms the MNL model in terms of prediction accuracy. Furthermore, they reported that although mode-specific travel time is the primary determinant of TMC, people’s TMC is also substantially correlated with other trip characteristics, sociodemographic factors, and BE variables [25].

The performance of ML models is assessed using classification metrics, namely the area under the curve (AUC), accuracy, precision, recall, and F1-score. The classification is evaluated by comparing the model’s predicted values with the actual values. As shown in Eq. (1), accuracy is defined as the ratio of correctly predicted instances to all instances. Precision is the share of true positives among all instances predicted as positive (true positives plus false positives). Qian et al. studied the classification of imbalanced TMC to work using an adjustable kernel SVM model. They compared their results with recent and past studies using simple SVM models, as shown in Table 1, and concluded that the adjustable kernel SVM model outperforms them and raises the model accuracy to 99.81%. From the sensitivity analysis based on the mutual information (MI) test method, they concluded that household drivers and age are the most influential factors for TMC [22]. Aghaabbasi et al. investigated the impact of employees’ sociodemographic characteristics and living environment on active transportation (AT) using DT techniques, and concluded that the availability and coverage of bike lanes, sidewalks, and transit stations were the most crucial factors in how frequently employees used AT modes to travel to work, shop, and pursue leisure activities [53].

$$\text{Accuracy} = \frac{TP + TN}{TP + TN + FP + FN} \times 100 \tag{1}$$

where TP, TN, FP, and FN are the numbers of true positives, true negatives, false positives, and false negatives, respectively.
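The metrics named above can all be derived from the four confusion counts. The sketch below is a minimal illustration for a binary case with hypothetical counts; it returns fractions, so multiplying the accuracy by 100 gives the percentage form of Eq. (1).

```python
def classification_metrics(tp, tn, fp, fn):
    """Compute accuracy, precision, recall and F1 (as fractions) from confusion counts."""
    total = tp + tn + fp + fn
    accuracy = (tp + tn) / total
    # Guard against empty denominators, e.g. when no positives are predicted.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (
        2 * precision * recall / (precision + recall)
        if (precision + recall)
        else 0.0
    )
    return {"accuracy": accuracy, "precision": precision, "recall": recall, "f1": f1}

# Hypothetical confusion counts for one mode (e.g. "car" versus "not car").
metrics = classification_metrics(tp=40, tn=45, fp=5, fn=10)
for name, value in metrics.items():
    print(f"{name}: {value:.4f}")
```

For multi-class TMC problems these quantities are usually computed per mode and then averaged. On imbalanced datasets, accuracy alone can look high even when a rare mode is never predicted, which is why precision, recall, and F1 are reported alongside it.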

In addition to conventional ML techniques, recent studies applied interpretable ML models for the prediction of TMC and travel behaviors. Kashifi et al. predicted TMC using five different interpretable ML models (LR, RF, DT, Multilayer Perceptron, LightGBDT). They used three years of Dutch National Travel Survey data and concluded that LightGBDT outperformed the other models. In addition, they carried out analyses of variable importance and SHAP dependency to address the issue of ML models being a “black box” and to enhance their interpretability. The investigation revealed that factors such as travelers’ age and annual income, trip distance, trip density, and the number of vehicles or bicycles they possess are major determinants of their TMC [31]. Kim’s research indicates that the interpretability of ML in TMC modeling is hindered by its opaque character, which makes it challenging to find a plausible explanation for the relationship between input and outcome variables. Consequently, he suggested an interpretable ML technique to address this issue. After applying the XGB model, he concluded that XGB performed better than the other ML models in terms of variable importance, variable interaction, and accumulated local effects (ALE). Furthermore, he reported that the number of tour trips taken and age were major factors in determining TMC, whereas the connected trip and tour-related variables had a substantial impact on predicting TMC [32]. Zhao et al. employed an interpretable ML technique to explore the heterogeneity in mode-switching behavior. They first built a high-accuracy classifier that naturally captures the individual heterogeneity contained in the data to forecast mode-switching behavior under a hypothetical Mobility-on-Demand Transit system. To study response heterogeneity, they proposed two novel model-independent ML interpretation tools, namely conditional individual partial dependence plots and conditional partial dependence plots. They concluded that a machine-learning classifier used together with model-independent interpretation tools can provide important information about mode switching in travel. In addition, existing transit users are normally willing to share rides but unwilling to accept any extra transfers, and current drivers are more sensitive to additional pickups than individuals using other means of transportation [33]. A summary of the articles that use conventional techniques to study the determinants of TMC is presented in Table 2, and the studies that use modern techniques for TMC prediction are listed in Table 3.
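The idea behind partial dependence tools can be sketched without any ML library. The example below is a generic illustration, not the exact procedure of [33]: `toy_model` is a hypothetical stand-in for a fitted classifier, the traveller records are invented, and the "conditional" variant simply restricts the averaging to a subgroup.

```python
def toy_model(traveller):
    """Hypothetical stand-in for a fitted classifier: probability of choosing transit."""
    score = 0.9 - 0.2 * traveller["transfers"] - 0.005 * traveller["age"]
    return min(1.0, max(0.0, score))

def individual_curves(model, data, feature, grid):
    """One curve per traveller: predictions as `feature` sweeps over `grid`."""
    return [[model({**row, feature: value}) for value in grid] for row in data]

def partial_dependence(model, data, feature, grid):
    """Average the individual curves to obtain the partial dependence curve."""
    curves = individual_curves(model, data, feature, grid)
    return [sum(curve[i] for curve in curves) / len(curves) for i in range(len(grid))]

def conditional_partial_dependence(model, data, feature, grid, condition):
    """Partial dependence restricted to the travellers satisfying `condition`."""
    subset = [row for row in data if condition(row)]
    return partial_dependence(model, subset, feature, grid)

# Hypothetical sample of travellers.
travellers = [
    {"age": 25, "transfers": 0, "current_mode": "transit"},
    {"age": 40, "transfers": 1, "current_mode": "car"},
    {"age": 60, "transfers": 0, "current_mode": "car"},
]
grid = [0, 1, 2]
pd_all = partial_dependence(toy_model, travellers, "transfers", grid)
pd_drivers = conditional_partial_dependence(
    toy_model, travellers, "transfers", grid, lambda row: row["current_mode"] == "car"
)
print("all travellers:", [round(v, 3) for v in pd_all])
print("current drivers:", [round(v, 3) for v in pd_drivers])
```

Because the functions only call the model on modified inputs, they work with any classifier, which is what makes such tools model-independent. Comparing the curve for a subgroup with the curve for the whole sample is one way to expose heterogeneous responses.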

3 Methodology

Two established approaches for conducting a systematic literature review are (1) the Kitchenham and Charters (2007) approach and (2) the PRISMA approach. The current study mainly used the PRISMA approach because of its established reputation and extensive use in various fields. Both methods are briefly described below.

3.1 Kitchenham and Charters Approach

This approach addresses the three stages of a systematic literature review: preparation, execution, and reporting. It is a generally acknowledged procedure for carrying out systematic reviews, offering a strict and organized process for locating, assessing, and combining research findings. Furthermore, the technique provides complete rules and checklists that support a thorough and transparent review process. This allows the review methodology to be replicated, which strengthens the study’s credibility. The present review focuses mainly on the methodological aspects of recent and past studies rather than on their outcomes, and the approach does not specify the detailed mechanisms for performing meta-analysis [57]. Therefore, the quality of the research is not assessed, and of the ten steps, the remaining nine were considered for the review process, as shown in Fig. 1.

Figure 1: Kitchenham and charters review process for a systematic literature review

3.2 PRISMA Approach

Many researchers worldwide perform systematic literature reviews using the PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) approach, which is easy to use and includes a four-phase flow diagram and a 27-item checklist [58]. The 27-item checklist is composed of title (1), abstract (1), introduction (2), methods (12), results (7), discussion (3), and funding (1), whereas the four-phase diagram contains identification, screening, eligibility, and final inclusion, as shown in Fig. 2. Following the 27-item checklist and the four-phase flow diagram makes it easier for researchers to retrieve the important information from research articles when conducting a systematic review.

Figure 2: PRISMA statement for the systematic literature review

The primary aim of the current review is to examine and summarize studies that focus on the determinants of TMC using conventional and ML techniques. The review addresses the following questions.

Review Question # 1: Do machine learning algorithms outperform the conventional techniques for predicting the determinants of TMC?

Review Question # 2: Which ML techniques have been used to determine the determinants of TMC?

Review Question # 3: What are the characteristics of the datasets used to determine the determinants of TMC?

Review Question # 4: How are ML models’ performances evaluated?

3.3 Procedure for Review

The four phases were followed throughout the review process. The search approach (identification), the inclusion criteria (screening and eligibility), and the data retrieval procedure are presented here.

3.3.1 Search Approach

The search covered the Scopus and Web of Science (WoS) databases, supplemented by Google Scholar. Academic articles (journal and conference papers) written in English were considered. Structuring the search query was challenging because ML, TMC, and TB are broad areas. To obtain specific, relevant, and up-to-date articles for the analysis, the current systematic review adopted the following procedure. First, the phrases or keywords from Table 4 were used to conduct a comprehensive search in the Scopus and WoS databases. Second, the keywords from the table were combined (for example, TMC and ML algorithms, modern and conventional techniques, transport and environment, transport and health, and DCMs and TMC) and searched within article titles, abstracts, and keywords, which returned 2941 articles. This set of 2941 articles was narrowed down using the filter options of the search engine; for instance, filters were applied to select articles that used ML techniques, were published in the latest five years, and were written in English. In addition, irrelevant publications were discarded through manual examination, leaving 56 articles. The objectives, proposed methods, contributions, findings, and future recommendations of these 56 papers were carefully examined by hand by going over the complete contents of the publications. Finally, after each article was assessed by reading its abstract, methods, results, and conclusions, the 46 most recent articles (2017–2023) were collected for the study, of which 39 were chosen for review after a further manual review of the publications.
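The screening funnel described above can be tallied with a short script. The counts are those reported in this section; the stage labels are paraphrases introduced here for illustration and are not the official PRISMA phase names.

```python
# Record counts reported in the search procedure above.
stages = [
    ("Records returned by the combined keyword search", 2941),
    ("Remaining after filters and manual relevance check", 56),
    ("Collected after reading abstract, methods, results, conclusions", 46),
    ("Included in the review", 39),
]

def exclusions(stage_counts):
    """Return (stage label, number excluded to reach it, share retained) per step."""
    rows = []
    for (_, previous), (label, current) in zip(stage_counts, stage_counts[1:]):
        rows.append((label, previous - current, current / previous))
    return rows

funnel = exclusions(stages)
for label, dropped, retained in funnel:
    print(f"{label}: {dropped} excluded, {retained:.1%} retained")
```

Reporting the number excluded at every step, as a PRISMA flow diagram does, lets readers check that the counts add up from identification to final inclusion.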

3.3.2 Inclusion Procedure and Requirements

Several criteria were set for including articles in the review after the query search: (1) only journal and conference articles published in English between 2018 and 2023 were considered, plus one article from 2017 that was included because of its relevance to the scope of the study; (2) studies that model TMC and examine the correlation between several endogenous and exogenous variables; (3) studies that identify the determinants of TMC; and (4) studies that used ML techniques for the prediction, modeling, and correlation of TMC. The PRISMA approach, with its four phases, was used to choose the papers for final inclusion.

To compare the performance of ML models across different datasets for predicting TMC, a systematic approach should be adopted so that the evaluation is consistent, transparent, and meaningful. For the model comparison, the datasets were examined for specific variables such as socio-demographic data (age, income, occupation), geographic or quantitative data (distance, origin, destination), real-time or qualitative variables (traffic, weather, public transport availability), TMC for different purposes, and the ML algorithms applied. It is common to test all models and compare their performance using cross-validation and metrics such as accuracy, precision, recall, and F1-score to decide which model is optimal. Therefore, the performance of several ML models is compared using performance evaluation metrics and k-fold cross-validation (with k = 2, 3, 5, or 10) to suggest the best predictive model. This process supports a fair and thorough comparison of ML models across different datasets for TMC prediction.
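The k-fold procedure referred to above can be sketched as follows. This is a minimal illustration, not the protocol of any reviewed study: `fit_and_score` is a hypothetical user-supplied callable, and shuffling or stratification (which matters for imbalanced mode shares) is omitted for brevity.

```python
def k_fold_indices(n_samples, k):
    """Split range(n_samples) into k contiguous folds of near-equal size."""
    fold_sizes = [n_samples // k + (1 if i < n_samples % k else 0) for i in range(k)]
    folds, start = [], 0
    for size in fold_sizes:
        folds.append(list(range(start, start + size)))
        start += size
    return folds

def cross_validate(n_samples, k, fit_and_score):
    """Average a score over k train/test splits.

    `fit_and_score(train_idx, test_idx)` is a hypothetical callable that trains
    on the training indices and returns a score measured on the test indices.
    """
    folds = k_fold_indices(n_samples, k)
    scores = []
    for i, test_idx in enumerate(folds):
        # Every fold except the i-th one is used for training.
        train_idx = [j for m, fold in enumerate(folds) if m != i for j in fold]
        scores.append(fit_and_score(train_idx, test_idx))
    return sum(scores) / k

# Placeholder scorer: returns the test-fold share of the data, only to exercise the loop.
mean_score = cross_validate(
    10, 5, lambda train, test: len(test) / (len(train) + len(test))
)
print(k_fold_indices(10, 3))
print(mean_score)
```

Each observation is used for testing exactly once, so the averaged score is less dependent on a single lucky or unlucky split. Larger k leaves more data for training in each round at the price of more model fits.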

3.3.3 Data Retrieval Approach

Based on the research questions, the data were retrieved from the articles and compiled in Table 5 so that the information gathered is concrete, measurable, and well defined and so that biases during data collection are avoided. After a thorough review of each paper, it was assessed which research question the paper related to, in order to gather the specific information.


4 Results and Discussion

A summary of the 39 papers selected for the current study, along with their references and identifiers, is presented in Table 6. The following subsections report the source and the year of publication so that readers can easily follow the review process.

4.1 Articles Source

As can be seen in Table 7, the articles come from different publishers, such as Elsevier, Springer, SAGE, Hindawi, and MDPI, and from conferences. Directly after the Scopus and WoS search, peer-reviewed journals accounted for 87% of the articles and conference proceedings for 13%, as shown in Fig. 3. After manual examination of the publications, 36 of the 39 articles (92.3%) are from peer-reviewed journals and 3 (7.7%) are conference proceedings. The most prominent and well-known journals, Travel Behavior and Society, IEEE Access, and Transportation Research Part C: Emerging Technologies, contributed 27.10% in total.


Figure 3: Number of articles from different sources

4.2 Year and Country of Publication

The current systematic review considers articles published from 2018 to 2023, plus one highly cited and highly relevant article from 2017, as shown in Fig. 4. There was a gradual increase in the number of articles that used ML tools and algorithms for assessing travel behavior, TMC, and the determinants of TMC. The highest number of articles (10) was published in 2023, which reflects the growing use of ML algorithms for predicting the determinants of TMC and their reported advantage over conventional models. By country, the US accounts for the highest number of articles (11), as shown in Fig. 5, followed by China (9) and the UK (6), out of the 39 articles in the current review.


Figure 4: Yearly basis distribution of articles considered in the systematic review


Figure 5: Number of articles distributed by Country

4.3 Where Were ML Techniques Used to Determine the Determinants of TMC?

RQ1. a. ML application domains

ML techniques are used in several areas in which they are reported to outperform conventional statistical techniques and improve TMC modeling. Across the 39 reviewed articles, this study identified five application domains in which ML and conventional techniques are used to investigate TMC, as shown in Table 8: (1) TMC, (2) TB, (3) active transport, (4) shared mobility, and (5) BE. Twenty-three ML-based investigations addressed TMC and nine addressed TB, whereas four addressed active transport (for example, women cyclists and cycling to school), two addressed BE, and two addressed shared mobility. Because most ML approaches have been applied to predicting TMC and TB and identifying their determinants, the high number of studies in this field is the motivation for this systematic review. Most of the studies claimed that ML techniques outperformed conventional techniques and that hyperparameter-optimized ML algorithms outperformed typical ML algorithms. Some ML algorithms were used to classify imbalanced travel-mode-choice-to-work data, while others were used to predict TMC and to rank the feature importance of the input variables in order to find the most influential factors. Most of the studies claimed that total travel time, number of household vehicles, and income are the most influential factors, while others claimed that age, gender, activity type, parking, and trip purpose were the most significant features.


4.4 RQ2. Which ML Techniques Have Been Used to Determine the Determinants of TMC?

This section gives a synopsis of the ML approaches used in the 39 studies. Several types of ML algorithms are used to determine TMC in different countries, for both urban and rural areas, where different determinants influence TMC.

RQ2. a. Utilization of ML algorithms

Past studies were limited to conventional techniques such as structural equation modeling, bivariate and multivariate analysis, and regression analysis using SPSS, AMOS, and R [92–95]. With the development of modern techniques, recent studies use ML and AI techniques such as ANN, BN, k-NN, XGBT, GBT, DT, FT, SVM, GE, and GEP [96–98]. Because of the limitations of conventional techniques and of standard ML algorithms, and because the data may be linear or non-linear, studies also use integrated and interpretable ML techniques, including deep and reinforcement learning, to identify the factors affecting TMC and to improve model efficiency [31,32,99]. Therefore, based on the current literature, the approaches are categorized into three groups: conventional techniques, ML algorithms, and interpretable ML, as shown in Fig. 6. RF is one of the most widely used ML algorithms in TB research for TMC, BE, AT, and shared mobility, followed by ANN and interpretable ML algorithms. Conventional techniques are mostly used for the prediction of TMC and TB, whereas ML algorithms and interpretable approaches are widely used for BE, active transport to promote sustainability, TB, shared mobility, and TMC. Table 9 summarizes the ML techniques used for TMC, TB, active transport, BE, and shared mobility in the selected studies. Among all studies, only 10 (25.6%) used conventional techniques, whereas 74.4% used ML algorithms, among which random forest (RF) was the most frequently employed approach (18 studies, about 46% of all studies); 20.5% of the studies applied interpretable ML algorithms. The number of studies using ML techniques increased gradually in 2022/2023, with particular attention to extreme gradient boosting trees (XGBT) and RF, which together account for 19 of the 39 studies. The outcomes of these models show that ML techniques outperformed conventional techniques, and that interpretable ML algorithms outperformed plain ML approaches because integrated and interpretable approaches turn the black box into a white box and thereby support TMC decisions.


Figure 6: Number of selected studies that used conventional, ML, and interpretable techniques


RQ2. b. Interpretable ML techniques

Because datasets may be linear or non-linear and because standard ML techniques have limitations, most researchers applied interpretable ML techniques to address the black-box problem. Interpretable ML techniques can turn the black box into a white box while taking the nature of the dataset into account. RF, GBT, XGBT, and SVM were the most commonly employed algorithms in the selected studies. RF and GBT were the most popular algorithms for evaluating the impact of many independent variables, including safety, BE, sociodemographic characteristics, and journey time, on travel time and distance. However, because GBT is trained sequentially rather than in parallel, it can be inefficient on very large datasets, and it is prone to overfitting. RF, in turn, relies on orthogonal (axis-aligned) decision boundaries when used for regression analysis, which can produce less-than-ideal outcomes.
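The sequential nature of gradient boosting and the axis-aligned splits of tree models can be illustrated with a minimal sketch. It is a simplified, assumed version of boosting with one-split trees (stumps) on a single feature, not the implementation used in any reviewed study; the data and parameter values are hypothetical.

```python
def fit_stump(xs, residuals):
    """Fit a one-split regression tree: a single threshold on one feature.

    The split 'x <= threshold' is an axis-aligned (orthogonal) boundary.
    """
    best = None
    for threshold in sorted(set(xs))[:-1]:  # keep both sides non-empty
        left = [r for x, r in zip(xs, residuals) if x <= threshold]
        right = [r for x, r in zip(xs, residuals) if x > threshold]
        left_mean = sum(left) / len(left)
        right_mean = sum(right) / len(right)
        sse = (sum((r - left_mean) ** 2 for r in left)
               + sum((r - right_mean) ** 2 for r in right))
        if best is None or sse < best[0]:
            best = (sse, threshold, left_mean, right_mean)
    _, threshold, left_mean, right_mean = best
    return lambda x: left_mean if x <= threshold else right_mean


def gradient_boost(xs, ys, n_rounds=20, learning_rate=0.5):
    """Squared-error boosting: each stump is fitted to the current residuals."""
    base = sum(ys) / len(ys)
    stumps = []
    preds = [base] * len(ys)
    for _ in range(n_rounds):
        # This round depends on the predictions of all earlier rounds,
        # so the stumps cannot be trained independently of each other.
        residuals = [y - p for y, p in zip(ys, preds)]
        stump = fit_stump(xs, residuals)
        stumps.append(stump)
        preds = [p + learning_rate * stump(x) for x, p in zip(xs, preds)]
    return lambda x: base + learning_rate * sum(s(x) for s in stumps)


def mean_squared_error(model, xs, ys):
    return sum((y - model(x)) ** 2 for x, y in zip(xs, ys)) / len(ys)


# Assumed toy data: trip distance (km) and travel time (min).
distances_km = [1, 2, 3, 4, 5, 6]
times_min = [5, 7, 12, 15, 21, 24]

mean_time = sum(times_min) / len(times_min)
baseline_mse = mean_squared_error(lambda x: mean_time, distances_km, times_min)
boosted = gradient_boost(distances_km, times_min)
boosted_mse = mean_squared_error(boosted, distances_km, times_min)
```

In a random forest, by contrast, each tree is grown on its own bootstrap sample without reference to the other trees, which is why the trees can be built in parallel.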

4.5 RQ3. What Are the Characteristics of the Datasets Used to Determine the Determinants of TMC?

This section describes the characteristics of the datasets used in the 39 articles, such as the description and size of the data gathered for the analysis and for the correlation between variables. It focuses mainly on the targeted variables, that is, the determinants of TMC, the ML techniques, and the resulting TMC. The unit of analysis, the data size, and the data availability statement are also discussed. Several studies used separate sources for the target variables and for TMC; therefore, the data sources of the targeted variables and of the TMC variables are described separately.

RQ3. a. Characteristics and size of the dataset

Table 10 presents the characteristics and the sample sizes of the 39 selected articles. The review found that three types of data sources are used for TMC and its influencing variables: National, Local, and Departmental Data (NLDD), Academic Data (AD), and Company Data (CD). Among the 39 studies, 24 (about 61.5%) used NLDD sources, 13 (33.33%) used AD, and only 2 (5.13%) used CD.
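The shares quoted above follow directly from the study counts. The short check below recomputes them; the variable names are illustrative only.

```python
# Study counts per data source type, as reported in the text.
source_counts = {"NLDD": 24, "AD": 13, "CD": 2}

total_studies = sum(source_counts.values())
shares_percent = {
    source: round(100 * count / total_studies, 2)
    for source, count in source_counts.items()
}
```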


The reviewed studies used multiple units of analysis, including individuals, households, respondents, trips, travel (air, travel diary), adults, children, women, transport routes, employees, drivers, passengers, and school students. For this review, these units are grouped into individuals, households, trips, travel, and adults. Among the 39 studies, 13 (33.33%) used individuals as the unit of analysis, 5 (12.82%) used household survey data, 7 (17.95%) used trips, 5 (12.82%) used travel data, and the remaining 9 (about 23%) used adults.

Regarding data size, only one study (N3, on women cyclists) used a sample of fewer than 100; 4 articles, including one review article, used fewer than 500 samples; 6 studies used fewer than 1000 samples; and the remaining 28 studies used more than 1000 samples.

RQ3. b. Data availability statements—freely available?

For each selected article, it was checked whether the data used in the study are freely available to the public. Data availability is categorized into four groups: data available (DA), data unavailable (Dua), available on request from the corresponding or any other author (AoR), and availability statement not mentioned in the article (NE). Most of the research data are not freely available because of institutional policies or confidentiality. Nine articles used data that are publicly available and accessible to all researchers. Only two studies used CD, and 16 studies did not include a data availability statement. Eight studies stated that the data are AoR from the corresponding author(s). Four studies used NLDD data and made them available on request:

• Chongqing Urban Resident Travel Survey from 2014.

• Onderweg in Nederland data by Centraal Bureau voor de Statistiek (CBS) and Rijkswaterstaat (RWS-WVL), Netherlands, 2019 and 2020.

• Annual National Travel Survey (NTS) data of the UK from 2005 to 2016, which are publicly provided by the Department for Transport.

• 2016 National Household Travel Survey (NHTS) dataset in Seoul, Korea.

RQ3. c. TMC variables

TMC was determined from several variables, such as gender, income, distance, purpose, safety, time, household vehicle ownership, available transport modes, accessibility to public transportation, BE variables, and weather conditions. All of these variables influence TMC directly or indirectly, depending on the country, situation, and type of available data. For instance, safety and security have a significant impact on women cyclists, whereas the built environment affects travel behavior through the 5Ds: design, density, destination accessibility, diversity, and distance to transit.

RQ3. d. TMC dataset

As can be seen in past studies, the sources of the targeted variables and of the TMC data differ. Twenty-five studies explain TMC across three transport modes: private vehicles, public transport, and active transport. These studies gathered data from public databases to determine TMC using statistical tools and ML techniques. Almost every study applied different ML techniques and algorithms to determine TMC in different countries and compared the results with conventional techniques; the ML techniques were consistently reported to outperform them and to improve model efficiency, which helps policymakers develop policy based on the ML outcomes. In most models, the coefficient of determination was over 0.95 (95%), which indicates a high goodness of fit.

4.6 RQ4. How Are ML Models’ Performances Evaluated?

RQ4. a. Approach for validation

For model validation of the ML algorithms, past studies used several cross-validation (CV) methods, such as k-fold CV, 3-fold CV, 5-fold CV, and 10-fold CV, as well as holdout validation. Some studies used both k-fold and holdout validation, whereas others used k-fold and 10-fold CV. Table 11 lists the validation approaches: nine studies used k-fold CV, two used 3-fold CV, four used 5-fold CV, and five used 10-fold CV. In addition, three studies used both k-fold and 10-fold CV, and two used k-fold and 5-fold CV. Most of the studies used an 80:20 train-test split, whereas some used 70:30 and others 90:10. Only one study (N35) compared three train-test ratios (60:40, 70:30, and 80:20) and concluded that the 80:20 split gave the highest accuracy. Of the 39 selected studies, nineteen did not report their validation approach.
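A holdout split with the ratios mentioned above can be sketched as follows. The function name, the seed, and the placeholder records are assumptions made for illustration; the reviewed studies do not specify how their splits were drawn.

```python
import random


def holdout_split(records, test_fraction=0.2, seed=42):
    """Shuffle a copy of the records and split it into (train, test)."""
    shuffled = list(records)
    random.Random(seed).shuffle(shuffled)
    n_test = round(len(shuffled) * test_fraction)
    return shuffled[n_test:], shuffled[:n_test]


# Assumed placeholder data: 1000 trip records identified by index.
trips = list(range(1000))

split_sizes = {}
for label, fraction in (("60:40", 0.4), ("70:30", 0.3), ("80:20", 0.2)):
    train, test = holdout_split(trips, test_fraction=fraction)
    split_sizes[label] = (len(train), len(test))
```

Using the same seed for every ratio keeps the shuffles identical, so differences in accuracy between ratios are not confounded by a different random ordering.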


RQ4. b. Model performance evaluations

The performance of the models in the 39 selected studies is assessed using several evaluation techniques. Different performance criteria are used to assess the relationship among TB, BE, TMC, and its determinants. Table 12 lists the evaluation process used in each study to assess model performance. The current review considers only the performance criteria and analyses shown in Tables 12 and 13; other important measures that could be used to assess ML model performance are not covered because they were not reported in the selected studies. Ten studies did not report their performance criteria or ML model performance.


The relationship between two variables was assessed using the linear correlation coefficient (R) or the coefficient of determination (R2). An R2 value of 10%–20% is considered satisfactory in travel behavior research. Furthermore, the Mean Absolute Percentage Error (MAPE), Mean Absolute Error (MAE), Mean Squared Error (MSE), and Root Mean Squared Error (RMSE) are used to quantify the deviation between the actual and predicted values. As its name indicates, MAPE is a percentage-based measure, while MAE, MSE, and RMSE are absolute measures. For classification tasks, precision (PRE), the share of predicted positive cases that are actually positive, and classification accuracy, the share of all cases that are predicted correctly, are used. Classification accuracy is abbreviated AUC in this review, although AUC more commonly denotes the area under the ROC curve. Among the 39 selected studies, nine (N4, N5, N7, N10, N12, N14, N23, N25, N33) used both AUC and PRE, whereas fifteen used AUC to measure the correctly predicted cases.
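For reference, the standard textbook definitions of these error measures are given below. The notation is an assumption introduced here: $y_i$ is the observed value, $\hat{y}_i$ the predicted value, $\bar{y}$ the mean of the observed values, and $n$ the number of observations.

```latex
\begin{align}
\mathrm{MAE}  &= \frac{1}{n}\sum_{i=1}^{n}\left|y_i-\hat{y}_i\right| \\
\mathrm{MSE}  &= \frac{1}{n}\sum_{i=1}^{n}\left(y_i-\hat{y}_i\right)^{2} \\
\mathrm{RMSE} &= \sqrt{\mathrm{MSE}} \\
\mathrm{MAPE} &= \frac{100\%}{n}\sum_{i=1}^{n}\left|\frac{y_i-\hat{y}_i}{y_i}\right| \\
R^{2} &= 1-\frac{\sum_{i=1}^{n}\left(y_i-\hat{y}_i\right)^{2}}{\sum_{i=1}^{n}\left(y_i-\bar{y}\right)^{2}}
\end{align}
```

MAE, MSE, and RMSE carry the units of the target variable (or its square), which is why they are called absolute measures, whereas MAPE and $R^{2}$ are unit-free.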

In addition, four studies (N3, N4, N10, N11) employed the linear correlation or the coefficient of determination (R/R2), whereas four studies (N3, N8, N11, N133) used the absolute measures RMSE, MSE, and MAE. Only one study (N3) employed the percentage-based MAPE criterion. Table 13 presents the performance of the ML models employed in the selected studies. Several studies split the data into training and testing sets and report different model performance and R2 values for each. In some cases the models achieved high values of 90%–99% (0.90–0.99), while in others they achieved lower values of 20%–30% (0.20–0.30). Several models, such as SVM, DT, RF, XGBT, NB, MNL, ANN, NN, KNN, AdaBoost, XGBoost, and LightGBM, are used to demonstrate predictive performance. For RF, the R2 values for both training and testing in two studies (N3 and N11) were the highest among the compared models (0.91 and 0.58), whereas in one study (N4) the R2 values of SVM were higher than those of RF.

5 Discussion

Regression performance was checked through the coefficient of determination (R2), RMSE, MSE, MAE, and MAPE, and classification performance through AUC, accuracy, precision, F1-score, recall, and MCC, for both the training and the testing data. Some of the studies used 70% training and 30% testing data, while others used 80:20 and 90:10. Most of the studies claimed that 80% training and 20% testing data give the best performance. In addition, several cross-validation schemes, such as 3-fold, 5-fold, and 10-fold, were used during model analysis. Moreover, some studies applied both conventional and modern techniques and compared the models based on statistical measures such as R2 and significance level. Most conventional models had an R2 below 50%, whereas modern techniques sometimes reached over 95%.
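The classification measures listed above can all be derived from the four cells of a binary confusion matrix. The sketch below uses the standard definitions; the counts in the example are invented for illustration and do not come from any reviewed study.

```python
import math


def binary_metrics(tp, fp, fn, tn):
    """Accuracy, precision, recall, F1, and MCC from confusion-matrix counts."""
    total = tp + fp + fn + tn
    accuracy = (tp + tn) / total
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return {
        "accuracy": accuracy,
        "precision": precision,
        "recall": recall,
        "f1": f1,
        "mcc": mcc,
    }


# Hypothetical test set: "car" is the positive class, "not car" the negative.
scores = binary_metrics(tp=40, fp=10, fn=5, tn=45)
```

Unlike accuracy, MCC uses all four cells symmetrically, so it remains informative when one travel mode dominates the sample.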

Several ML models were also compared based on the classification measures and model performance evaluations. Some of the ML models give an accuracy of over 90%, while others are below 80%. Most of the models have precision values over 80%, with RF outperforming the rest. Almost all studies claim that modern techniques outperform conventional techniques, and that interpretable ML algorithms outperform typical ML algorithms because the latter are often limited to binary classification and cannot handle imbalanced or multidimensional datasets well. An adjusted-kernel SVM maps a complex dataset into a higher-dimensional space, where the data points are easier to separate, which simplifies the decision boundaries for non-linear problems. The kernel SVM can also handle optimization problems with multiple classes and variables.
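The idea of mapping data into a higher-dimensional space can be shown with a very small example. This sketch uses an explicit quadratic feature map as a stand-in for a kernel (a kernel SVM computes the equivalent inner products implicitly); the data, labels, and function names are assumptions for illustration.

```python
def feature_map(x):
    """Explicit quadratic feature map phi(x) = (x, x^2)."""
    return (x, x * x)


def separable_by_threshold(values, labels):
    """True if a single threshold puts each class on its own side."""
    pos = [v for v, label in zip(values, labels) if label == 1]
    neg = [v for v, label in zip(values, labels) if label == -1]
    return max(neg) < min(pos) or max(pos) < min(neg)


# Toy one-dimensional data: class +1 lies on both sides of class -1,
# so no single threshold on x can separate the classes.
xs = [-3.0, -2.5, -0.5, 0.0, 0.5, 2.5, 3.0]
labels = [1, 1, -1, -1, -1, 1, 1]

separable_in_input_space = separable_by_threshold(xs, labels)

# After the mapping, the second coordinate x^2 separates the classes linearly.
squared = [feature_map(x)[1] for x in xs]
separable_in_feature_space = separable_by_threshold(squared, labels)
```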

Lack of interpretability is one problem with ML models, especially black-box methods like deep learning. In contrast, the great strength of DCMs is their interpretability, which stems from their utility maximization foundation. However, DCMs may overlook complicated relationships between trip duration, cost, and convenience that ML models can capture. Hybrid models that merge DCMs with ML models such as RF or GBMs can combine the predictive capacity of ML with the interpretability of traditional models. The utility function of a DCM can then receive the output of the ML models to improve prediction while preserving interpretability.
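One way to picture such a hybrid is a multinomial logit (MNL) model whose utility function includes an extra term fed by an ML model. The sketch below is a minimal illustration under stated assumptions: every coefficient, the alternative attributes, and the `ml_score` input are hypothetical, and no reviewed study is claimed to use this exact specification.

```python
import math


def utility(asc, time_min, cost, ml_score,
            beta_time=-0.05, beta_cost=-0.10, beta_ml=1.0):
    """Linear utility; all coefficients here are assumed, not estimated."""
    return asc + beta_time * time_min + beta_cost * cost + beta_ml * ml_score


def mnl_probabilities(utilities):
    """MNL choice rule: P(m) = exp(V_m) / sum_j exp(V_j)."""
    v_max = max(utilities.values())  # subtract the max for numerical stability
    exp_v = {mode: math.exp(v - v_max) for mode, v in utilities.items()}
    total = sum(exp_v.values())
    return {mode: e / total for mode, e in exp_v.items()}


def choice_probabilities(bus_ml_score):
    """Mode shares when a hypothetical ML score enters the bus utility."""
    utilities = {
        "car": utility(asc=0.0, time_min=20, cost=5.0, ml_score=0.0),
        "bus": utility(asc=-0.5, time_min=35, cost=2.0, ml_score=bus_ml_score),
        "walk": utility(asc=-1.0, time_min=60, cost=0.0, ml_score=0.0),
    }
    return mnl_probabilities(utilities)


low = choice_probabilities(bus_ml_score=0.1)
high = choice_probabilities(bus_ml_score=0.9)
```

The time and cost coefficients keep their usual behavioral reading, while `beta_ml` shows how strongly the ML output shifts the choice probabilities, which is where the interpretability of the hybrid comes from.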

Moreover, the feature importance of the input variables is studied to evaluate the effect of each individual input variable on the targeted variables. It was concluded that total travel time [86], trip distance [87], income [10], waiting time [28], sociodemographic characteristics [66], age, and car ownership [32] are the most influential variables for the prediction of TMC. However, these factors vary between studies around the globe because of personal, geographical, and contextual differences. In some studies, weather conditions are the most influential factors, whereas in others infrastructure availability and accessibility are the most influential. The primary socioeconomic characteristics that motivate passengers to shift to more environmentally friendly modes of transportation include age, nationality, employment, vehicle ownership, and income [10].
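Feature importance can be estimated in several ways; one common, model-agnostic option is permutation importance, sketched below. It is shown as an example of the general idea and is not necessarily the method used in the cited studies. The rule-based model and the data are hypothetical.

```python
import random


def accuracy(model, rows, labels):
    return sum(model(row) == label for row, label in zip(rows, labels)) / len(labels)


def permutation_importance(model, rows, labels, n_features, n_repeats=20, seed=0):
    """Mean drop in accuracy when one feature column is randomly shuffled."""
    rng = random.Random(seed)
    baseline = accuracy(model, rows, labels)
    importances = []
    for j in range(n_features):
        drops = []
        for _ in range(n_repeats):
            column = [row[j] for row in rows]
            rng.shuffle(column)
            permuted = [row[:j] + (value,) + row[j + 1:]
                        for row, value in zip(rows, column)]
            drops.append(baseline - accuracy(model, permuted, labels))
        importances.append(sum(drops) / n_repeats)
    return importances


def toy_model(row):
    """Hypothetical fitted model: uses travel time only and ignores age."""
    travel_time_min, _age = row
    return "car" if travel_time_min > 20 else "walk"


# Assumed toy data: (travel time in minutes, age in years) and chosen mode.
rows = [(5, 30), (10, 45), (15, 22), (18, 60), (25, 35), (30, 50), (40, 28), (55, 41)]
labels = ["walk", "walk", "walk", "walk", "car", "car", "car", "car"]

importances = permutation_importance(toy_model, rows, labels, n_features=2)
```

A feature the model does not use, such as age here, gets an importance of zero, while shuffling a feature the model relies on can only lower the accuracy.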

In addition to promoting environmentally friendly transportation options, reducing traffic, and mitigating the effects of travel mode choices on the environment, long-term policy recommendations also seek to improve community accessibility and mobility. To promote sustainable transportation, including walking, public transport, and cycling, the government should prioritize funding for infrastructure and for the maintenance and modernization of existing transit systems to ensure their reliability and efficiency. Health is also part of the capability constraints that influence transport options [100]; therefore, improving accessibility for people with disabilities helps reduce private car use and promotes a sustainable transportation system. Moreover, road user charges, higher parking fees in urban areas, subsidies on public transport tickets, and pricing schemes discourage private car use on peak days and during peak hours. Because one of the most significant factors is the distance between the last stop and the individual's residence, land use policies and the development of affordable housing near transit stations improve access to public transportation. However, in some cases, residential areas closer to basic amenities and public transport lines are more expensive than those farther away, which encourages individuals to live far away and use private vehicles.

This study emphasizes the increasing importance of ML as a useful substitute for traditional statistical methods in modeling the determinants of TMC for daily activities. Nevertheless, a close look at the literature indicates notable differences in the approaches used. Thus, more investigation is required to develop reliable and consistent scientific methods for using ML to analyze TMC and investigate its determinants. Across the 39 selected studies, the R2 values vary, even for the same algorithm, which might be due to the nature of the data and the variables used; standardized methods therefore need to be developed for the prediction of TMC and its determinants. Moreover, in terms of data aggregation, it is crucial to ensure consistency between the input and the output variables to prevent problems with generalization and accuracy.

The practical application of DCMs and ML models is vital for urban planners and policymakers. By increasing prediction accuracy and the interpretability of results, DCMs and ML algorithms can be integrated into transportation planning tools or policy frameworks to greatly improve decision-making. Each approach has its own merits, and when combined, they can produce strong tools for policy evaluation, infrastructure development, and transportation demand modeling. In the first step of a hybrid model, large, real-time datasets could be processed and analyzed using ML techniques. Subsequently, pertinent factors (such as transport availability and congestion levels) could be fed into a DCM to predict how travel behavior would change in response to those conditions. This makes the results more valuable for policy decisions because the predictions are not only accurate but also grounded in a solid theoretical framework. These integrated approaches make possible data-driven policy that is effective in the near term and sustainable in the long term.

Several factors need to be considered when discussing the reproducibility of the results and the generalizability of ML techniques across areas and countries. Transport options, urban infrastructure, weather conditions, and geographical factors vary from country to country and cannot be generalized. Reproducibility can be improved through open data, transparent methods, and standardized processes. In addition, models trained on one dataset cannot simply be used for predictions on other datasets because of differences in culture, behavior, infrastructure, and policy.

ML models use real-time data from smart city infrastructure, mobility apps, and personal devices, so it is crucial to address the ethical considerations related to the use of personal data. While there are many advantages to integrating ML models into transportation planning, there are also serious ethical concerns about data privacy, fairness, openness, and monitoring. Planners and policymakers should implement best practices, including data anonymization, bias audits, and open decision-making frameworks, to reduce these risks. Ethical data governance should put the preservation of people's privacy first and should make sure that ML models improve transportation systems without escalating inequality or jeopardizing citizens' rights.

6 Conclusion

The current review provides a systematic evaluation of the ML techniques used for predicting the determinants of TMC around the globe. The research develops four review questions that relate to the application domains, the utilization of ML algorithms, the datasets used in the studies, and the performance evaluation of the ML models. Using two main online publication databases, WoS and Scopus, 39 relevant studies on the determinants of TMC and their influence were found and considered for the review.

This research systematically reviews the conventional techniques (statistical tools), the modern techniques (ML algorithms), the performance criteria, and the model performance of past studies and concludes that, in most of the studies, RF outperforms SVM, GBT, DT, XGBT, and MNL. In addition, some studies used interpretable ML techniques that combine different algorithms, such as SVM + GBT or NB + RF + SVM, as in study N34, and concluded that the accuracy of the model reached 99% (0.99). In some other studies, the accuracy of the RF model ranges from 95% to 99% (0.95–0.99), as shown in Table 13 for studies N14, N16, N18, N22, N27, N34, and N39. Moreover, the coefficient of determination (R2) is also higher for RF than for other ML models. For instance, the R2 of RF in study N3 is 0.91 (91%), which is higher than that of AdaBoost, XGBT, and LightGBM.

Several studies confirmed that socio-demographic characteristics, household vehicle ownership, and income status are the main determinants of TMC. Other studies confirm that attitude, built environment, accessibility, and infrastructure influence TMC. Moreover, travel time, parking type, motorized vehicles, age, and gender explain TMC.

Given the prevalence of the problems this research describes, a deeper understanding of the methods used in ML modeling of nonlinear interactions between transport mode choice and the built environment is needed. Even though more research is necessary to fully understand the implications of these limitations, it is already clear that some of the “matters of concern” violate the fundamental holdout validation principle of machine learning and ought to be avoided in subsequent studies.

Acknowledgement: The author would like to thank the Silesian University of Technology, Poland for providing research facilities and Professor Elżbieta Macioszek for supervising and helping in the systematic literature review.

Funding Statement: The author received no specific funding for this study.

Availability of Data and Materials: The summary and Excel sheet of the selected papers will be provided upon special request from the corresponding author.

Ethics Approval: Not applicable.

Conflicts of Interest: The author declares that they have no conflicts of interest to report regarding the present study.

References

1. S. G. Stradling, “Chapter 34—travel mode choice,” in Handbook of Traffic Psychology, B. E. Porter Ed., San Diego: Academic Press, 2011, pp. 485–502. [Google Scholar]

2. A. Ababio-Donkor, W. Saleh, and A. Fonzone, “The role of personal norms in the choice of mode for commuting,” Res. Transp. Econ., vol. 83, no. 2, 2020, Art. no. 100966. doi: 10.1016/j.retrec.2020.100966. [Google Scholar] [CrossRef]

3. S. Labi, A. Faiz, T. U. Saeed, B. N. T. Alabi, and W. Woldemariam, “Connectivity, accessibility, and mobility relationships in the context of low-volume road networks,” Transp. Res. Rec., vol. 2673, no. 12, pp. 717–727, 2019. doi: 10.1177/0361198119854091. [Google Scholar] [CrossRef]

4. C. R. Bhat and R. Sardesai, “The impact of stop-making and travel time reliability on commute mode choice,” Trans. Res. Part B: Methodol., vol. 40, no. 9, pp. 709–730, 2006. doi: 10.1016/j.trb.2005.09.008. [Google Scholar] [CrossRef]

5. T. Bai, X. Li, and Z. Sun, “Effects of cost adjustment on travel mode choice: Analysis and comparison of different logit models,” Transp. Res. Procedia, vol. 25, no. 8, pp. 2649–2659, 2017. doi: 10.1016/j.trpro.2017.05.150. [Google Scholar] [CrossRef]

6. M. Ali, D. B. E. Dharmowijoyo, A. R. G. de Azevedo, R. Fediuk, H. Ahmad and B. Salah, “Time-use and spatio-temporal variables influence on physical activity intensity, physical and social health of travelers,” Sustainability, vol. 13, no. 21, 2021, Art. no. 12226. doi: 10.3390/su132112226. [Google Scholar] [CrossRef]

7. S. Kim and G. F. Ulfarsson, “Travel mode choice of the elderly: Effects of personal, household, neighborhood, and trip characteristics,” Transp. Res. Rec., vol. 1894, no. 1, pp. 117–126, 2004. doi: 10.3141/1894-13. [Google Scholar] [CrossRef]

8. E. Mirzaei, R. Kheyroddin, and D. Mignot, “Exploring the effect of the built environment, weather condition and departure time of travel on mode choice decision for different travel purposes: Evidence from Isfahan, Iran,” Case Stud. Trans. Pol., vol. 9, no. 4, pp. 1419–1430, 2021. doi: 10.1016/j.cstp.2021.05.002. [Google Scholar] [CrossRef]

9. Y. -H. Cheng and S. -Y. Chen, “Perceived accessibility, mobility, and connectivity of public transportation systems,” Trans. Res. Part A: Policy Pract., vol. 77, no. 2, pp. 386–403, 2015. doi: 10.1016/j.tra.2015.05.003. [Google Scholar] [CrossRef]

10. A. Abulibdeh, “Analysis of mode choice affects from the introduction of Doha Metro using machine learning and statistical analysis,” Trans. Res. Interdiscip. Perspect., vol. 20, 2023, Art. no. 100852. doi: 10.1016/j.trip.2023.100852. [Google Scholar] [CrossRef]

11. R. Buehler, “Determinants of transport mode choice: A comparison of Germany and the USA,” J. Transp. Geogr., vol. 19, no. 4, pp. 644–657, 2011. doi: 10.1016/j.jtrangeo.2010.07.005. [Google Scholar] [CrossRef]

12. N. Vidovic and J. Simicevic, “The impact of parking pricing on mode choice,” Transp. Res. Procedia, vol. 69, no. 3, pp. 297–304, 2023. doi: 10.1016/j.trpro.2023.02.175. [Google Scholar] [CrossRef]

13. R. W. Willson and D. C. Shoup, “Parking subsidies and travel choices: Assessing the evidence,” Transportation, vol. 17, no. 2, pp. 141–157, 1990. doi: 10.1007/BF02125333. [Google Scholar] [CrossRef]

14. D. Ogilvie, M. Egan, V. Hamilton, and M. Petticrew, “Promoting walking and cycling as an alternative to using cars: Systematic review,” BMJ, vol. 329, no. 7469, 2004, Art. no. 763. doi: 10.1136/bmj.38216.714560.55. [Google Scholar] [PubMed] [CrossRef]

15. C. Beckx, S. Broekx, B. Degraeuwe, B. Beusen, and L. Int Panis, “Limits to active transport substitution of short car trips,” Trans. Res. Part D: Transp. Environ., vol. 22, pp. 10–13, 2013. doi: 10.1016/j.trd.2013.03.001. [Google Scholar] [CrossRef]

16. H. Fitzová, M. Matulová, and Z. Tomeš, “Determinants of urban public transport efficiency: Case study of the Czech Republic,” Eur. Transp. Res. Rev., vol. 10, no. 42, 2018, Art. no. 225. doi: 10.1186/s12544-018-0311-y.

17. M. Harbering and J. Schlüter, “Determinants of transport mode choice in metropolitan areas the case of the metropolitan area of the Valley of Mexico,” J. Transp. Geogr., vol. 87, no. 2, 2020, Art. no. 102766. doi: 10.1016/j.jtrangeo.2020.102766.

18. S. Convery and B. Williams, “Determinants of transport mode choice for non-commuting trips: The roles of transport, land use and socio-demographic characteristics,” Urban Sci., vol. 3, no. 3, 2019, Art. no. 82. doi: 10.3390/urbansci3030082.

19. M. Ali, E. Macioszek, and N. Ali, “Travel mode choice prediction to pursue sustainable transportation and enhance health parameters using R,” Sustainability, vol. 16, no. 14, 2024, Art. no. 5908. doi: 10.3390/su16145908.

20. M. Ali, E. Macioszek, and C. W. Yuen, “Health enhancement through activity travel participation and physical activity intensity,” J. Transp. Health, vol. 39, no. 5, 2024, Art. no. 101927. doi: 10.1016/j.jth.2024.101927.

21. S. Verron, T. Tiplica, and A. Kobi, “Fault detection and identification with a new feature selection based on mutual information,” J. Process Control, vol. 18, no. 5, pp. 479–490, 2008. doi: 10.1016/j.jprocont.2007.08.003.

22. Y. Qian et al., “Classification of imbalanced travel mode choice to work data using adjustable SVM model,” Appl. Sci., vol. 11, no. 24, 2021, Art. no. 11916. doi: 10.3390/app112411916.

23. W. F. Guthrie, NIST/SEMATECH e-Handbook of Statistical Methods (NIST Handbook 151). Gaithersburg, MD, USA: National Institute of Standards and Technology, 2020.

24. T. Ma et al., “Nonlinear relationships between vehicle ownership and household travel characteristics and built environment attributes in the US using the XGBT algorithm,” Sustainability, vol. 14, no. 6, 2022, Art. no. 3395. doi: 10.3390/su14063395.

25. F. Wang and C. L. Ross, “Machine learning travel mode choices: Comparing the performance of an extreme gradient boosting model with a multinomial logit model,” Transp. Res. Rec., vol. 2672, no. 47, pp. 35–45, 2018. doi: 10.1177/0361198118773556.

26. J. Pineda-Jaramillo and Ó. Arbeláez-Arenas, “Assessing the performance of gradient-boosting models for predicting the travel mode choice using household survey data,” J. Urban Plan. Dev., vol. 148, no. 2, 2022, Art. no. 04022007. doi: 10.1061/(ASCE)UP.1943-5444.0000830.

27. Z. Xu, M. Aghaabbasi, M. Ali, and E. Macioszek, “Targeting sustainable transportation development: The support vector machine and the Bayesian optimization algorithm for classifying household vehicle ownership,” Sustainability, vol. 14, no. 17, 2022, Art. no. 11094. doi: 10.3390/su141711094.

28. N. F. M. Ali, A. F. M. Sadullah, A. P. P. A. Majeed, M. A. M. Razman, M. A. Zakaria, and A. F. A. Nasir, “Travel mode choice modeling: Predictive efficacy between machine learning models and discrete choice model,” Open Transp. J., vol. 15, no. 1, pp. 241–255, 2021. doi: 10.2174/1874447802115010241.

29. P. Tang, M. Aghaabbasi, M. Ali, A. Jan, A. M. Mohamed, and A. Mohamed, “How sustainable is people travel to reach public transit stations to go to work? A machine learning approach to reveal complex relationships,” Sustainability, vol. 14, no. 7, 2022, Art. no. 3989. doi: 10.3390/su14073989.

30. M. Aghaabbasi, M. Ali, M. Jasiński, Z. Leonowicz, and T. Novák, “On hyperparameter optimization of machine learning methods using a Bayesian optimization algorithm to predict work travel mode choice,” IEEE Access, vol. 11, no. 13, pp. 19762–19774, 2023. doi: 10.1109/ACCESS.2023.3247448.

31. M. Tamim Kashifi, A. Jamal, M. Samim Kashefi, M. Almoshaogeh, and S. Masiur Rahman, “Predicting the travel mode choice with interpretable machine learning techniques: A comparative study,” Travel Behav. Soc., vol. 29, no. 2, pp. 279–296, 2022. doi: 10.1016/j.tbs.2022.07.003.

32. E.-J. Kim, “Analysis of travel mode choice in Seoul using an interpretable machine learning approach,” J. Adv. Transport., vol. 2021, 2021, Art. no. 6685004. doi: 10.1155/2021/6685004.

33. X. Zhao, X. Yan, and P. Van Hentenryck, “Modeling heterogeneity in mode-switching behavior under a mobility-on-demand transit system: An interpretable machine learning approach,” 2019, arXiv:1902.02904.

34. H. Zhang, L. Zhang, Y. Liu, and L. Zhang, “Understanding travel mode choice behavior: Influencing factors analysis and prediction with machine learning method,” Sustainability, vol. 15, no. 14, 2023, Art. no. 11414. doi: 10.3390/su151411414.

35. S. Wójcik, “The determinants of travel mode choice: The case of Łódź, Poland,” Bull. Geography. Soc.-Eco. Ser., vol. 44, no. 44, pp. 93–101, 2019.

36. C. Ding, D. Wang, C. Liu, Y. Zhang, and J. Yang, “Exploring the influence of built environment on travel mode choice considering the mediating effects of car ownership and travel distance,” Transp. Res. Part A: Policy Pract., vol. 100, no. 1, pp. 65–80, 2017. doi: 10.1016/j.tra.2017.04.008.

37. M. Ali and E. Macioszek, “Relationship among socio-demographic characteristics, activity-travel participation, travel parameter, physical activity intensity, and health parameters,” in Advanced Solutions for Mobility in Urban Areas, Lecture Notes in Networks and Systems, G. Sierpiński, S. Al-Majeed, E. Macioszek, Eds., Poland: Springer, Cham, 2024, vol. 907, pp. 65–81. doi: 10.1007/978-3-031-53181-1_5.

38. C. Dora and M. Phillips, Transport, Environment and Health (No. 89). Austria: WHO Regional Office Europe, 2000.

39. I. Aijaz and A. Ahmad, “Electric vehicles for environmental sustainability,” in Smart Technologies for Energy and Environmental Sustainability, P. Agarwal, M. Mittal, J. Ahmed, S. M. Idrees, Eds., Cham: Springer International Publishing, 2022, pp. 131–145.

40. B. van Wee and D. Ettema, “Travel behaviour and health: A conceptual model and research agenda,” J. Transp. Health, vol. 3, no. 3, pp. 240–248, 2016. doi: 10.1016/j.jth.2016.07.003.

41. J. Zhang, “Urban forms and health promotion: An evaluation based on health-related QOL indicators,” in Proc. 13th World Conf. Transp. Res., Rio de Janeiro, Brazil, Jul. 15–18, 2013, pp. 15–18.

42. M. Ali, E. Macioszek, K. Onyelowe, C. W. Yuen, and K. Arif, “Interaction of activity travel, GHG emissions, and health parameters using R—A step towards sustainable transportation system,” Ain Shams Eng. J., 2024, Art. no. 103050. doi: 10.1016/j.asej.2024.103050.

43. NHTS, NextGen National Household Travel Survey. Federal Highway Administration, U.S. Department of Transportation, 2022. Accessed: Oct. 16, 2024. [Online]. Available: http://nhts.ornl.gov

44. L. Xu, H. Ü. Yilmaz, Z. Wang, W.-R. Poganietz, and P. Jochem, “Greenhouse gas emissions of electric vehicles in Europe considering different charging strategies,” Transp. Res. Part D: Transp. Environ., vol. 87, 2020, Art. no. 102534. doi: 10.1016/j.trd.2020.102534.

45. J. Hollevoet, A. D. Witte, and C. Macharis, “Improving insight in modal choice determinants: An approach towards more sustainable transport,” in Urban Transport XVII: Urban Transport and the Environment in the 21st Century, UK: Wessex Institute of Technology (WIT Press), 2011, vol. 116, pp. 129.

46. A. de Nazelle et al., “Improving health through policies that promote active travel: A review of evidence to support integrated health impact assessment,” Environ. Int., vol. 37, no. 4, pp. 766–777, 2011. doi: 10.1016/j.envint.2011.02.003.

47. B. Huang and Q. Wu, “Dynamic accessibility analysis in location-based service using an incremental parallel algorithm,” Environ. Plann. B: Plann. Des., vol. 35, no. 5, pp. 831–846, 2008. doi: 10.1068/b33118.

48. W. Tao et al., “An advanced machine learning approach to predicting pedestrian fatality caused by road crashes: A step toward sustainable pedestrian safety,” Sustainability, vol. 14, no. 4, 2022, Art. no. 2436. doi: 10.3390/su14042436.

49. G. Bresson, J. Dargay, J.-L. Madre, and A. Pirotte, “The main determinants of the demand for public transport: A comparative analysis of England and France using shrinkage estimators,” Transp. Res. Part A: Policy Pract., vol. 37, no. 7, pp. 605–627, 2003. doi: 10.1016/S0965-8564(03)00009-0.

50. D. Papaioannou and L. M. Martinez, “The role of accessibility and connectivity in mode choice. A structural equation modeling approach,” Transp. Res. Procedia, vol. 10, pp. 831–839, 2015. doi: 10.1016/j.trpro.2015.09.036.

51. F. Wolday, “The effect of neighbourhood and urban center structures on active travel in small cities,” Cities, vol. 132, no. 4, 2023, Art. no. 104050. doi: 10.1016/j.cities.2022.104050.

52. S. Sultana, Factors Affecting Parents’ Choice of Active Transport Modes for Children’s Commute to School: Evidence from 2017 NHTS Data. The University of Toledo, 2019.

53. M. Aghaabbasi, M. Zaly Shah, and R. Zainol, “Investigating the use of active transportation modes among university employees through an advanced decision tree algorithm,” Civil Sustain. Urban Eng., vol. 1, no. 1, pp. 26–49, 2021. doi: 10.53623/csue.v1i1.28.

54. B. X. Wang and N. Japkowicz, “Boosting support vector machines for imbalanced data sets,” Knowl. Inf. Syst., vol. 25, no. 1, pp. 1–20, 2010. doi: 10.1007/s10115-009-0198-y.

55. R. Batuwita and V. Palade, “FSVM-CIL: Fuzzy support vector machines for class imbalance learning,” IEEE Trans. Fuzzy Syst., vol. 18, no. 3, pp. 558–571, 2010. doi: 10.1109/TFUZZ.2010.2042721.

56. G. Wu and E. Y. Chang, “Class-boundary alignment for imbalanced dataset learning,” in ICML 2003 Workshop Learn. Imbalanced Data Sets II, Washington, DC, USA, 2003, pp. 49–56.

57. M. Aghaabbasi and S. Chalermpong, “Machine learning techniques for evaluating the nonlinear link between built-environment characteristics and travel behaviors: A systematic review,” Travel Behav. Soc., vol. 33, no. 1, 2023, Art. no. 100640. doi: 10.1016/j.tbs.2023.100640.

58. M. Ali and S. Hin Lai, “Artificial intelligent techniques for prediction of rock strength and deformation properties—A review,” Structures, vol. 55, no. 1, pp. 1542–1555, 2023. doi: 10.1016/j.istruc.2023.06.131.

59. Y. Yang, S. Samaranayake, and T. Dogan, “Assessing impacts of the built environment on mobility: A joint choice model of travel mode and duration,” Environ. Plann., vol. 50, no. 9, pp. 2359–2375, 2023. doi: 10.1177/23998083231154263.

60. Y. Xia, H. Chen, and R. Zimmermann, “A Random Effect Bayesian Neural Network (RE-BNN) for travel mode choice analysis across multiple regions,” Travel Behav. Soc., vol. 30, no. 2, pp. 118–134, 2023. doi: 10.1016/j.tbs.2022.08.011.

61. P. Noorbakhsh, N. Khademi, and K. Chaiyasarn, “Exploration of women cyclists’ perceived security using tree-based machine learning algorithms,” in Procedia Computer Science, E. Shakshuki, Ed., Leuven, Belgium, 2023, vol. 220, pp. 624–631. doi: 10.1016/j.procs.2023.03.079.

62. M. Murugan and S. Marisamynathan, “Mode shift behaviour and user willingness to adopt the electric two-wheeler: A study based on Indian road user preferences,” Int. J. Transp. Sci. Technol., vol. 12, no. 2, pp. 428–446, 2023. doi: 10.1016/j.ijtst.2022.03.008.

63. J. Á. Martín-Baos, J. A. López-Gómez, L. Rodriguez-Benitez, T. Hillel, and R. García-Ródenas, “A prediction and behavioural analysis of machine learning methods for modelling travel mode choice,” Transp. Res. Part C: Emerg. Technol., vol. 156, 2023, Art. no. 104318. doi: 10.1016/j.trc.2023.104318.

64. L. Liu, Y. Wang, and R. Hickman, “How rail transit makes a difference in people’s multimodal travel behaviours: An analysis with the XGBoost method,” Land, vol. 12, no. 3, 2023, Art. no. 675. doi: 10.3390/land12030675.

65. A. N. P. Koushik, M. Manoj, N. Nezamuddin, and A. P. Prathosh, “Testing and enhancing spatial transferability of artificial neural networks based travel behavior models,” Transp. Lett., vol. 15, no. 9, pp. 1083–1094, 2023. doi: 10.1080/19427867.2022.2130150.

66. F. Hatami, M. M. Rahman, B. Nikparvar, and J. C. Thill, “Non-linear associations between the urban built environment and commuting modal split: A random forest approach and SHAP evaluation,” IEEE Access, vol. 11, no. 1, pp. 12648–12661, 2023. doi: 10.1109/ACCESS.2023.3241627.

67. H. Bei, H. Chen, L. Li, X. Gao, Y. Xia, and Y. Sun, “Joint prediction of travel mode choice and purpose from travel surveys: A multitask deep learning approach,” Travel Behav. Soc., vol. 33, 2023, Art. no. 100625. doi: 10.1016/j.tbs.2023.100625.

68. E. Yousefzadeh Barri, S. Farber, H. Jahanshahi, and E. Beyazit, “Understanding transit ridership in an equity context through a comparison of statistical and machine learning algorithms,” J. Transp. Geogr., vol. 105, 2022, Art. no. 103482. doi: 10.1016/j.jtrangeo.2022.103482.

69. P. Salas, R. De la Fuente, S. Astroza, and J. A. Carrasco, “A systematic comparative evaluation of machine learning classifiers and discrete choice models for travel mode choice in the presence of response heterogeneity,” Expert Syst. Appl., vol. 193, 2022, Art. no. 116253. doi: 10.1016/j.eswa.2021.116253.

70. H. Naseri, E. O. D. Waygood, B. Wang, and Z. Patterson, “Application of machine learning to child mode choice with a novel technique to optimize hyperparameters,” Int. J. Environ. Res. Public Health, vol. 19, no. 24, 2022, Art. no. 16844. doi: 10.3390/ijerph192416844.

71. K. A. Momin, S. Barua, O. F. Hamim, and S. Roy, “Modeling the behavior in choosing the travel mode for long-distance travel using supervised machine learning algorithms,” Commun. Sci. Lett. Univ. Zilina., vol. 24, no. 4, pp. A187–A197, 2022. doi: 10.26552/com.C.2022.4.A187-A197.

72. N. F. Mohd Ali, A. F. Mohd Sadullah, A. P. P. Abdul Majeed, M. A. Mohd Razman, and R. M. Musa, “The identification of significant features towards travel mode choice and its prediction via optimised random forest classifier: An evaluation for active commuting behavior,” J. Transp. Health, vol. 25, 2022, Art. no. 101362. doi: 10.1016/j.jth.2022.101362.

73. R. A. Hasan, H. Irshaid, F. Alhomaidat, S. Lee, and J. S. Oh, “Transportation mode detection by using smartphones and smartwatches with machine learning,” KSCE J. Civil Eng., vol. 26, no. 8, pp. 3578–3589, 2022. doi: 10.1007/s12205-022-1281-0.

74. J. C. García-García, R. García-Ródenas, J. A. López-Gómez, and J. Á. Martín-Baos, “A comparative study of machine learning, deep neural networks and random utility maximization models for travel mode choice modelling,” Transp. Res. Procedia, vol. 62, pp. 374–382, 2022. doi: 10.1016/j.trpro.2022.02.047.

75. M. Wong and B. Farooq, “ResLogit: A residual neural network logit model for data-driven choice modelling,” Transp. Res. Part C: Emerg. Technol., vol. 126, 2021, Art. no. 103050. doi: 10.1016/j.trc.2021.103050.

76. X. Sun and S. Wandelt, “Transportation mode choice behavior with recommender systems: A case study on Beijing,” Transp. Res. Interdiscip. Perspect., vol. 11, 2021, Art. no. 100408. doi: 10.1016/j.trip.2021.100408.

77. K. Gao, Y. Yang, T. Zhang, A. Li, and X. Qu, “Extrapolation-enhanced model for travel decision making: An ensemble machine learning approach considering behavioral theory,” Knowl. Based Syst., vol. 218, 2021, Art. no. 106882. doi: 10.1016/j.knosys.2021.106882.

78. R. Buijs, T. Koch, and E. Dugundji, “Using neural nets to predict transportation mode choice: Amsterdam network change analysis,” J. Ambient Intell. Humaniz. Comput., vol. 12, no. 1, pp. 121–135, 2021. doi: 10.1007/s12652-020-02855-6.

79. X. Zhao, X. Yan, A. Yu, and P. V. Hentenryck, “Prediction and behavioral analysis of travel mode choice: A comparison of machine learning and logit models,” Travel Behav. Soc., vol. 20, no. 1, pp. 22–35, 2020. doi: 10.1016/j.tbs.2020.02.003.

80. J. Slik and S. Bhulai, “Transaction-driven mobility analysis for travel mode choices,” in Procedia Computer Science, Warsaw, Poland: Elsevier, Apr. 6–9, 2020, vol. 170, pp. 169–176. doi: 10.1016/j.procs.2020.03.022.

81. A. N. P. Koushik, M. Manoj, and N. Nezamuddin, “Machine learning applications in activity-travel behaviour research: A review,” Transp. Rev., vol. 40, no. 3, pp. 288–311, 2020. doi: 10.1080/01441647.2019.1704307.

82. L. Jin et al., “Clustering life course to understand the heterogeneous effects of life events, gender, and generation on habitual travel modes,” IEEE Access, vol. 8, pp. 190964–190980, 2020. doi: 10.1109/ACCESS.2020.3032328.

83. R. Buijs, T. Koch, and E. Dugundji, “Using neural nets to predict transportation mode choice: An Amsterdam case study,” Procedia Comput. Sci., vol. 170, pp. 115–122, 2020. doi: 10.1016/j.procs.2020.03.015.

84. W. Yan, W. Zhou, C. Tan, and L. Fan, “Employee ridesharing: Reinforcement learning and choice modeling,” presented at the 25th Am. Conf. Inf. Syst., AMCIS 2019, Jun. 15–19, 2019.

85. E. Haynes, J. Green, R. Garside, M. P. Kelly, and C. Guell, “Gender and active travel: A qualitative data synthesis informed by machine learning,” Int. J. Behav. Nutr. Phys. Act., vol. 16, no. 1, 2019, Art. no. 135. doi: 10.1186/s12966-019-0904-4.

86. L. Cheng, X. Chen, J. De Vos, X. Lai, and F. Witlox, “Applying a random forest method approach to model travel mode choice behavior,” Travel Behav. Soc., vol. 14, no. 3, pp. 1–10, 2019. doi: 10.1016/j.tbs.2018.09.002.

87. X. Chang, J. Wu, H. Liu, X. Yan, H. Sun, and Y. Qu, “Travel mode choice: A data fusion model using machine learning methods and evidence from travel diary survey data,” Transportmetrica A Transport Sci., vol. 15, no. 2, pp. 1587–1612, 2019. doi: 10.1080/23249935.2019.1620380.

88. K. J. Assi, M. Shafiullah, K. M. Nahiduzzaman, and U. Mansoor, “Travel-to-school mode choice modelling employing artificial intelligence techniques: A comparative study,” Sustainability, vol. 11, no. 16, 2019, Art. no. 4484. doi: 10.3390/su11164484.

89. Z. Zhu, X. Chen, C. Xiong, and L. Zhang, “A mixed Bayesian network for two-dimensional decision modeling of departure time and mode choice,” Transportation, vol. 45, no. 5, pp. 1499–1522, 2018. doi: 10.1007/s11116-017-9770-6.

90. M. Wong and B. Farooq, “Modelling latent travel behaviour characteristics with generative machine learning,” in 2018 21st Int. Conf. Intell. Transp. Syst. (ITSC), Maui, HI, USA, 2018, pp. 749–754. doi: 10.1109/ITSC.2018.8569581.

91. J. Hagenauer and M. Helbich, “A comparative study of machine learning classifiers for modeling travel mode choice,” Expert Syst. Appl., vol. 78, no. 1, pp. 273–282, 2017. doi: 10.1016/j.eswa.2017.01.057.

92. M. Ali, E. Macioszek, and D. B. Endrayana Dharmowijoyo, “Influence of activity-travel participation, travel mode choice, and multitasking activities on subjective well-being using R,” Sustainability, vol. 15, no. 23, 2023, Art. no. 16338. doi: 10.3390/su152316338.

93. M. Ali and E. Macioszek, “The influence of travel mode choice on subjective wellbeing—A case study,” Transp. Probl.: An Int. Sci. J., vol. 18, no. 4, pp. 5–17, 2023. doi: 10.20858/tp.2023.18.4.01.

94. Y. A. S. Harumain, S. Koting, N. S. A. Sukor, M. M. Dali, N. Fauzi, and T. Osada, “Mode choice of mothers travelling with young children in Malaysia,” Plann. Malays., vol. 20, pp. 352–362, 2022.

95. S. Wójcik, “Private or public transport? The determinants of travel behaviour in post-industrial city-the case of Łódź,” in REAL CORP 2017-PANTA RHEI–A World in Constant Motion, Vienna, Austria, Sep. 13, 2017, pp. 723–728.

96. F. R. Ashik, A. I. Z. Sreezon, M. H. Rahman, N. M. Zafri, and S. M. Labib, “Built environment influences commute mode choice in a global south megacity context: Insights from explainable machine learning approach,” J. Transp. Geogr., vol. 116, 2024, Art. no. 103828. doi: 10.1016/j.jtrangeo.2024.103828.

97. E. Kuskapan, T. Campisi, G. D. Cet, C. Vianello, and M. Y. Codur, “Examination of the effects of the pandemic process on the e-scooter usage behaviours of individuals with machine learning,” Trans. Transp. Sci., vol. 14, no. 3, pp. 25–31, 2023. doi: 10.5507/tots.2023.016.

98. J. D. Pineda-Jaramillo, “A review of Machine Learning (ML) algorithms used for modeling travel mode choice,” Dyna, vol. 86, no. 211, pp. 32–41, 2019.

99. Y. Ren, M. Yang, E. Chen, L. Cheng, and Y. Yuan, “Exploring passengers’ choice of transfer city in air-to-rail intermodal travel using an interpretable ensemble machine learning approach,” Transportation, vol. 51, no. 4, pp. 1493–1523, 2023. doi: 10.1007/s11116-023-10375-3.

100. M. Ali, D. B. Dharmowijoyo, I. S. Harahap, A. Puri, and L. E. Tanjung, “Travel behaviour and health: Interaction of activity-travel pattern, travel parameter and physical intensity,” Solid State Technol., vol. 63, no. 6, pp. 4026–4039, 2020.

Copyright © 2024 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
