Estimating a mode choice model with incomplete data
This study develops and uses a synthetic data base to calibrate a logit mode choice model of work trips in metropolitan Vancouver. Missing survey data entries for perceived measures of travel time and waiting time by bus, as well as operating and parking cost by car, are calculated using statistical methods to increase the survey sample of 275 complete cases to 621 usable cases. The synthesized data set is used to specify random utility functions for two planning assumptions. The short-term policy specification using only level of service variables does not produce a usable model, but the specification based on a long-term planning assumption using a combination of level of service and socioeconomic variables produces plausible results. The inconclusive results from the policy model could be due to survey data problems, data simulation, and (or) the lack of conceptual validity of perceived measures of transportation attributes. The planning model provides insight into mode split prediction and transportation management for cities that are undergoing dynamic demographic and social changes. Key words: mode choice, incomplete data, socioeconomic factors, logit model.