
Zone-based destination choice models will incorporate a utility function that includes a number of different types of explanatory variables. Generally speaking, the utility function will include two categories of explanatory factors:

• Qualitative Factors (how good are the choices in a given destination zone)
• Quantitative Factors (how many individual choices are in a zone)

The usage of qualitative explanatory factors is common in virtually all choice models. The quantitative factors are an unusual feature of destination choice models, and arise out of the fact that travelers are modeled as choosing a destination zone, but in actuality are choosing one of multiple individual opportunities within the zone (e.g. for a work trip the destination is one individual job among all the jobs in a zone).

The sections below detail some of the more common types of parameters and variables that are used to represent these factors in destination choice models.

## Size Terms / Attractions

Destination choice models are usually represented with some level of aggregation of the alternatives. That is, the "alternatives" represented in the model, often TAZs, are not actually the choices, but they represent a pool of choices. For example, the destination choice model may express the choice of a work trip destination as TAZ 123, but in actuality the destination is one particular job among however many jobs there are within that TAZ; if there are more jobs in the TAZ, there are more actual sub-alternatives to choose within the modeled alternative of TAZ 123. The aggregate choices in many ways are similar to a nested logit model, with the aggregations (zones) corresponding to the nests, except we only observe the choice at the nest level, not at the elemental alternative level. To incorporate this detail into the utility function for the destination choice model, we must provide a representation of the number of individual unique alternatives available within the zone.
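One common way to represent the number of elemental alternatives, assumed here rather than prescribed by this page, is to add the natural log of a size variable to each zone's utility with a coefficient of one. Under that assumption, two zones of equal quality are chosen in proportion to their size. The sketch below uses hypothetical zone data and a hypothetical function name.

```python
import math

def destination_probabilities(quality_utility, size):
    """MNL zone probabilities with utility V_j + ln(S_j).

    quality_utility: qualitative utility V_j of each zone
    size: number of opportunities S_j in each zone (e.g. jobs)
    """
    exp_u = []
    for v, s in zip(quality_utility, size):
        if s > 0:
            exp_u.append(math.exp(v + math.log(s)))
        else:
            # A zone with no opportunities cannot be chosen.
            exp_u.append(0.0)
    total = sum(exp_u)
    return [e / total for e in exp_u]

# Hypothetical example: three zones of identical quality, different job counts.
quality = [0.0, 0.0, 0.0]
jobs = [100, 300, 600]
probs = destination_probabilities(quality, jobs)
```

With equal quality, the resulting shares match each zone's share of total jobs, which is the behavior the size term is meant to produce.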

The exact nature of the quantitative term will generally vary based on the trip purpose being modeled. For work based trips, it is typical to include measures of employment, either in total or by industry type (the latter being preferred if disaggregate employment information for travelers is also available by industry type). For non-work purposes, it is typical to include only particular relevant industry categories (e.g. retail employment for shopping purposes, restaurant employment for meal purposes, etc.) and other socio-economic features of the zones as well (e.g. households or population for social purposes).

In general, when deciding which variables and parameters belong in the size terms of the utility function, the question to consider is whether the data represent how many opportunities there are (if so, it is a size term) or how good (or bad) the opportunities are (in which case it is not a size term).

While this page describes theoretical foundations, actual data sources commonly used in destination choice modeling as Size Terms / Attractions can be found here.

## Distance / Impedance Terms

Perhaps the most fundamental terms in the utility function for destination choice models are measures of distance, travel time, or more generally, impedance. These terms represent the effort required to get to various alternative destinations from a known origin.

Distance
The simplest representation in this category is pure distance, either straight line ("as the crow flies") or, more typically, routed via the shortest path on a highway network. Using pure distance is convenient in some cases, as it does not require any representation or calculation of congestion, link flow speeds, travel costs, etc. Depending on how the destination choice model is integrated into a larger family of models, some of these measures might not even be available at the time the model is applied. Still, the simplicity offered by using pure distance should be weighed against the limited relationship that distance has to actual travel behavior choices: people generally experience and measure "distance" as travel time, not actual linear distance.
Travel Time
It is more reasonable to measure impedance by using travel time than using pure distance. Travel time encapsulates one of the important aspects of the travel experience, and is in large part how travelers actually experience the disutility of traveling. However, using travel time as an impedance factor does require actually computing travel times, which implies knowledge of the time of day (at least by congested/peak v. uncongested/offpeak) and the mode of travel. The travel time concept can also be expanded to "generalized time", which allows for weighting and summing various components of travel time and cost (e.g., out-of-vehicle travel time can be 3 times more onerous than in-vehicle travel time).
Mode Choice Logsums
If the destination choice model is disaggregate and is placed over a similarly disaggregate mode choice model, it is typical to include the logsum from the mode choice model as a measure of impedance in the destination choice model. Doing so obviates the need for separately identifying the impedance by travel mode, as the logsum represents an impedance calculated jointly across all modes.

It is not unusual to include more than one of these measures of impedance in a single destination choice model. For example, you might have both a mode choice logsum term (to represent the multimodal accessibility to destinations) and also a pure distance term, which aids in calibration and validation. Examples for data sources for impedance measures can be found here.
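As a minimal sketch of the generalized time and logsum measures described above, the following computes a generalized time for each mode and then a mode choice logsum for a single origin-destination pair. The out-of-vehicle weight of 3 comes from the example in the text; the value of time, the time coefficient, the mode data, and all function names are hypothetical assumptions, and a real mode choice utility would contain more terms.

```python
import math

def generalized_time(in_vehicle_min, out_of_vehicle_min, cost,
                     ovt_weight=3.0, value_of_time=0.25):
    """Weighted sum of time and cost, in equivalent in-vehicle minutes.

    value_of_time is in dollars per minute (hypothetical value).
    """
    return in_vehicle_min + ovt_weight * out_of_vehicle_min + cost / value_of_time

def mode_choice_logsum(mode_utilities):
    """ln(sum_m exp(V_m)), computed in a numerically stable way."""
    top = max(mode_utilities)
    return top + math.log(sum(math.exp(v - top) for v in mode_utilities))

# Hypothetical level of service for one origin-destination pair.
auto_gt = generalized_time(in_vehicle_min=20, out_of_vehicle_min=2, cost=3.00)
transit_gt = generalized_time(in_vehicle_min=30, out_of_vehicle_min=10, cost=2.00)

time_coef = -0.05  # hypothetical utility per generalized minute
utilities = [time_coef * auto_gt, time_coef * transit_gt]
logsum = mode_choice_logsum(utilities)
```

The logsum is never lower than the utility of the best mode, and it rises as additional modes become competitive, which is why it serves as a multimodal measure of impedance.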

## Psychological Boundaries

The utility function may include disutility factors associated with crossing boundary lines, which may represent real geographic features (e.g. rivers, railroads, ridge lines) or socio-political boundaries (e.g. state lines, neighborhood boundaries, etc).
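One simple way to represent such a boundary, sketched here under assumed data, is an indicator variable that equals one when the origin and destination lie on opposite sides of the boundary. The zone numbers, the side lookup, and the coefficient value are all hypothetical.

```python
# Hypothetical lookup: which side of a river each zone lies on.
zone_side = {101: "east", 102: "east", 201: "west"}

# Hypothetical estimated coefficient; negative means crossing is a disutility.
beta_river = -0.8

def boundary_utility(origin, destination):
    """Utility contribution of crossing the river for one zone pair."""
    crosses = 1 if zone_side[origin] != zone_side[destination] else 0
    return beta_river * crosses

same_side = boundary_utility(101, 102)    # no crossing, no penalty
across_river = boundary_utility(101, 201)  # crossing, penalty applies
```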

## Destination Accessibilities

Accessibility measures capture spatial autocorrelation, agglomeration effects, convenience, or centrality of various alternatives. These measures reflect not so much the intrinsic desirability of a destination itself, but rather the convenience of visiting a destination so as to be able to conduct other activities nearby as well. For example, two office towers might each be construed as substantially similar destinations, but the office tower surrounded by shops and restaurants (i.e. complementary activities) would likely be more desirable than the office tower surrounded simply by parking lots and vacant land.
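The office tower example can be illustrated with a simple gravity-style accessibility measure, in which nearby complementary activity is discounted by travel time. The functional form, the decay parameter, and the zone data below are assumptions chosen for illustration; this page does not prescribe a particular accessibility formula.

```python
import math

def destination_accessibility(zone, activity, travel_time, decay=0.1):
    """ln(1 + sum_k A_k * exp(-decay * t_zone,k)) over all other zones k."""
    total = sum(
        activity[k] * math.exp(-decay * travel_time[zone][k])
        for k in activity
        if k != zone
    )
    return math.log(1.0 + total)

# Hypothetical retail and restaurant employment by zone.
activity = {"tower_a": 0, "tower_b": 0, "shops": 500}

# Hypothetical travel times in minutes: tower_a is next to the shops,
# tower_b is far from them.
travel_time = {
    "tower_a": {"tower_b": 25, "shops": 5},
    "tower_b": {"tower_a": 25, "shops": 30},
}

acc_a = destination_accessibility("tower_a", activity, travel_time)
acc_b = destination_accessibility("tower_b", activity, travel_time)
```

Here the tower near the shops receives the higher accessibility value, so a positive coefficient on this term would make it the more attractive of two otherwise similar destinations.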

## Other Destination Qualities

Other physical or socio-economic attributes of the various destinations can also be included in a destination choice model, such as measures of walkability, the diversity of land uses, etc.

## Constants

Unlike in many other choice models, it is not common to incorporate alternative-specific constants for every destination zone in a destination choice model. Doing so would result in some convenient mathematical properties (notably, for MNL models, the *other* parameters of the model will have unbiased estimators even in the presence of non-uniform sampling). However, including a complete set of alternative-specific constants can result in other complications: if the number of zonal alternatives approaches or exceeds the number of sampled destination observations, the model will have too many parameters to be identified from the data and model estimation will simply fail. Even if the model can be identified, a very large number of observations may be necessary to provide sufficient statistical confidence in the estimated results. If the estimation data set is large relative to the number of zones, these problems may be overcome and it may be reasonable to use a complete set of alternative-specific constants, although this is not common when using household surveys in the United States.

Instead of employing a complete set of constants for every alternative, it may sometimes be advisable to include just a partial set of constants, covering only selected alternatives.

## Traveler Attributes

One distinct advantage of adopting a "disaggregate" destination choice model is the capability of incorporating a wide variety of attributes of the traveler (e.g. income, gender, auto ownership, etc) into the utility function. These attributes generally do not enter the utility function directly, as the traveler attributes do not vary based upon the choice of any particular destination; instead they are interacted with other attributes that do vary by the choice, including any of the terms discussed above.
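The sketch below illustrates why traveler attributes are interacted rather than entered directly. A term that is identical across all destinations shifts every utility by the same amount and leaves the logit probabilities unchanged, while an interaction with distance changes them. All coefficient values, distances, and names are hypothetical.

```python
import math

def mnl_probabilities(utilities):
    """Multinomial logit probabilities for a list of zone utilities."""
    top = max(utilities)
    exp_u = [math.exp(u - top) for u in utilities]
    total = sum(exp_u)
    return [e / total for e in exp_u]

distances = [2.0, 5.0, 10.0]   # hypothetical miles to three zones
beta_dist = -0.10              # hypothetical base distance coefficient
beta_dist_low_income = -0.05   # hypothetical extra distance sensitivity

def zone_utilities(low_income, direct_term=0.0):
    """Distance utility with an income interaction.

    direct_term adds the income indicator directly to every zone,
    to show that such a term has no effect on the choice.
    """
    slope = beta_dist + beta_dist_low_income * low_income
    return [slope * d + direct_term * low_income for d in distances]

low_income_probs = mnl_probabilities(zone_utilities(low_income=1))
with_direct_term = mnl_probabilities(zone_utilities(low_income=1, direct_term=0.7))
high_income_probs = mnl_probabilities(zone_utilities(low_income=0))
```

In this example the direct term leaves the low-income probabilities unchanged, whereas the distance interaction makes low-income travelers more likely than high-income travelers to choose the nearest zone.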

When applied in a more aggregate gravity model application, typically there are still a limited number of traveler attributes that can be incorporated into the utility function. However, the decision about which attributes, if any, to include must generally be made in the context of the entire system of models (i.e., the other three steps in the four-step model), as the identified attributes usually need to remain consistent across all model components.