Help Page for Polling Place Capacity/Line Optimization Simulator

The Simulation

This program is a simulation of a day's traffic flow through a polling place. It is based on simple queueing theory and data from research projects regarding polling places and elections. Each time you run the simulation, voters arrive randomly within a pattern based on data collected at polling places. There is randomness involved, just as in reality. But as you will see, the randomness approximates the typical daily flow found at polling places, which should give you an accurate range of possible wait times that might develop.

The basic design simulates a polling place with a number of check-in stations (or just one), where the first line will form. Then there is a number of voting stations, for which the second line will form.

For those using optical-scan ballots that require voters to run their ballots through a scanner after completion of the ballot, there is also an option to simulate lines for the scanner(s).

The simulation also has an option to handle same-day registration stations for those states to which they apply. The simulation makes the assumption that arriving voters will be directed to a check-in station if already registered, or a registration station if not. Both of those lines will feed into the voting station line as the check-in or registration tasks are completed.
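The two-line design described above can be sketched in a few lines of Python. This is an illustration only, not the simulator's actual code: the function names, the uniformly random arrival times, the exponentially distributed service times, and the 13-hour (780-minute) day are all assumptions made for the example.

```python
import heapq
import random

def simulate_stage(arrivals, stations, mean_minutes, rng):
    """Serve voters first-come-first-served at `stations` identical stations.

    arrivals: sorted arrival times, in minutes after the polls open.
    Returns (waits, departures), both in arrival order.
    """
    free_at = [0.0] * stations            # when each station next becomes free
    heapq.heapify(free_at)
    waits, departures = [], []
    for t in arrivals:
        station_free = heapq.heappop(free_at)   # earliest available station
        start = max(t, station_free)
        # Assumption: service times are exponentially distributed.
        finish = start + rng.expovariate(1.0 / mean_minutes)
        heapq.heappush(free_at, finish)
        waits.append(start - t)
        departures.append(finish)
    return waits, departures

def simulate_day(voters, checkin_stations, voting_stations,
                 checkin_minutes, vote_minutes, day_minutes=780, seed=1):
    """Return (average check-in wait, average voting-station wait) in minutes."""
    rng = random.Random(seed)
    # Assumption: arrivals spread uniformly over the day (no arrival pattern).
    arrivals = sorted(rng.uniform(0, day_minutes) for _ in range(voters))
    checkin_waits, checked_in = simulate_stage(
        arrivals, checkin_stations, checkin_minutes, rng)
    # Voters join the second line in the order they finish checking in.
    vote_waits, _ = simulate_stage(
        sorted(checked_in), voting_stations, vote_minutes, rng)
    return sum(checkin_waits) / voters, sum(vote_waits) / voters

checkin_wait, voting_wait = simulate_day(600, 2, 6, 1.0, 4.0)
print(f"check-in: {checkin_wait:.1f} min, voting: {voting_wait:.1f} min")
```

Scanner and registration lines would be further stages of the same kind, fed by the departures of the stage before them.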

Critical Data

You will quickly discover that waiting times are surprisingly sensitive to the average amount of time to check in and the average time to complete a ballot.

As you know, time to complete a ballot depends greatly on the length and make-up of the ballot. The voting technology used also has a minor impact. We can only provide a starting point for ballot completion time; you must provide a reasonably accurate estimate for each election for this simulation to give you accurate estimates of potential waiting times.

Likewise, the average amount of time to check in depends on the process used and the people staffing that step. When setting this value, do not assume that every voter will check in smoothly. Don't use the amount of time it ought to normally take. The average that you provide must include those who, for whatever reason, take extra time at the check-in station. If one out of every ten voters takes a couple of extra minutes to check in because of some issue, that will affect the process, your average, and the waiting times.

Similarly, your average scanning time (for those who use that option) should account for the scans that do not go smoothly. Don't just use a number that represents how long it should take if all goes well, because seldom does all go well for every person.

Lastly, your average times must account for the turnover period between voters. If 30 seconds elapse between the time one person completes a ballot and the next person settles in at the same voting station and begins voting, those 30 seconds MUST be part of the average that you use.
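As a worked example of the advice in this section, the short Python sketch below folds the occasional slow voter and the turnover period into one average. The function name and the sample numbers (a 1-minute typical check-in, one voter in ten taking 2 extra minutes, 30 seconds of turnover) are assumptions chosen for illustration.

```python
def effective_average(typical_minutes, slow_fraction, extra_minutes,
                      turnover_seconds):
    """Average minutes per voter, including slow cases and turnover."""
    return (typical_minutes
            + slow_fraction * extra_minutes   # extra time, averaged over everyone
            + turnover_seconds / 60.0)        # gap before the next voter starts

# 1.0 + 0.1 * 2.0 + 0.5 = 1.7 minutes, not the "ideal" 1.0
print(round(effective_average(1.0, 0.1, 2.0, 30), 2))
```

In this example the value to enter is 1.7 minutes, 70% higher than the time a smooth check-in takes.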

Simulating All Your Precincts

When you upload a list of all your precincts (from an Excel spreadsheet or a similar source), along with turnout estimates and resource deployment, you can run this simulation across all precincts at once to see which have potential problems. A new table will appear at the bottom of the page with stats for each precinct, which can then be sorted on any field and on which adjustments can be made. See the Load your own data section of this help page for details.

Feedback, Suggestions, and Critiques

This simulation was written by Mark Pelczarski. Questions, comments, suggestions, and pleas for help are welcome. E-mail Mark here.

The Five Key Inputs

Above the simulation graph you will see inputs for Expected Voters, Check-in Stations, and Voting Stations. Below the graph are inputs for Average Minutes to Check in and Average Minutes to Vote. These are the 5 values that will have the greatest effect on the waiting times shown by the simulation.

Expected Voters is the number of voters expected to vote in person that day. Absentee and early voters should not be included in this number. Although designed by default for election-day voting, potential waiting times during early-vote days can also be simulated if you can estimate the turnout for the day in question.

The number of check-in stations and the number of voting stations are your inputs for the resources you deploy. For our purposes, the way lines may be split for check-in stations won't matter much; the average wait times will only differ when a polling place is relatively quiet and some check-in stations are open while others have a short wait. During busy times when all check-in stations are in use, the average waiting time is not affected.

Minutes to Check in and Minutes to Vote are critical inputs for which you should supply realistic values. It is very worthwhile to take some sample times if you are able to do so prior to an election, once the ballot is known. If you are able to research and document your own check-in and voting times, it would be very useful if you could share your results (with a sample of your ballot for the voting time) so we can perhaps refine this simulation with more predictive data. Remember to include your "turnover time" in your averages, that is, the time between one person leaving a station and the next person arriving and starting the process. Another way to look at it is that the average should cover the entire time from one person leaving a station through the next person leaving that station.

What You See on the Graph

Each time you run the simulation, you will see the resulting average, maximum, and minimum wait times throughout the day. The graph shows the time of day, from left to right. The height of the graph shows the waiting-time results. The yellow area in the graph is the waiting time for the check-in stations. The violet area is the waiting time for the voting stations. The total height of the graph at any time of day is the average total waiting time for people who arrive at that time. (The solid line is the average; the dotted lines show the shortest and longest waits. See the Settings section of this help document for more explanation.)

To see less yellow, add check-in stations or decrease the average check-in time. To see less violet, add voting stations or decrease the average time to complete a ballot.

It's a Simulation

Instead of a straight mathematical computation which gives consistent results, this simulation gets a little closer to reality by randomizing voter arrival within certain constraints each time you run it. If you click the Run Simulation button again without changing any values, you will get slightly different results each time. That's because of the random factor, and that's more like reality. Simply put, if just two people arrive 5 minutes apart, neither will have to wait and the average wait time for those two people is zero. But if those same two people arrive at exactly the same time (and there's just one check-in station), one of those two will wait, and thus the "average" wait time is longer.
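The two-voter example above can be checked with a small Python sketch. It assumes one check-in station and a fixed 2-minute check-in; the 2-minute figure is an assumption, since the text gives no service time.

```python
def average_wait(arrivals, service_minutes):
    """Average wait at a single station serving voters in arrival order."""
    free_at = 0.0      # time the station next becomes free
    total_wait = 0.0
    for t in sorted(arrivals):
        start = max(t, free_at)
        total_wait += start - t
        free_at = start + service_minutes
    return total_wait / len(arrivals)

print(average_wait([0, 5], 2))   # 5 minutes apart: nobody waits -> 0.0
print(average_wait([0, 0], 2))   # same moment: one waits 2 minutes -> 1.0
```

The same two voters and the same station give different averages purely because of when they arrive.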

The dotted lines on the graph for the minimum and maximum wait times give you an idea of the range of possible outcomes with the same data.

Scanners

If your polling place has optical scanners that voters must use after completing the ballot at their voting station, you can also simulate scanner lines by providing the number of scanners available and the average time it takes to scan a ballot (including turnover time from one voter to the next!).

At the bottom of the graph, under the "Run Simulation" button, click the "Scanner lines" checkbox. Now, above the graph you will see an input for the number of scanners available and below the graph you'll see an input for the "Average number of minutes to scan". The graph will then show waiting time for all three stations next time the simulation is run.

Same-day Registration

If same-day registration is allowed in your state, check the "Same-day Registration" box under the "Run Simulation" button at the bottom of the graph. New input choices will appear above the graph for the number of registration stations and below the graph for the average time to register.

This simulation assumes that already-registered voters will wait in the regular check-in lines and that those who need to register will wait in a separate line for the registration station(s). Then it assumes that BOTH lines will feed into the voting station line. That is, we assume the registration station also serves as a check-in station for those voters, and after registering they will proceed to vote and will NOT get into the regular check-in lines.

In the results, the average check-in time (yellow area on the graph) will be the average waiting time for ALL voters, regardless of whether they waited in the regular check-in line or waited in the registration/check-in line. A separate red line will appear on the graph showing the average wait for registration. The way to interpret that is if the red line is above the yellow area, people are waiting longer in the registration lines than in the regular check-in lines, and that is pulling up your total average check-in wait. Adding a registration station will help. But if the red line is below the top of the yellow area, that means the average check-in wait is NOT due to registration lines; instead the regular check-in waits are longer than the registration waits, so you should add check-in stations.
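The rule of thumb above can be put into numbers with a small Python sketch. It assumes the combined (yellow) figure is an average weighted by the number of voters in each line; the function names and the sample figures are hypothetical.

```python
def combined_checkin_wait(regular_voters, regular_wait,
                          registering_voters, registration_wait):
    """Voter-weighted average wait across both kinds of line (minutes)."""
    total = regular_voters + registering_voters
    return (regular_voters * regular_wait
            + registering_voters * registration_wait) / total

def suggestion(combined_wait, registration_wait):
    """Red line above the yellow area points to the registration line."""
    if registration_wait > combined_wait:
        return "add a registration station"
    return "add check-in stations"

# 400 voters wait 3 minutes to check in; 100 wait 11 minutes to register.
yellow = combined_checkin_wait(400, 3.0, 100, 11.0)   # (1200 + 1100) / 500
print(yellow, suggestion(yellow, 11.0))
```

Here the combined average is 4.6 minutes while the registration wait is 11 minutes, so the registration line is the one raising the average.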

Minor Inputs

You can set the times that the polls open and close. This will compress or spread out the arrivals, and thus have some effect on the waiting times.

Below the graph, you can select an Arrival Pattern. These patterns were derived from arrival data sampled at numerous polling places during several election cycles. (See the Settings section of this help document for how you can view the arrival patterns on the graph.) Unless you use the "Saturday: steady" pattern, which perhaps might describe a Saturday during early voting, the default "Composite" pattern gives a good indication of potential wait times, although the time of day that waiting develops could vary with the arrival pattern of each particular polling place.

Early arrivals is the percentage of the day's voters who will be already waiting in line when the polling place opens for the day. This is adjusted separately from the arrival pattern. The default used for all arrival patterns is 1.2, which means 1.2% of the day's voters. The waiting times reported for early arrivals only include the amount of time those voters wait in lines after the polling place actually opens.

The Cannot check in value describes a percentage of voters who are unable to check in for some reason, and thus do not move on to the voting queue. Many of these voters will typically be given provisional ballots. The default value used is 2.0, meaning 2%. This simulation currently uses this number only to decrease the waiting times for voting stations, since these voters will not wait in the subsequent lines. Although voters with issues that might prevent checking in can slow down the check-in lines, it is assumed that your "average minutes to check-in" already accounts for these occasional delays.
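To make the two percentages concrete, the sketch below applies the defaults (1.2 and 2.0) to a hypothetical precinct of 1,000 expected voters. The function name and the rounding to whole voters are assumptions; the simulator may handle fractions differently.

```python
def split_voters(expected, early_pct=1.2, cannot_check_in_pct=2.0):
    """Apply the Early arrivals and Cannot check in percentages to a turnout."""
    early = round(expected * early_pct / 100)
    blocked = round(expected * cannot_check_in_pct / 100)
    return {
        "waiting_at_open": early,               # already in line when polls open
        "cannot_check_in": blocked,             # wait to check in, never reach voting
        "reach_voting_line": expected - blocked,
    }

print(split_voters(1000))
# {'waiting_at_open': 12, 'cannot_check_in': 20, 'reach_voting_line': 980}
```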

The Simulate Walk-Offs checkbox adds a feature that "predicts" a possible number of voters who see that a line of some length has formed and choose to leave rather than wait. The longer the line gets, the more potential voters will walk off instead of waiting. These projections are based on results of Marist polls that asked voters how long they'd be willing to wait in line to vote prior to recent general elections. Since these numbers are based on what people said vs. observations of what they actually did (which is probably impossible to accurately measure), the "Walk-off" predictions should be taken in context: they might indicate the potential of a serious problem, but they should not be taken as an accurate prediction of lost voters.

Looking at the Simulation Settings

If you want to dig deeper into how this simulation works, at the bottom of the page click on Show Simulation Settings and more options will appear just below that. Here's where you can see a little more of the nitty-gritty of how this operates.

Number of Repetitions is how many times the simulation actually runs through a day randomly before giving you its results. It simulates the same voting day with the same input data 20 different times (by default) with 20 random sets of arrivals before computing the averages and telling you the maximum and minimum waits that it found. If you change "20" to "1" and click the Run Simulation button over and over, you'll see just a single attempt at the simulation each time, and the results will vary more. If you change it to "100", the computer might take a little longer running through 100 simulations of the same day, but your average will be more accurate and your minimum and maximum values will range further.
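The effect of the repetition count can be illustrated with a toy Python sketch. It is not the simulator's code: it assumes a single station, a fixed 2-minute service time, 300 voters arriving uniformly over a 780-minute day, and hypothetical function names.

```python
import random
import statistics

def one_day_average_wait(rng, voters=300, day_minutes=780,
                         service_minutes=2.0):
    """Average wait for one randomly generated day at a single station."""
    arrivals = sorted(rng.uniform(0, day_minutes) for _ in range(voters))
    free_at = 0.0
    total_wait = 0.0
    for t in arrivals:
        start = max(t, free_at)
        total_wait += start - t
        free_at = start + service_minutes
    return total_wait / voters

def summarize(repetitions, seed=42):
    """Return (mean, minimum, maximum) of the daily averages."""
    rng = random.Random(seed)
    results = [one_day_average_wait(rng) for _ in range(repetitions)]
    return statistics.mean(results), min(results), max(results)

for n in (1, 20, 100):
    mean, low, high = summarize(n)
    print(f"{n:3d} repetitions: mean {mean:.2f}, range {low:.2f} to {high:.2f}")
```

With one repetition the minimum, mean, and maximum coincide. Because the seed is fixed, the 100-repetition run includes the same 20 days as the 20-repetition run, so its range can only stay the same or widen.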

The Show Range checkbox turns off and on those dotted gray lines that display the minimum and maximum results. A less-cluttered graph is prettier, but it tells you less.

Exploring Arrivals

Lines are formed when a lot of people arrive around the same time of day. If they'd only cooperate and space themselves out evenly throughout the day, our lives would be much simpler. But they don't.

Check Show Arrival Pattern to see a new gray line through the middle of your graph that displays the arrival pattern used from the Arrival Pattern choice given above and discussed earlier. ("Composite" was the default.) The right side of your graph shows a new set of numbers for that gray line. Those percentages are the percentage of voters who show up during each 15-minute interval using this pattern. If you're looking at the "Composite" pattern, you'll see that about 3% of the day's voters are expected to arrive around 5:30-5:45pm. Likewise, slightly more than 3% are likely to show up around 7:15am, whereas only 1.25% show up around 8:30am in this same pattern. This doesn't mean that those are the exact percentages that will occur when you run the simulation. Those likelihoods inform the randomness, but the arrivals are still random.

To get an even more accurate picture of what's going on, click Show Arrivals Used and set the Number of Repetitions to "1", then click the Run Simulation button. A new green line appears that shows the times that our simulated voters actually arrived. The graph of that green line looks a lot more like the data that was collected while recording actual arrivals at polling places, which is to say that real people don't conveniently follow our exact mathematical models. Click Run Simulation several more times and watch how that green line changes. This is what we're actually feeding into the simulation to make our "wait time" predictions.

But notice also how the green line somewhat approximates the gray line that is the pattern we're using. That gray line was derived from actual data that looked like all those green lines you just clicked through. Change Number of Repetitions back to "20" and click Run Simulation again and you'll see that after 20 repetitions the aggregate random arrivals approximate our pattern more closely.

And if all this random stuff bothers you, click Arrivals Follow Pattern and then even with only 1 repetition you'll see that our simulated voters behave exactly like the pattern predicts.
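The difference between random arrivals and arrivals that follow the pattern exactly can be sketched as below. The four-interval pattern is a made-up toy, not one of the simulator's patterns (which use 15-minute intervals across the whole day), and the function names are hypothetical.

```python
import random
from collections import Counter

def random_arrivals(voters, pattern, rng):
    """Draw each voter's interval at random, weighted by the pattern."""
    picks = rng.choices(range(len(pattern)), weights=pattern, k=voters)
    counts = Counter(picks)
    return [counts[i] for i in range(len(pattern))]

def exact_arrivals(voters, pattern):
    """Give each interval exactly its share (the 'follow pattern' idea)."""
    total = sum(pattern)
    # Rounding may leave the total a voter or two off for other inputs.
    return [round(voters * share / total) for share in pattern]

toy_pattern = [40, 10, 20, 30]     # percent of voters per interval (made up)
rng = random.Random(7)
print(exact_arrivals(200, toy_pattern))        # [80, 20, 40, 60]
print(random_arrivals(200, toy_pattern, rng))  # near the pattern, not equal to it
```

Running `random_arrivals` repeatedly and adding up the counts gives totals that drift toward the exact shares, which is the behavior the gray and green lines show.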

Using your own data

If you create a table in an Excel spreadsheet (or similar application) with columns for precinct names, turnout estimates, number of check-in stations, and number of voting stations, you can then save that spreadsheet in a format that you can load right into the simulation, and then run the simulation across all precincts at once.

When you do so, a new table appears at the bottom of the simulation page with every precinct in your list and predicted wait times and possible walk-offs for each. You can adjust the number of check-in stations and voting stations for each and get immediate feedback about the potential results. You can then download the alterations and feed them back into Excel or another program to remember your settings and adjustments.

Creating the data

We'll talk about the Excel spreadsheet application here, but any program that can create "tab-delimited text files" or "CSV files" can be used.

The first row of your spreadsheet will be headings that MUST match the names we list here, because the simulation will look for exactly those headings when it reads the file you load in. The order of the columns does not matter at all. The headings themselves are used for sorting your data into the right place.

The only required heading and column is "Precinct". In this column you will list the names or designations you want to use for each precinct you'll want to run in the simulation. Please avoid commas in those names, since commas will mess up our use of the CSV file format.

Each of the other columns is optional, although if omitted we'll add default values that you'll almost certainly want to change, so it's probably faster for you to do that in your spreadsheet. Note also that the columns "Registration" and "Scanners" have special meaning that cannot be altered in the simulation. If you plan to use the same-day registration and/or scanning stations options with your data set, you must include these columns in your spreadsheet.

An important note is that you CAN include additional columns in your spreadsheet if you want to generate some of the data using spreadsheet formulas. Any column headings you use that are not listed below will simply be ignored. So if it is easy to generate a column for "Registered Voters" (which is not listed below), and you estimate voter turnout at 60% for an election, the "Turnout" column below can just contain the formula for Registered Voters * 0.6 for each row. Similarly, you can use formulas in columns such as "Voting Stations" and "Check-in Stations" to set some initial values.

Here is the complete list of column headings you can provide. Capitalization, spaces, and hyphens don't matter. Spelling does.

County: "County" is for grouping precincts by county when using the simulation on a larger scale. However, it can also be used to create named subsets of your precincts if you have many and wish to work with smaller sets. Each precinct that you want grouped together should have the same county name. In the simulation a "county" list will appear, and when you select a county only the precincts within that county will be shown in the precinct list and in the table at the bottom of the page.
Precinct: This column will contain the identifiers you will use for each precinct or vote center. It is the only required column, but it's strongly suggested that you include as many of the others as pertain.
Turnout: "Turnout" is the number of in-person voters expected on the election day. You should not include absentee and early voters in this number, nor should this be the total number of registered voters. You can also use "Turnout" to simulate arrivals on an early-voting day.
Registration: Use this if and only if your state allows same-day registration on the day you plan to simulate. The existence of this column will set the same-day registration option to true for all your precincts. If you omit this column, the option will be set to false for all precincts and you will not be able to change it in the simulation. Values in this column will be the estimated numbers already included in "Turnout" above who will also need to register on election day. For example, if you expect 500 voters on election day, and 100 of those 500 will need to register, put 500 in the "Turnout" column and 100 in the "Registration" column. (You might also use a formula here, such as "Turnout" * .2 for 20%)
Check-in Stations
Voting Stations
Registration Stations: These columns should contain the number of each that you plan to deploy in each precinct. "Registration Stations" is only applicable if you are using the same-day registration option and have included the "Registration" column described above. If you do not supply columns with these headings, we'll use our own calculation to suggest a starting point for what you might want to supply.
Scanners: Use this if after filling out a ballot there is another station where each voter must scan the ballot to complete the voting process. If you include this column, the option can vary by precinct. The values in this column will be the number of scanning stations (usually just 1). However, you can set this to 0 (zero) or leave this entry blank for a precinct, either of which designates that the precinct does not use scanners. This column sets the "uses scanners" option to true or false for each precinct, and the option cannot be changed in the simulation. However you will be able to change the number of scanners for those precincts that use them. Omission of this column sets "uses scanners" to false for all precincts.
Minutes to Check In
Minutes to Vote
Minutes to Register
Minutes to Scan: It is strongly recommended that you include these columns with reasonable estimated averages for each of the applicable times. The default times in the simulation are only broad estimates. Your times will almost certainly vary from the defaults we provide, and reasonable estimates for each of these averages are essential for any kind of reliable results in this simulation. If you doubt that, run the simulation with all the default values, then try slight adjustments to the average minutes to vote and watch the changes; they can be drastic. Different ballot lengths can have a significant effect on "Minutes to Vote", so this number may vary among groups of precincts. "Minutes to Register" and "Minutes to Scan" are only pertinent if you are using the same-day registration and "scanners" features. Please enter your values in minutes and fractions thereof. For example, don't use "4:30" for "4 minutes and 30 seconds". Instead, use "4.5".
Arrival Pattern: This option lets you change a precinct from using the default "Composite" arrival pattern to any of the other 9 patterns listed in the simulation. If used, the value in this column should simply be a number from 0 to 9, corresponding to the position of the desired pattern in the simulation's list. 0 is the "Composite Pattern", 1 is "Early, Mid-Morning, After Work", and so on. This is not a critical option, as the pattern is not as important in the simulation as other factors in showing what might happen during the largest surge, no matter what time of day that may be.
Early Arrival: Although part of the overall arrival pattern at a polling place, this is adjusted separately and tells the percentage of the day's voters who will already be waiting in line when the polls open for the day. If unchanged, this percentage defaults to 1.2 (meaning 1.2%) for all arrival patterns. Enter this value as a percent but omit the % symbol, such as '5' for 5%.
Cannot Check In: This is the percentage of voters who wait in the check-in line, but are unable to check in to vote at one of the voting stations. Many of these voters are offered provisional ballots, but they will not wait for a voting station and thus will not influence those lines. The default is 2.0, meaning 2%. Provide a percentage, but omit the % symbol.
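Two of the rules in the list above lend themselves to a short Python sketch: headings are matched without regard to capitalization, spaces, or hyphens, and times must be decimal minutes. The helper names are hypothetical, and the simulator's own matching code may differ; this only illustrates the stated rules and may help when preparing a file.

```python
def normalize_heading(heading):
    """Reduce a heading to a form that ignores case, spaces, and hyphens."""
    return heading.lower().replace(" ", "").replace("-", "")

def to_decimal_minutes(text):
    """Convert 'm:ss' (for example '4:30') to decimal minutes (4.5)."""
    if ":" in text:
        minutes, seconds = text.split(":")
        return int(minutes) + int(seconds) / 60
    return float(text)      # already decimal minutes

print(normalize_heading("Check-in Stations"))    # checkinstations
print(normalize_heading("check in stations"))    # checkinstations
print(to_decimal_minutes("4:30"))                # 4.5
```

Spelling still has to match: "Checkin Station" (singular) normalizes to a different string and would not be recognized under this rule.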

Saving your spreadsheet to use in the simulation

When you've completed your spreadsheet, save it once in regular Excel format if you've used formulas and you want to be able to edit it with those same formulas. Then choose "Save As" followed by "Save As Type" and select "Text (Tab delimited)". Alternatively, you can use "CSV (Comma delimited)", but only if you've managed to avoid using commas in your data.
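If your precinct list lives somewhere other than a spreadsheet, a tab-delimited file with the same headings can also be produced by a short script. The Python sketch below is one way to do it; the precinct names, station counts, and 60% turnout rate are made-up sample values, and the function name is hypothetical.

```python
import csv
import io

def build_precinct_file(rows, turnout_rate=0.6):
    """Return tab-delimited text with a heading row and one row per precinct.

    rows: (precinct, registered voters, check-in stations,
           voting stations, minutes to vote)
    """
    out = io.StringIO()
    writer = csv.writer(out, delimiter="\t", lineterminator="\n")
    writer.writerow(["Precinct", "Turnout", "Check-in Stations",
                     "Voting Stations", "Minutes to Vote"])
    for name, registered, checkin, voting, minutes in rows:
        # Turnout estimated as registered voters times the turnout rate.
        writer.writerow([name, round(registered * turnout_rate),
                         checkin, voting, minutes])
    return out.getvalue()

text = build_precinct_file([("Ward 1", 1500, 2, 6, 4.5),
                            ("Ward 2", 820, 1, 4, 4.5)])
print(text)
```

To save the result, write `text` to a file opened with `open("precincts.txt", "w", newline="")`; the file name is up to you.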

Load your data into the simulation

Near the top of the simulation there is a Load Precinct Data button. Click it. Choose the tab-delimited or CSV file that you saved. Click Okay.

If all went correctly, you should now see new County and Precinct lists at the top of the simulation. If you select a precinct from the list, the values for that precinct should appear from the data you provided. Remember to set Minutes to Check-in, Minutes to Vote, Minutes to Register, and Minutes to Scan if you did not provide these in your spreadsheet. The values you set become the defaults for each precinct in the data for which these values were omitted in the upload.

Precinct data table

Scroll to the bottom of the simulation and you'll see a new choice, Edit Precinct Data. Click that, and a table will appear below with a list of all your precincts, and columns for check-in stations, voting stations, peak wait time, average wait time, and walk-offs. If your data enables same-day registration or includes scanners, columns for registration stations and scanners will be included also. Above the table is a button that says Compute Wait Times. Click that and the simulation will be run for each precinct in your data set, and the peak and average wait times will be shown for each. At the top of the table you can click the Precinct, Peak Wait, Avg. Wait, or Walkoff column headings to sort the table by the selected column.

You can also edit the numbers in the Check-in, Voting, Registration, and Scanner station columns to recompute the wait times for that row with the new values. These changes will be remembered, and you can click the Download Current Data button at the top of the simulation to save your changes back to a CSV file that will load right back into Excel.

Note that changes made to Check-in Stations, Voting Stations, Registration Stations, and Scanners above the simulation graph (near the top of the page) are exploratory only and are NOT recorded in the precinct table at the bottom. If you want to save new values for check-in, voting, registration, and scanning stations, you should change the values in the table instead.