Transit Data Gets More Intriguing As the Way It’s Presented Improves

Adil Yalcin

Adil Yalcin said he was always frustrated that so many data-analysis programs require the user to pick the design of the data visualization. In other words, the users had to decide if they wanted to view the information as a line graph, chart, or other map.

In an effort to simplify the process, the student and member of the Human Computer Interaction Lab (HCIL) at the University of Maryland spent nearly two years creating Keshif, a data-analysis program that automatically chooses the best visualization.

“There is a lot of interest in creating visualizations easily and exploring data easily, but there is still an emphasis on the richness of our tools and offering a lot of design alternatives,” Yalcin said. “Actually all of these alternatives … create a lot of extra complexity.

600_437894810-1Originally, the tool was created to analyze any data set but when he stumbled upon the data provided by government transit organizations at a Transportation Techies data hack night, he started to look into what data was available for D.C. It turned out that PlanItMetro’s data spreadsheets were a gold mine.

“[My question was] what kind of perspectives can I offer?” Yalcin said. “The answer was immediate from the data structure: How many people got into station A and got into Station B and then look at that on the weekday, the weekend, and so on.”

One finding from his results, presented at last month’s Transportation Techies: In the thousands of trips starting from Gallery Place during midday, Union Station is the most popular destination while Farragut North stands out as the second most popular. What is driving that traffic?

After the event, he spoke with representatives of BikeArlington and created a second visualization for them using data from Capital Bikeshare.

Currently, Keshif is a compilation of visualizations that Yalcin created himself, but the goal is to allow any users the ability to take data sets they are interested in and run them through the tool to achieve the best visualization possible. He said support from his advisors Ben Bederson and Niklas Elmqvist has been key to creating a product that is as easy as possible for the non-computer scientist to utilize.

“To come to the stage where [Keshif] becomes very intuitive and natural, it requires a lot of preparing in advance and also getting feedback from a lot of people,” Yalcin said. “Both professors and professionals in the field as well as people who might be caring about the data but who might not have the visualization or design skills, or even the broader public because that is what I’m trying to appeal to.”

He plans to conduct user studies of Keshif this summer and said he would be more than happy to speak to anyone who would like to participate in the user study or who has data that they want to explore.

And exploration is key. After all, that’s what “keshif” means in Yalcin’s native Turkish.

For more, here are ridership data sets for the D.C. Metro and Capital Bikeshare.

Photo by M.V. Jantzen

Share this item

4 Comments or Mentions

3 Comment(s)

sabre23t

I had a look at Keshif.me with with DC Metro ridership, seems a bit complex to me.

It seems like looking at Prezi.com presentation full zoomed out view. You see every thing but not any specific interesting details. If the Prezi had a default presentation path, you go through the various zoom ins/outs into the interesting parts.

Perhaps Keshif.me need such “guided tour” of interesting portions of the displayed data set?

Reply
Adil

Hi sabre23t, thanks for looking at the dataset and sending your suggestions.

To make sure others can find the links easily, let me put them down first:
http://www.keshif.me/demo/dc_metro_ridership
http://www.keshif.me/demo/dc_capital_bikeshare

You are absolutely right! When you first step in to the webpage, you see the overview of all the data. There are many things that you can learn from this data, but it requires interaction. First, having some questions up-front may help you focus and learn more from the data. Or maybe, you’ll just browse different aspects and find something unexpected. It certainly will require some effort to find the value, the interesting portions.

The webpages make data explorable, but do not guide you. Actually, I’ve spent more time exploring the data then building the webpages. I want to believe that this was because how keshif brings a clear visual structure to data with the least efforts:) Have you seen the source code of the browsers on github, if you are a developer?

If you have found some interesting patterns in this data, or have comments about how it works, and what you’d like to see, don’t hesitate to contact and say hi.

Adil

Reply
sabre23t

<>
I hope you have the “guidance” portion in your longer term plans.
In meantime you might want to allow for users being able to copy/save URLs for specific views of the dataset. Since you are manipulating the views via Javascript, you might need to provide buttons to save URLs of said specific views. Enthusiastic users could then post/blog such specific views of interest.

Apologies, if such feature already exist. My network connections is a bit slow today.

Reply

Mentions

This article has been mentioned in 1 other place(s).

Leave a Comment:

Your email address will not be published. Required fields are marked *


Warning: call_user_func_array() expects parameter 1 to be a valid callback, function 'end-div' not found or invalid function name in /var/www/mobilitylab.org/htdocs/wp-includes/plugin.php on line 525