Solution Architecture

For this topic area post your best ideas on Data.gov and its capabilities, including data management, dissemination, search, semantic web, evolving core modules (shared services), data infrastructure and visualization tools and more.

Solution Architecture

Merge Geospatial One Stop and Data.gov

Geographic referencing adds critical context to data. It helps users quickly and easily determine whether a dataset pertains to their specific area of interest, and in the event that it does, empowers users by immediately allowing them to visualize that data, perhaps coupled with addtional datasets for informing context. Both Geospatial One Stop and Data.gov are citizen centric initiatives. Migrating and consolidating... more »

Voting

21 votes
Active
(@michael.daconta)

Solution Architecture

Definitions for every column

I just downloaded some energy data from data.gov on nuclear reactors and one of the columns is:

 

"NRC Unit"

 

I have no idea what that means? Every column or field of data should have a definition and that should be available on data.gov or in a standard format with the dataset. In this case, the data dictionary field that the catalog record links to does not have the definition of this field.

Voting

73 votes
Active
(@daniel.a.munz)

Solution Architecture

Systematically Track How People Are Using Data from Data.gov

There's no question that open data can provide citizens with more information and transparency about how government is operating and the results it's getting. But that's only half of the equation: We believe that open data produces not just more informed citizens, but also, ultimately, better government. To make that equation come true, though, we have to be vigilant about tracking how people are using data, and incorporating... more »

Voting

22 votes
Active
(@kohler.jim)

Solution Architecture

Super Crunching

Require agencies to submit datasets in standard format with common metadata fields, including short and long descriptions to improve user understanding of data-set. Create a user interface that enables users to easily graph multiple time-series data sets (simple trend graphs). This will let them visually compare different data sets on relative scales. By aggregating all government data into a standard format and enabling... more »

Voting

23 votes
Active
(@adriandwalker)

Solution Architecture

Social Media support for Executable English Q/A

Data by itself is necessary, but not enough, for practical applications. What's needed is knowledge about how to use the data to answer questions -- such as, "how much could the US save through energy independence?" There's emerging technology that leverages social media for the huge task of acquiring and curating the necessary knowledge -- in the form of executable English. One can Google "executable English" to find... more »

Voting

4 votes
Active
(@davidsmith)

Solution Architecture

Custom Notifications/RSS Feeds for New Datasets

An API to provide customizable RSS feeds should be considered, to allow users to subscribe to specific thematic areas, geographic areas and so on. This way, if for example someone was interested in data on aquatic resources in the Chesapeake Bay, they could subscribe and apply filters by custom search terms, by geography of interest, thematic keywords and so on. This could either provide notices via e-mail, or by... more »

Voting

47 votes
Active
(@pesticides)

Solution Architecture

New FR pesticide information too hard to access

In October 2009 EPA stopped hosting its most useful site for new Federal Register pesticide information at http://www.epa.gov/fedrgstr/EPA-PEST/index.html

 

This was a vital and highly user-friendly site. Please put it back on line as this important information is neither easily accessible or user-friendly to access.

Voting

3 votes
Active
(@louissweeny)

Solution Architecture

A micro-format tag set to describe data resources on the web

We need as many on-ramps for Agencies to get data assets represented on Data.gov as possible (see my On Ramps Idea). How about a micro-format (aka small light set of tags) that could help humans, search engines and other tools identify and aggregate data resources...this could be as simple as an adapted Dublin core set. This markup could be carried on the human documentation page, or on a separate page ala sitemap.xml.... more »

Voting

4 votes
Active
(@jimrolfes)

Solution Architecture

Include the capability tie individual data sets w/ super sets

Include the capability to align individual data sets with super sets.

 

Multiple agencies will have need for capability that enables the aggregation of data sets identified within Data.Gov. Data.Gov can be an excellent tool for linking data from separate owners (or in some cases a single owner) that conform to a consistent standard that can be used individually or combined to form a broader data set.

Voting

5 votes
Active
(@adammarcus)

Solution Architecture

Let Collaborative Tools Drive Future Contributions

The "Collaboration" section that starts on page 9 of the draft is a great start. Over at http://groups.csail.mit.edu/haystack/blog/2009/11/23/building-a-social-data-commons/ I've listed a few more. Summarized here: 1) For each data set, provide a set of developer-generated recipes for data manipulation so that future developers can stand on the shoulders of previous users of this data set by sharing tools and techniques.... more »

Voting

18 votes
Active
(@michael.daconta)

Solution Architecture

Improve Browsing of Data.gov

While there are quite a few discussions on this site about search; I would like to see improved browsing via a robust taxonomy/folksonomy of topic areas. In fact, I would recommend both a top-down/bottom-up approach where you begin with a top-down taxonomy but allow it to be extended via topic area suggestions and popular keywords. As a citizen, I don't always know what I want but want to browse and see what is available.... more »

Voting

24 votes
Active
(@ken000)

Solution Architecture

Have Datasets Divided by Geography Appear as one set in search.

Data.gov would be more usable if there was not so many entries on the catalog search page taken up by one dataset which is divided up geographically. A single row in the search results which indicated the geographical divisions of the data and allowed the user to drill down to the geographic specific files would make data.gov more usable. Similarly, different versions of a dataset release on different dates (such those... more »

Voting

8 votes
Active
(@john.weiler)

Solution Architecture

Solution Architecture Innovation Lab (SAIL)

To establish an open and inclusive COTS & Open Source self vetting "lab" where innovators of the market can engage is exploring the realm of the possible with prospective govt programs before they go to RFP. This would allow PMs and users better articulate the capabilities of the market in a collaborative forum and virtual lab. The "evidence" would come from a network of non-profits whose combined labs would form the... more »

Voting

-5 votes
Active