For this topic area, post your best ideas on Data.gov and its capabilities, including data management, dissemination, search, semantic web, evolving core modules (shared services), data infrastructure, visualization tools, and more.

Solution Architecture

Build Data Catalogs in the Cloud in Support of Data.gov & EPA's Strategic Data Action Plan

My Data Science Class Project (see http://federaldata.wik.is/Data_Science) is to re-architect Data.gov so that it responds to the IdeaScale Comments, is reusable, and is interoperable (searchable) with other data catalogs (international and sub-national). I am following an architectural pattern that starts with spreadsheets, uses a wiki for the metadata with well-defined URLs, provides analytics/visualizations, and lays ...more »

1 vote

Solution Architecture

HTML 5 compatibility

This could be used as an alternative to Adobe Flash because of Flash's inherent disadvantages (lack of support on current phones/tablets, excluding Android 2.2 devices, and major stability and performance issues). This might also allow specific apps to be made for the different mobile operating systems, which, as we have already seen with the expansion of the app stores, breeds innovation and ideas that no one else ...more »

1 vote

Solution Architecture

Faceted Search with Atom Based Web Services

Data.gov should have a faceted search interface that provides a comprehensive overview of Data.gov content (metadata and facet counts), and a way for users to progressively refine their search criteria. Faceted search provides a good way for outsiders to better understand a big collection described by complex metadata structure. The faceted metadata should also be shared as an Atom Feed, so that updates of new content ...more »

4 votes
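
As a rough illustration of the faceted search idea above, the sketch below computes facet counts over a small in-memory catalog and narrows the records by a chosen facet value. The field names (`agency`, `format`) and the sample records are hypothetical, not Data.gov's actual metadata schema, and the Atom feed part of the proposal is not covered here.

```python
from collections import Counter


def facet_counts(records, fields):
    """Count how many records carry each value of each facet field."""
    counts = {field: Counter() for field in fields}
    for record in records:
        for field in fields:
            value = record.get(field)
            if value is not None:
                counts[field][value] += 1
    return counts


def refine(records, **selected):
    """Keep only the records that match every selected facet value."""
    return [
        record
        for record in records
        if all(record.get(field) == value for field, value in selected.items())
    ]


# Hypothetical catalog entries, for illustration only.
catalog = [
    {"title": "Air quality", "agency": "EPA", "format": "CSV"},
    {"title": "Water quality", "agency": "EPA", "format": "XML"},
    {"title": "Employment", "agency": "DOL", "format": "CSV"},
]

overview = facet_counts(catalog, ["agency", "format"])
csv_only = refine(catalog, format="CSV")
```

A user would first see `overview` (how many records each agency or format has), pick a value, and then see the counts recomputed over the refined list, which is the progressive narrowing the idea describes.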

Solution Architecture

Remove and Guard Against (Validate) "Junk" Records

I tried to download a dataset that interested me called the "Occupational Outlook Handbook". The data.gov "record" says that it is a CSV dataset. However, when you click the csv link you go to a website that does not allow you to download the dataset! The data.gov link is: http://www.data.gov/details/336 This is a failure of the simplest validation possible - a link that is supposed to be to a dataset must be tested ...more »

7 votes

Solution Architecture

Don't Inflate Counts! (aka let a Dataset contain multiple files)

When searching for interesting datasets it can be very frustrating to see identical datasets (except by year) cluttering the search results. To me, this seems like just an attempt to inflate the counts by treating separate years or geography as separate datasets. Here is my proposed simple rule: If the dataset metadata fields (i.e. like "Coverage Date") are the same, it is the SAME dataset. Just because the instance ...more »

10 votes
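
One possible reading of the rule proposed above is that records which agree on every metadata field except instance-specific ones (such as the year covered) belong to a single dataset with multiple files. The sketch below groups records on that basis. The choice of instance-specific fields and the sample records are assumptions for illustration, since the full rule is truncated in the listing.

```python
from collections import defaultdict

# Assumption: fields that vary between files of the same dataset.
INSTANCE_FIELDS = {"coverage_date", "geography", "url"}


def group_into_datasets(records):
    """Group records whose remaining metadata fields are identical."""
    groups = defaultdict(list)
    for record in records:
        key = tuple(
            sorted(
                (field, value)
                for field, value in record.items()
                if field not in INSTANCE_FIELDS
            )
        )
        groups[key].append(record)
    return list(groups.values())


# Hypothetical records, for illustration only.
records = [
    {"title": "Toxics Release Inventory", "agency": "EPA", "coverage_date": "2007"},
    {"title": "Toxics Release Inventory", "agency": "EPA", "coverage_date": "2008"},
    {"title": "Occupational Outlook Handbook", "agency": "DOL", "coverage_date": "2008"},
]

datasets = group_into_datasets(records)
```

With this grouping, search results would list two datasets rather than three, and the first would carry two files.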

Solution Architecture

Link / Status Checker

Via a blog post, http://www.spatiallyadjusted.com/2010/02/07/data-gov-is-already-broken-just-like-everything-before-it/ - it would be good to integrate periodic link/service checking, ala Geospatial One Stop's Service Status Checker - http://registry.fgdc.gov/statuschecker/index.php. Any status changes/outages should be reported via notification, e.g. RSS feed / email, and directed to the stewards and registrars managing ...more »

6 votes
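
The notification part of the idea above, reporting only status changes rather than every check result, could be sketched as a comparison between two rounds of checks. The status labels and URLs below are hypothetical, and how each status is obtained or how the notice is delivered (RSS, email) is left out.

```python
def status_changes(previous, current):
    """Return (url, old_status, new_status) for links whose status changed."""
    changes = []
    for url, new_status in sorted(current.items()):
        old_status = previous.get(url)
        # Links seen for the first time have no earlier status to compare.
        if old_status is not None and old_status != new_status:
            changes.append((url, old_status, new_status))
    return changes


# Hypothetical results from two periodic check runs.
last_run = {"http://example.gov/a.csv": "up", "http://example.gov/b.csv": "up"}
this_run = {"http://example.gov/a.csv": "up", "http://example.gov/b.csv": "down"}

to_notify = status_changes(last_run, this_run)
```

Each tuple in `to_notify` would then be routed to the steward or registrar responsible for that record.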

Solution Architecture

Make RSS Feeds Available

This entry is a consensus recommendation of seven organizations that work on government transparency of which OpenTheGovernment.org is one.

Consider making some of the datasets available as feeds that are constantly up to date, rather than as static datasets that are pulled down and then reposted on an occasional basis.

11 votes

Solution Architecture

Make web-based interfaces available for public use

This entry is a consensus recommendation of seven organizations that work on government transparency of which OpenTheGovernment.org is one. Some of the currently posted files are quite large, ranging upward to several hundred megabytes. Their large size undermines their usefulness for most people or organizations. The large number of currently posted datasets also makes it difficult to find a particular database of ...more »

7 votes

Solution Architecture

Balance the format of data for developers and the public

This entry is a consensus recommendation of seven organizations that work on government transparency of which OpenTheGovernment.org is one. The format of the data plays a key role in its usability; many within the community of advocates who re-use and repackage government data would prefer data in CSV format, rather than the XML format in which many of the posted databases are provided. Accordingly, we recommend that ...more »

5 votes

Solution Architecture

Facilitate Access to Agency Web Services via Aggregation

In addition to providing agency metadata and facilitating informed discovery of datasets, Data.gov could also serve as a clearinghouse for discovering and facilitating access to data-centric agency web services, e.g. Open Geospatial Consortium Web Map Service (WMS) and Web Feature Service (WFS). In many instances, these types of services may lie scattered across and within agencies, ...more »

23 votes

Solution Architecture

Explore an effective organizational structure for public service

The federal government provides various services to the public across the United States. Any service involves collaboration between individuals from various offices, disciplines, technologies, and responsibilities. We need to identify an ideal organizational structure that facilitates the collective workforce's efforts to improve service to the public in the most economical way.

2 votes

Solution Architecture

Data sets should be extensible like XBRL or RDF/OWL

Data sets should be extensible or flexible, similar to the characteristics of XBRL or RDF/OWL, rather than fixed schemas. This flexibility allows the data sets to evolve and allows others to connect additional information to existing information, which is the notion of "linked data" as used by the Semantic Web community. Both XBRL and RDF/OWL are modeled as graphs, which are extremely flexible. Combine these graphs with the ...more »

7 votes

Solution Architecture

Data Enhancement/Manipulation Capabilities

Provide basic capability for data enhancement/manipulation/packaging. Here, the idea would be to provide reusable infrastructure that data stewards can use for data enhancement, and potentially conversion and packaging. Sample use case: a data steward uploads an Excel spreadsheet containing his data, which also includes addresses, and then uses tools provisioned by Data.gov to geocode the dataset to ...more »

8 votes

Solution Architecture

Data Hosting for National Datasets

Have Data.gov serve as a platform for serving national data. In addition to serving as a catalog, provide actual data hosting capability, e.g. hosting national framework data layers to support a wide variety of mapping (ala The National Map). Additionally, this may be of particular value to support collaborative and cross-agency efforts such as Imagery For The Nation – Imagery For The Nation (IFTN) is an attempt to ...more »

18 votes

Solution Architecture

Data.gov Dashboard - Use FEA BRM/SRM to model dashboard measures

I would like to recommend that the Data.gov dashboard be developed (aligned) with the Federal Enterprise Architecture (FEA) Business Reference Model (BRM) and Service Component Reference Model (SRM). The BRM provides a view of the federal government’s core lines of business (the products and services it delivers to its citizens, the private sector, and other government agencies), and the SRM classifies the internal ...more »

9 votes

Solution Architecture

Add collaborative/social aspects to data.gov

In addition to the search options currently on data.gov and the spatial search capabilities discussed in other ideas, I suggest considering a third alternative: ask-a-friend. Like any forum, I post a question (for a specific dataset/service) and other users of the site may help me by pointing me to it. I could share my favorite (bookmarked) datasets with others or browse through their bookmarks to find data someone found ...more »

14 votes

Solution Architecture

Solution Architecture Innovation Lab (SAIL)

To establish an open and inclusive COTS & Open Source self-vetting "lab" where innovators of the market can engage in exploring the realm of the possible with prospective govt programs before they go to RFP. This would allow PMs and users to better articulate the capabilities of the market in a collaborative forum and virtual lab. The "evidence" would come from a network of non-profits whose combined labs would form the ...more »

-5 votes
