We at Data.gov are very interested in enabling datasets with APIs that will allow applications to use the data dynamically without downloading the entire dataset. What approaches do developers most often recommend, and can you provide examples of use cases and/or existing apps that support this need?
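One common pattern for this is a query API with server-side filtering and paging, so a client only pulls the rows it needs. Below is a minimal sketch of the client side, assuming a hypothetical endpoint: the base URL, the `rows` path, and the `limit`/`offset` parameter names are illustrative inventions, not an actual Data.gov interface.

```python
from urllib.parse import urlencode

# Hypothetical endpoint for illustration only; not a real Data.gov URL.
BASE_URL = "https://api.example.gov/datasets"


def build_query_url(dataset_id, filters=None, limit=100, offset=0):
    """Build a URL that asks for one page of matching rows from a dataset."""
    params = dict(filters or {})  # column filters, e.g. {"state": "VA"}
    params["limit"] = limit       # assumed paging parameter: rows per page
    params["offset"] = offset     # assumed paging parameter: rows to skip
    return f"{BASE_URL}/{dataset_id}/rows?{urlencode(params)}"


def page_urls(dataset_id, filters, total_rows, page_size):
    """Yield one URL per page so a client can fetch results incrementally."""
    for offset in range(0, total_rows, page_size):
        yield build_query_url(dataset_id, filters, page_size, offset)
```

An application would request pages one at a time and stop as soon as it has what it needs, instead of downloading the whole file first.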
The current Data.gov search tool is capable but cumbersome and does not yield the desired results. We should combine and leverage the capabilities of other government websites that have good search engines (and/or are planning further improvements, such as usa.gov). We should make this search available at the top of the site for all users. We should also have the ability to invoke more complex search capabilities...
Though there is another idea regarding a Wikipedia entry, I think it is important to have a separate wiki set up for developers, as there are many developer topics where collaboration is key. Of course, there is plenty of precedent for this: many major development efforts and sites have developer wikis.
In addition to posting datasets and web services, agencies should also post code and documentation. Just as the data policy is "By default, all data should be made available, unless there are compelling reasons why not, e.g. sensitivity," so too should be the case with GOTS code. A large number of parallel development efforts by contractors and agency staff alike are underway across many agencies, often replicating...
As discussed in the CONOPS, semantic.data.gov will be an adjunct, experimental site to assist in the evolution of data.gov towards greater semantics. This site will also draw on lessons learned via data.gov.uk, which is also exploiting semantic technologies. What would be the most important initial use cases that the semantic web community would like to see semantic.data.gov tackle? Ontology development? Rule-based alerts?...
There should be a way to search for data that was created in 1997.
There should also be a way to search for only static data. Likewise, there should be a way to filter for only dynamic data, that is, data that is actively being updated.
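To make the two search requests above concrete, here is a rough sketch of how a catalog filter might combine creation year with a static/dynamic flag. The `CatalogEntry` fields and the `update_mode` values are assumptions for illustration; the actual catalog metadata may be structured differently.

```python
from dataclasses import dataclass
from datetime import date


@dataclass
class CatalogEntry:
    """Hypothetical catalog record; field names are illustrative."""
    title: str
    created: date
    update_mode: str  # assumed values: "static" or "dynamic"


def search(entries, created_year=None, update_mode=None):
    """Return entries matching every filter that was supplied."""
    results = []
    for entry in entries:
        # Skip entries created in a different year than requested.
        if created_year is not None and entry.created.year != created_year:
            continue
        # Skip entries whose update behavior does not match the request.
        if update_mode is not None and entry.update_mode != update_mode:
            continue
        results.append(entry)
    return results
```

Calling `search(entries, created_year=1997)` would cover the first request, and `search(entries, update_mode="static")` the second; passing both narrows the results further.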
Allow sorting by realtime vs. static or batched data. Include more realtime feeds from science departments (satellites, geographic sensors, cameras, location data of assets, monitoring equipment). This would allow the public to, for instance, write realtime monitoring applications for earthquakes and volcanic eruptions, or process other data in realtime (like raw feeds from satellites, power stations, water quality, weather,...
The current Wikipedia Data.gov page is a great, open, neutral platform for the entire community to post links to the latest global-to-local data.gov sites and the apps that developers have built. It can become an authoritative reference for the growing open data community. Here is the link: http://en.wikipedia.org/wiki/Data.gov
Make sure the site is Section 508 accessible, since it's managed by GSA, which is responsible for 508!
All government agencies that provide data to the public should offer the data in standardized forms. This would cover both the format (CSV, XML, SQL Server, Access, Oracle, etc.) and the structure. For example, the State Energy Data System at EIA provides its annual data update in CSV, with columns for the variable name, location, date, and value. This would likely need to be modified for various applications, but if...
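As an illustration of why a standard structure helps, here is a small sketch that reads a CSV laid out as described (variable name, location, date, value) and reshapes it per location. The header names and sample rows are made up for the example and are not copied from any EIA file.

```python
import csv
import io
from collections import defaultdict

# Made-up sample in the described layout; header names and values are
# illustrative assumptions, not real EIA data.
SAMPLE = """variable,location,date,value
TOTAL_USE,VA,2007,10.5
TOTAL_USE,VA,2008,11.0
TOTAL_USE,MD,2007,7.25
"""


def load_long_csv(text):
    """Parse long-format rows into {variable: {(location, date): value}}."""
    table = defaultdict(dict)
    for row in csv.DictReader(io.StringIO(text)):
        key = (row["location"], row["date"])
        table[row["variable"]][key] = float(row["value"])
    return dict(table)


def pivot_by_location(series):
    """Reshape one variable into {location: {date: value}} for display."""
    wide = defaultdict(dict)
    for (location, when), value in series.items():
        wide[location][when] = value
    return dict(wide)
```

Because every file would share the same four columns, one loader like this could serve any dataset published in that structure.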
The cost of poor information is HUGE; over 50% of data warehouse projects fail due to poor information. No agency will provide quality information without producing quality database schemas. Very few agencies pay any attention to the quality of the schema and, as a result, (a) development and maintenance costs skyrocket, (b) applications are delivered very late, (c) performance is not ideal and (d)...
Currently users can rate a dataset's usefulness and so on, but the rating isn't very meaningful because it isn't related to what the user is looking for. Adding the ability for users to tag datasets with keywords will make it easier for people to run more specific searches. And if you allow users to associate ratings with keywords, the rating system will deliver much richer information.
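Here is one way the keyword-scoped ratings could be modeled, as a rough sketch. The `TaggedRatings` class, its method names, and the 1 to 5 scale are all hypothetical choices for illustration, not a description of how Data.gov stores ratings.

```python
from collections import defaultdict


class TaggedRatings:
    """Stores ratings per (dataset, keyword) pair rather than per dataset."""

    def __init__(self):
        # Maps (dataset_id, tag) to the list of ratings given for that pair.
        self._scores = defaultdict(list)

    def rate(self, dataset_id, tag, score):
        """Record a rating of a dataset for one keyword (assumed 1-5 scale)."""
        if not 1 <= score <= 5:
            raise ValueError("score must be between 1 and 5")
        self._scores[(dataset_id, tag.lower())].append(score)

    def average(self, dataset_id, tag):
        """Return the mean rating for the pair, or None if it has no ratings."""
        scores = self._scores.get((dataset_id, tag.lower()))
        return sum(scores) / len(scores) if scores else None

    def search(self, tag):
        """Return dataset ids tagged with `tag`, best average rating first."""
        tag = tag.lower()
        ids = {ds for (ds, t) in self._scores if t == tag}
        return sorted(ids, key=lambda ds: self.average(ds, tag), reverse=True)
```

With this shape, a search for one keyword ranks datasets by how useful people found them for that topic, which is the richer signal the idea is after.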
I am a consultant who often works with many government entities, and it would be nice to have a web service we could call to retrieve the datasets and display them in our systems.
Many commercial product catalogs (e.g., Amazon.com) offer a web services API to search and access the catalog. Of course, there are also REST APIs that do this (I don't want a REST versus web services flame war here).
What catalog API would developers suggest for data.gov?
Is there a standard catalog API suitable for data.gov?
Discuss via your comments...
As data on Data.gov becomes more robust and use increases, the number of tools that operate on identified raw data sets will grow. Different users may find greater utility in raw data or in data aggregated into tool sets. Providing both options wherever possible is the solution that can best meet the broadest needs of potential Data.gov users. Providing a mechanism for submitting tools that operate on the data rather than containing the data...