Open Data Movement

From P2P Foundation
Jump to navigation Jump to search

= aims at making data freely available to everyone.



"The Open Data Movement aims at making data freely available to everyone. There are already various interesting open data sets availiable on the Web. Examples include Wikipedia, Wikibooks, Geonames, Music Brainz, WordNet, the DBLP bibliography and many more which are published under Creative Commons or Talis licenses." (


Open Data and the Commons

an old story? by Simon Chignard:

"There is a direct link between the open data movement and the philosophy of common goods. Open data are an illustration of the notion of common informational goods proposed by Elinor Ostrom, winner of the 2009 Nobel Prize for economics. Open data belong to everyone and, unlike water and air (and other common goods), they are non-exclusive: their use by one does not prevent others. If I reuse an open data set, this does not prevent other reusers from doing so. This proximity between the commons and open data is also suggested by the presence of the initiator of Creative Commons licences, Lawrence Lessig, at the 2007 Sebastopol meeting in which the concept of open data itself was defined.

But despite the strong conceptual and historical linkages, it seems that we, as actors of open data, are often shy to reaffirm the relationship. In our efforts to encourage public and private bodies to embrace open data, we seem almost embarrassed of this cornerstone philosophy. The four proposals I make here aim at one thing: not letting it drop!" (

The Movement needs to include an 'effective use' criteria in its demands

Michael Gurstein:

"The open data movement in the area of access to public (and other) information is a relatively new but very significant, and potentially powerful, emerging force. It has now been widely endorsed by among others Tim Berners-Lee generally acknowledged as the Father of the World Wide Web. The overall intention is to make local, regional and national data (and particularly publicly acquired data) available in a form that allows for direct manipulation using software tools as for example, for the purposes of cross-tabulation, visualization, mapping and so on.

The underlying idea is that public (and other) data, whether collected directly as part of census collection or indirectly as a secondary output of other activities (crime or accident statistics for example) should be available in electronic form and accessible via the web. There are significant initiatives in this area underway in the US , the UK and Canada among many many other jurisdictions.

This drive towards increased public transparency and allowing for enhanced data enriched citizen/public engagement in policy and other analysis and assessment is certainly a very positive outcome of public computing and online tools for data management and manipulation. However, as with the earlier discussion concerning the “digital divide” there would, in this context, appear to be some confusion as between movements to enhance citizen “access” to data and the related issues concerning enhancing citizen “use” of this data as part, for example, of interventions concerning public policies and programs.

In an earlier paper dealing with the digital divide discussion I suggested the use of the concept of “effective use” to distinguish between the opportunity for digitally-enabled activity presented by ICT access, from the actual realization of those opportunities in the form of “effective use”. At that time I introduced a set of layers of requirements, which can be understood as “pre-conditions” for the realization of “effective use” of digital “access”.

Efforts to extend access to “data” will perhaps inevitably create a “data divide” parallel to the oft-discussed “digital divide” between those who have access to data which could have significance in their daily lives and those who don’t. Associated with this will, one can assume, be many of the same background conditions which have been identified as likely reasons for the digital divide—that is differences in income, education, literacy and so on. However, just as with the “digital divide”, these divisions don’t simply stop or be resolved with the provision of digital (or data) “access”. What is necessary as well, is that those for whom access is being provided are in a position to actually make use of the now available access (to the Internet or to data) in ways that are meaningful and beneficial for them.

The question then becomes, who is in a position to make “effective use” of this newly available data?

The suggestion implicit in most of the discussions on “open data” (and explicit in Berners-Lee’s above quoted talk) is that “everyone” has the potential to make use of the data. However, as we know from experience elsewhere, not “everyone” has access to the digital infrastructure, to the hardware or software, or to the financial or educational resources/skills which would allow for the effective use of data or any other digital resource. Thus rather than the entire range of potential users being able to translate their access into meaningful applications and uses, the lack of these foundational requirements means that the exciting new outcomes available from open data are available only to those who are already reasonably well provided for technologically and with other resources."

Michael Gurstein cites as a counter-example:

"A very interesting and well-documented example of this empowering of the empowered can be found in the work of Solly Benjamin and his colleagues looking at the impact of the digitization of land records in Bangalore. Their findings were that newly available access to land ownership and title information in Bangalore was primarily being put to use by middle and upper income people and by corporations to gain ownership of land from the marginalized and the poor. The newly digitized and openly accessible data allowed the well to do to take the information provided and use that as the basis for instructions to land surveyors and lawyers and others to challenge titles, exploit gaps in title, take advantage of mistakes in documentation, identify opportunities and targets for bribery, among others. They were able to directly translate their enhanced access to the information along with their already available access to capital and professional skills into unequal contests around land titles, court actions, offers of purchase and so on for self-benefit and to further marginalize those already marginalized.

Certainly the newly digitized information was “accessible” to all on an equal basis but the availability of resources to translate that “access” into a beneficial “effective use” was directly proportional to the already existing resources available to those to whom the access was being provided. The old story about the pauper and the millionaire having equal opportunity to purchase a printing press as a means to promote their interests can be seen as holding equally here as in the 19th century.

Benjamin’s meticulously documented paper shows how the digitization and related digital access to land title records in Bangalore had the direct effect of shifting power and wealth to those with the financial resources and skills to use this information in self-interested ways. This is not to suggest that processes of computerization inevitably lead to such outcomes but rather to say that in the absence of efforts to equalize the playing field with respect to enabling opportunities for the use of newly available data, the end result may be increased social divides rather than reduced ones particularly with respect to the already poor and marginalized."

And concludes:

"this is not to argue against “open data” which in fact is a very significant advance and support to broad-based democratic action and empowerment but rather to argue that in the absence of specific efforts to ensure the widest possible availability of the pre-requisites for “effective use” the outcome of “open data” may be quite the opposite to that which is anticipated (and presumably desired) by its strongest proponents.

An “effective use” approach to open data would thus be one that ensured that opportunities and resources for translating this open data into useful outcomes would be available (and adapted) for the widest possible range of users. Thus, to ensure the effective use of open data a range of considerations needs to be included in the open data process and as elements in the open data movement including such factors as the cost and availability of Internet access, the language in which the data is presented, the technical or professional requirements for interpreting and making use of the data, the availability of training in data use and visualization, among others." (

More Information

  • For a more detailed discussion on “effective data use” and overcoming the “data divide” see the blogpost at: