Blog for March 2019 Seminar: Open Data

The speaker at the NetIKX seminar in March 2019 was David Penfold, a veteran of the world of electronic publishing who also participates in ISO committees on standards for graphics technology.  He has been a lecturer at the University of the Arts London and currently teaches Information Management in a publishing context.

David’s talk looked at the two aspects of Open Data.  The most important thing for us to recognise is Data as the foundation and validation of Information.  He gave a series of interesting historical examples and pointed out that closer to the present day, quantum theory, relativity and much besides all developed because the data that people were measuring did not fit the predictions that earlier theoretical frameworks suggested.  A principle of experimental science is that if the data from your experiments don’t fit the predictions of your theories, it is the theories which must be revisited and reformulated.

David talked about some classificatory approaches. He mentioned the idea of a triple, where you have an entity, plus a concept of property, plus a value.  This three-element method of defining things is essential to the implementation of Linked Data.  Unless you can stablish relationships between data elements, they remain meaningless, just bare words or numbers.  A number of methods have been used to associate data elements with each other and with meaning.  The Relational Database model is one.  Spreadsheets are based on another model and the Standard Generalised Markup Language (and subsequently XML) was an approach to giving structure to textual materials.  Finally, the Semantic Web and the Resource Description Framework have developed over the last two decades

Moving on to what it means for data to be Open.  There are various misconceptions around this – it does not mean Open Access, a term used within the worlds of librarian ship and publishing to mean free-of-charge access, mainly to academic journals and books.  We are also not talking about Open Archiving, which has a close relationship to the Open Access concept.  Much of the effort in Open Archiving goes into developing standardised metadata so that archives can be shared.  Open data is freely available.  It is often from government but could be from other bodies and networks and even private companies.

We then watched a short piece of video showing Sir Nigel Shadbolt, in 2012, who was a founder of the Open Data Institute, which set up the open data portals for the UK government.  He explains how government publication of open data, in the interests of transparency is not found in many countries and at national, regional and local level.  The benefits include improved accountability, better public services, improvement in public participation, improved efficiency, creation of social value and innovation value to companies.

We heard about examples of Open Data, for example Network Rail publishes open data and benefits through improvements in customer satisfaction.  It says that its open data generates technology related jobs around the rail sector and saves costs in information provision when their parties invest in building information apps based on that data.    The data is used by commercial users too, but also the rail industry and Network Rail itself.   The data can also be accessed by individuals and academia.

Ordnance Survey open data is important within the economy and in governance.  David uses one application in his role as Chair of the Parish Council in his local village. The data allows them to see Historic England data for their area, and Environment Agency information showing sites of special scientific importance or areas of outstanding natural beauty.

After the tea-break, David showed three clips from a video of a presentation by Tim Berners-Lee.  David then explained how the Semantic Web works.  It is based on four concepts: a) metadata; b) structural relationships; d) tagging; d) the Resource Description Framework method of coding which in turn is based on XML.

The Open Data Institute has developed an ‘ethics canvas’, which we looked at to decide what we thought about it.  It gives a list of fifteen issues which may be of ethical concern.  We discussed this in our table groups and this was followed by a general discussion.  There were plenty of examples raised from our collective experience, which made for a lively end to the seminar.

This is taken from a report by Conrad Taylor

To see the full report follow this link: Conradiator : NetIKX meeting report : Open Data

March 2019 Seminar: Open Data

Summary

At this meeting David Penfold gave an introduction to the applications and implications of Open Data and the related topic of Linked Data. As more and more data is generated daily, and even by the minute, how that data is used and what information can be obtained from it becomes more and more significant. An important aspect of this is Open Data and the related topic of Linked Data. This meeting looked at these topics and reviewed how the use of Open and Linked Data can make access to information and how it is used much more powerful.

The meeting mainly consisted of a general (fairly non-technical) introduction to the subject from David Penfold, who gave examples of how open data is used by organisations such as Network Rail. He showed excerpts from presentations from Sir Tim Berners-Lee and Sir Nigel Shadbolt and concluded with a consideration of the ethics of Open Data and the implications of AI.

Speaker

Dr David Penfold is vice-chairman of NetIKX and has worked for many years in publishing, with a particular emphasis on content, structured documents and information management within a publishing context. He has previously been Chair of the British Computer Society Electronic Publishing Specialist Group and a Senior Lecturer at the London College of Communication (Deputy Course Director of the MA in Publishing). He is currently Convenor of the terminology Working Group of the ISO Technical Committee on Graphic Technology and a founder member of the recently formed IK SpringBoard, which is working on methods of implementation of the revised CILIP/KPMG report on Information as an Asset.

Time and Venue

2pm on 20th March 2019, The British Dental Association, 64 Wimpole Street, London W1G 8YS

Pre Event Information

None

Slides

No slides available for this presentation

Tweets

#netikx97

Blog

A report has been posted on the NetIKX blog

Study Suggestions

Have a look at the website for the Open Data Institute https://www.google.com/search?client=firefox-b-d&q=open+data+institute