<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Language Logic Law Software &#187; RDF/XML</title>
	<atom:link href="http://wyner.info/LanguageLogicLawSoftware/index.php/category/rdfxml/feed/" rel="self" type="application/rss+xml" />
	<link>http://wyner.info/LanguageLogicLawSoftware</link>
	<description>Dr. Adam Wyner&#039;s blog on legal informatics for legal professionals</description>
	<lastBuildDate>Wed, 18 Jan 2012 21:09:54 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.4</generator>
		<item>
		<title>Papers Accepted to the JURIX 2011 Conference</title>
		<link>http://wyner.info/LanguageLogicLawSoftware/index.php/2011/10/13/papers-accepted-to-the-jurix-2011-conference/</link>
		<comments>http://wyner.info/LanguageLogicLawSoftware/index.php/2011/10/13/papers-accepted-to-the-jurix-2011-conference/#comments</comments>
		<pubDate>Thu, 13 Oct 2011 13:42:04 +0000</pubDate>
		<dc:creator>Adam Wyner</dc:creator>
				<category><![CDATA[GATE]]></category>
		<category><![CDATA[RDF/XML]]></category>
		<category><![CDATA[e-Government]]></category>
		<category><![CDATA[legal knowledge management]]></category>
		<category><![CDATA[linguistics]]></category>
		<category><![CDATA[text analytics]]></category>
		<category><![CDATA[text mining]]></category>

		<guid isPermaLink="false">http://wyner.info/LanguageLogicLawSoftware/?p=1204</guid>
		<description><![CDATA[My colleagues and I have had two papers (one long and one short) accepted for presentation at The 24th International Conference on Legal Knowledge and Information Systems (JURIX 2011). The papers are available on the links. On Rule Extraction from Regulations Adam Wyner and Wim Peters Abstract Rules in regulations such as found in the [...]]]></description>
			<content:encoded><![CDATA[<p>My colleagues and I have had two papers (one long and one short) accepted for presentation at <a href="https://www.univie.ac.at/RI/JURIX2011/">The 24th International Conference on Legal Knowledge and Information Systems (JURIX 2011)</a>.  The papers are available on the links.</p>
<p><a href="http://wyner.info/research/Papers/WynerPetersJURIX2011.pdf">On Rule Extraction from Regulations</a><br />
Adam Wyner and Wim Peters</p>
<p>Abstract<br />
Rules in regulations such as found in the US Federal Code of Regulations can be expressed using conditional and deontic rules.  Identifying and extracting such rules from the language of the source material would be useful for automating rulebook management and translating into an executable logic.  The paper presents a linguistically-oriented, rule-based approach, which is in contrast to a machine learning approach.  It outlines use cases, discusses the source materials, reviews the methodology, then provides initial results and future steps.</p>
<p><a href="http://wyner.info/research/Papers/Populating_JURIX2011.pdf">Populating an Online Consultation Tool</a><br />
Sarah Pulfrey-Taylor, Emily Henthorn, Katie Atkinson, Adam Wyner, and Trevor Bench-Capon</p>
<p>Abstract<br />
The paper addresses the extraction, formalisation, and presentation of public policy arguments.  Arguments are extracted from documents that comment on public policy proposals.  Formalising the information from the arguments enables the construction of models and systematic analysis of the arguments.  In addition, the arguments are represented in a form suitable for presentation in an online consultation tool.  Thus, the forms in the consultation correlate with the formalisation and can be evaluated accordingly.  The stages of the process are outlined with reference to a working example.</p>
<p><a href="http://wyner.info/LanguageLogicLawSoftware/?p=1204">Shortlink to this page.</a></p>
<p>By Adam Wyner<br />
Distributed under the Creative Commons<br />
<a href="http://creativecommons.org/licenses/by-nc-sa/2.0/uk/">Attribution-Non-Commercial-Share Alike 2.0</a></p>
]]></content:encoded>
			<wfw:commentRss>http://wyner.info/LanguageLogicLawSoftware/index.php/2011/10/13/papers-accepted-to-the-jurix-2011-conference/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Draft &#8212; Materials for LEX 2011</title>
		<link>http://wyner.info/LanguageLogicLawSoftware/index.php/2011/09/08/materials-for-lex-2011/</link>
		<comments>http://wyner.info/LanguageLogicLawSoftware/index.php/2011/09/08/materials-for-lex-2011/#comments</comments>
		<pubDate>Thu, 08 Sep 2011 12:03:10 +0000</pubDate>
		<dc:creator>Adam Wyner</dc:creator>
				<category><![CDATA[GATE]]></category>
		<category><![CDATA[RDF/XML]]></category>
		<category><![CDATA[legal knowledge engineering]]></category>
		<category><![CDATA[legal knowledge management]]></category>
		<category><![CDATA[linguistics]]></category>
		<category><![CDATA[text analytics]]></category>
		<category><![CDATA[text mining]]></category>

		<guid isPermaLink="false">http://wyner.info/LanguageLogicLawSoftware/?p=1146</guid>
		<description><![CDATA[Draft post At the links below, you can find the slides and hands on materials on GATE for the LEX summer school on Managing Legal Resources in the Semantic Web. GATE Legislative Rulebook By Adam Wyner Distributed under the Creative Commons Attribution-Non-Commercial-Share Alike 2.0]]></description>
			<content:encoded><![CDATA[<p>Draft post</p>
<p>At the links below, you can find the slides and hands on materials on GATE for the LEX summer school on Managing Legal Resources in the Semantic Web.</p>
<p><a href="http://wyner.info/research/Papers/WynerGATELegislativeRulebook.zip">GATE Legislative Rulebook</a></p>
<p>By Adam Wyner<br />
Distributed under the Creative Commons<br />
<a href="http://creativecommons.org/licenses/by-nc-sa/2.0/uk/">Attribution-Non-Commercial-Share Alike 2.0</a></p>
]]></content:encoded>
			<wfw:commentRss>http://wyner.info/LanguageLogicLawSoftware/index.php/2011/09/08/materials-for-lex-2011/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Estrella Project Overview</title>
		<link>http://wyner.info/LanguageLogicLawSoftware/index.php/2010/01/30/estrella-project-overview/</link>
		<comments>http://wyner.info/LanguageLogicLawSoftware/index.php/2010/01/30/estrella-project-overview/#comments</comments>
		<pubDate>Sat, 30 Jan 2010 19:30:18 +0000</pubDate>
		<dc:creator>Adam Wyner</dc:creator>
				<category><![CDATA[RDF/XML]]></category>
		<category><![CDATA[legal knowledge engineering]]></category>
		<category><![CDATA[legal knowledge management]]></category>

		<guid isPermaLink="false">http://wyner.info/LanguageLogicLawSoftware/?p=751</guid>
		<description><![CDATA[Until September 2009, I worked on the Estrella Project (The European project for Standardized Transparent Representations in order to Extend Legal Accessibility) at the University of Liverpool. One of the documents which I co-authored (with Trevor Bench-Capon) for the project was the ESTRELLA User Report, which is an open document about key elements of the [...]]]></description>
			<content:encoded><![CDATA[<p>Until September 2009, I worked on the <a href="http://www.estrellaproject.org/">Estrella Project</a> (The European project for Standardized Transparent Representations in order to Extend Legal Accessibility) at the University of Liverpool.  One of the documents which I co-authored (with Trevor Bench-Capon) for the project was the <a href="http://www.estrellaproject.org/doc/Estrella-D4.5.pdf">ESTRELLA User Report</a>, which is an open document about key elements of the project.  In the context of commercial, academic, governmental collaborations, many of the issues and topics from that project are still relevant, especially concerning motivations and goals of open source materials for legal informatics.  In order to circulate this discussion further afield, I have taken the liberty to reproduce an extracted from the article.  <em>LKIF</em> stands for the <em>Legal Knowledge Interchange Format</em>, which was a key deliverable in the project.  For further documents from the project, see the Estrella Project website.</p>
<p><strong>Overview</strong></p>
<p>The <em>Estrella Project</em> (The European project for Standardized Transparent Representations in order to Extend Legal Accessibility) has developed a platform which allows public administrations to deploy comprehensive solutions for the management of legal knowledge. In reasoning about social benefits or taxation, public administrators must represent and reason with complex legislation. The platform is intended to support the representation of and reasoning about legislation in a way that can help public administrations to improve the quality and efficiency of their services.  Moreover, given a suitable interface, the legislation can be made available for the public to interact with. For example, LKIF tools could be made available to citizens via the web to help them to assess their eligibility for a social benefits as well as filling out the appropriate application forms.</p>
<p>The platform has been designed to be open and standardised so that public administrations need not become dependent on proprietary products of particular vendors. Along the same lines, the platform supports interoperability among various components for legal knowledge-based systems allowing public administrations to freely choose among the components. A standardised platform also enables a range of vendors to develop innovative products to suit particular market needs without having to be concerned with an all-encompassing solution, compatibility with other vendors, or being locked out of a strategic market by &#8220;monolithic&#8221; vendors. As well, the platform abstracts from the expression of legislation in different natural languages so providing a common, abstract legal &#8220;lingua franca&#8221;.</p>
<p>The main technical achievement of the Estrella Project is the development of a Legal Knowledge Interchange Format (LKIF), which represents legal information in a form which builds upon emerging XML-based standards of the Semantic Web. The project platform provides Application Programmer Interfaces (APIs) for interacting with legal knowledge-based systems using LKIF. LKIF provides formalisms for representing concepts (&#8220;ontologies&#8221;), inference rules, precedent cases and arguments. An XML document schema for legislation has been developed, called MetaLex, which complements and integrates national XML standards for legislation. This format supports document search, exchange, and association among documents as well as enforces a link between legal sources and the legal knowledge systems which reason about the information in the sources. In addition, a reference inference engine has been developed which supports reasoning with legal knowledge represented in LKIF. The utility of LKIF as an interchange format for legal knowledge has been demonstrated with pilot tests of legal documents which are expressed in proprietary formats of several vendors then translated to and from the format of one vendor to that of another.</p>
<p><strong>Background Context</strong></p>
<p>The Estrella Project originated in the context of European Union integration, where:</p>
<ul>
<li> The European Parliament passes EU wide directives which need to be incorporated into or related to the legislation of member states.</li>
<li> Goods, services, and citizens are free to move across open European borders.</li>
<li> Democratic institutions must be strengthened as well as be more responsive to the will of the citizenry.</li>
<li> Public administrations must be more efficient and economical.</li>
</ul>
<p>In the EU, the legal systems of member states have been composed of heterogeneous, often conflicting, rules and regulations concerning taxes, employment, education, pensions, health care, property, trade, and so on. Integration of new EU legislation with existing legislation of the member states as well as homogenisation of legal systems across the EU has been problematic, complex, and expensive to implement. As the borders of member states open, the rules and regulations concerning the benefits and liabilities of citizens and businesses must move as people, goods, and services move. For example, laws concerning employment and pension ought to be comparable across the member states so as to facilitate the movement of employees across national boundaries. In addition, there are more general concerns about improving the functionality of the legal system so as to garner public support for the legal system, promoting transparency, compliance, and citizen involvement.  Finally, the costs of administering the legal system by EU administrative departments, administrations of member states, and companies throughout the EU are signficant and rising. The more complex and dynamic the legislative environment, the more burdensome the costs.</p>
<p><strong>Purposes</strong></p>
<p>Given this background context, the Estrella Project was initiated with the following purposes in mind:</p>
<ul>
<li> to facilitate the integration of EU legal systems</li>
<li> to modernise public administration at the levels of the EU and within member states by supporting efficiency, transparency, accountability, accessibility, inclusiveness, portability, and simplicity of core governmental processes and services</li>
<li> to improve the quality of legal information by testing legal systems for consistency (are there contradictions between portions of the law) and correctness (is the law achieving the goal it is specied for?).</li>
<li> to reduce the costs of public administration</li>
<li> to reduce private sector costs of managing their legal obligations</li>
<li> to encourage public support for democratic institutions by participation, transparency, and personalisation of services</li>
<li> to ease the mobility of goods, services, and EU citizens within the EU</li>
<li> to support businesses across EU member states</li>
<li> to provide the means to &#8220;modularise&#8221; the legal systems for different levels of EU legal structure, e.g. provide a &#8220;municipal government&#8221; module which could be amended to suit local circumstances</li>
<li> to support a range of governmental and legal processes across organisations and on behalf of citizens and businesses</li>
<li> to support a variety of reasoning patterns as needed across a range of resources (e.g. directives, legal case bases).</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://wyner.info/LanguageLogicLawSoftware/index.php/2010/01/30/estrella-project-overview/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Natural Language Processing Techniques for Managing Legal Resources on the Semantic Web &#8212; Tutorial Slides</title>
		<link>http://wyner.info/LanguageLogicLawSoftware/index.php/2009/12/20/slides-available-for-nlp-tutorial-at-jurix-2009/</link>
		<comments>http://wyner.info/LanguageLogicLawSoftware/index.php/2009/12/20/slides-available-for-nlp-tutorial-at-jurix-2009/#comments</comments>
		<pubDate>Sun, 20 Dec 2009 19:48:35 +0000</pubDate>
		<dc:creator>Adam Wyner</dc:creator>
				<category><![CDATA[GATE]]></category>
		<category><![CDATA[RDF/XML]]></category>
		<category><![CDATA[legal knowledge engineering]]></category>
		<category><![CDATA[linguistics]]></category>
		<category><![CDATA[text mining]]></category>

		<guid isPermaLink="false">http://wyner.info/LanguageLogicLawSoftware/?p=412</guid>
		<description><![CDATA[I gave a tutorial on natural language processing for legal resource management at the International Conference on Legal Information Systems (JURIX) 2009 in Rotterdam, The Netherlands. The slides are available below. Comments welcome. The following people attended: Andras Forhecz, Budapest University of Technology and Economics, Hungary Ales Gola, Ministry of Interior of Czech Republic Harold [...]]]></description>
			<content:encoded><![CDATA[<p>I gave a tutorial on natural language processing for legal resource management at the International Conference on Legal Information Systems (JURIX) 2009 in Rotterdam, The Netherlands.  The slides are available below.  Comments welcome.</p>
<p>The following people attended:</p>
<ul>
<li>Andras Forhecz, Budapest University of Technology and Economics, Hungary</li>
<li>Ales Gola, Ministry of Interior of Czech Republic</li>
<li>Harold Hoffman, University Krems, Austria</li>
<li>Czeslaw Jedrzejek, Poznan University of Technology, Poland</li>
<li>Manuel Maarek, INRIA Grenoble, Rhone-Alpes</li>
<li>Michael Sonntag, Johannes Kepler University Linz, Austria</li>
<li>Vit Stastny, Ministry of Interior of Czech Republic</li>
</ul>
<p>I thank the participants for their comments and look forward to continuing the discussions which we started in the tutorial.</p>
<p>At the link, one can find the slides.  Comments are very welcome.  The file is 2.2MB.  The slides were originally prepared using Open Office&#8217;s Impress, then converted to PowerPoint.</p>
<p><a href="http://wyner.info/research/Papers/WynerJURIX2009LegalTextAnalytics.ppt">Natural Language Processing Techniques for Managing Legal Resources on the Semantic Web</a></p>
<p>There is a bit more in the slides than was presented at the tutorial, covering in addition ontologies, parsers, and semantic interpreters.</p>
<p>In the coming weeks, I will make available additional detailed instructions as well as gazetteers and JAPE rules.  I also plan to continue to add additional text mining materials.</p>
<p>By Adam Wyner<br />
Distributed under the Creative Commons<br />
<a href=""http://creativecommons.org/licenses/by-nc-sa/2.0/uk/"">Attribution-Non-Commercial-Share Alike 2.0</a></p>
]]></content:encoded>
			<wfw:commentRss>http://wyner.info/LanguageLogicLawSoftware/index.php/2009/12/20/slides-available-for-nlp-tutorial-at-jurix-2009/feed/</wfw:commentRss>
		<slash:comments>9</slash:comments>
		</item>
		<item>
		<title>Tutorial on NLP techniques for managing legal resources on the Semantic Web</title>
		<link>http://wyner.info/LanguageLogicLawSoftware/index.php/2009/12/08/tutorial-on-nlp-techniques-for-managing-legal-resources-on-the-semantic-web/</link>
		<comments>http://wyner.info/LanguageLogicLawSoftware/index.php/2009/12/08/tutorial-on-nlp-techniques-for-managing-legal-resources-on-the-semantic-web/#comments</comments>
		<pubDate>Tue, 08 Dec 2009 12:07:10 +0000</pubDate>
		<dc:creator>Adam Wyner</dc:creator>
				<category><![CDATA[RDF/XML]]></category>
		<category><![CDATA[legal knowledge management]]></category>
		<category><![CDATA[text mining]]></category>

		<guid isPermaLink="false">http://wyner.info/LanguageLogicLawSoftware/?p=407</guid>
		<description><![CDATA[Next week, 16 December 2009, I am giving a three hour tutorial at JURIX (International Conference on Legal Knowledge and Information Systems) in Rotterdam, The Netherlands on Natural Language Processing Techniques for Managing Legal Resources on the Semantic Web. The tutorial description appears below. Further material from the tutorial will be presented on the blog. [...]]]></description>
			<content:encoded><![CDATA[<p>Next week,  16 December 2009, I am giving a three hour tutorial at JURIX (International Conference on Legal Knowledge and Information Systems) in Rotterdam, The Netherlands on <a href="http://www.frg.eur.nl/arw/sectie_informatica_recht/jurix2009/tutorials/">Natural Language Processing Techniques for Managing Legal Resources on the Semantic Web</a>.  The tutorial description appears below.  Further material from the tutorial will be presented on the blog.</p>
<p>Legal resources such as legislation, public notices, and case law are increasingly available on the internet.  To be automatically processed by web services, the resources must be annotated using semantic web technologies such as XML, RDF, and ontologies.  However, manual annotation is labour and knowledge intensive.  Using natural language processing techniques and systems (NLP), a significant portion of these resources can be automatically annotated.  In this tutorial, we outline the motivations and objectives of NLP, give an overview of several accessible systems (General Architecture on Text Engineering, C&#038;C/Boxer, Attempto Controlled English), provide examples of processing legal resources, and discuss future directions in this area. </p>
]]></content:encoded>
			<wfw:commentRss>http://wyner.info/LanguageLogicLawSoftware/index.php/2009/12/08/tutorial-on-nlp-techniques-for-managing-legal-resources-on-the-semantic-web/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Annotating Rules in Legislation</title>
		<link>http://wyner.info/LanguageLogicLawSoftware/index.php/2009/11/27/annotating-rules-in-legislation/</link>
		<comments>http://wyner.info/LanguageLogicLawSoftware/index.php/2009/11/27/annotating-rules-in-legislation/#comments</comments>
		<pubDate>Fri, 27 Nov 2009 14:02:15 +0000</pubDate>
		<dc:creator>Adam Wyner</dc:creator>
				<category><![CDATA[RDF/XML]]></category>
		<category><![CDATA[argumentation]]></category>
		<category><![CDATA[e-Government]]></category>
		<category><![CDATA[legal knowledge engineering]]></category>
		<category><![CDATA[legal knowledge management]]></category>
		<category><![CDATA[text mining]]></category>

		<guid isPermaLink="false">http://wyner.info/LanguageLogicLawSoftware/?p=381</guid>
		<description><![CDATA[Over the last couple of months, I have had discussions about text mining and annotating rules in legislation with several people (John Sheridan of The Office of Public Sector Information, Richard Goodwin of The Stationery Office, and John Cyriac of Compliance Track). While nothing yet concrete has resulted from these discussions, it is clearly a [...]]]></description>
			<content:encoded><![CDATA[<p>Over the last couple of months, I have had discussions about text mining and annotating rules in legislation with several people (John Sheridan of <a href="http://www.opsi.gov.uk/">The Office of Public Sector Information</a>, Richard Goodwin of <a href="http://www.tso.co.uk/">The Stationery Office</a>, and John Cyriac of <a href="http://www.compliance-track.com/">Compliance Track</a>).  While nothing yet concrete has resulted from these discussions, it is clearly a &#8220;hot topic&#8221;.</p>
<p>In the course of these discussions, I prepared a short outline of the issues and approaches, which I present below.  Comments, suggestions, and collaborations are welcome.</p>
<p><strong>Vision, context, and objectives</strong></p>
<p>One of the main visions of <em>artificial intelligence and law</em> has been to develop a <em>legislative processing tool</em>.  Such a tool has several related objectives:</p>
<ul>
<ol>[1.] To guide the drafter to write well-formed legal rules in natural language.</ol>
<ol>[2.] To automatically parse and semantically represent the rules.</ol>
<ol>[3.] To automatically identify and annotate the rules so that they can be extracted from a corpus of legislation for web-based applications.</ol>
<ol>[4.] To enable inference, modeling, and consistency testing with respect to the rules.</ol>
<ol>[5.] To reason with respect to <em>domain knowledge</em> (an ontology).</ol>
<ol>[6.] To serve the rules on the web so that users can use natural language to input information and receive determinations.</ol>
</ul>
<p>While no such tool exists, there has been steady progress on understanding the problems and developing working software solutions.  In early work (see <a href="http://www.doc.ic.ac.uk/~rak/papers/British%20Nationality%20Act.pdf">The British nationality act as a logic program (1986)</a>), an act was manually translated into a program, allowing one to draw inferences given ground facts.  <a href="http://www.oracle.com/haley/index.html">Haley</a> is a software and service company which provides a framework which partially addresses 1, 2, 4, and 6 (see <a href="http://www.oracle.com/industries/government/enterprise-policy-automation/automation-for-public-sector.html">Policy Automation</a>).  Some research addresses aspects of 3 (see <a href="https://lirias.kuleuven.be/bitstream/123456789/234784/1/MochalesMoensICAIL09.pdf>Argumentation mining: the detection, classification, and structure of arguments in text</a>).  Legal ontologies have been proposed to address 5 (see <a href="http://www.estrellaproject.org/lkif-core/">LKIF-Core Ontology</a>).  Finally, there are XML annotation schemas for legislation (and related input support) such as <a href="http://www.opsi.gov.uk/legislation/schema/">The Crown XML Schema for Legislation</a> and <a href="http://www.akomantoso.org/">Akoma Ntoso</a>, both of which require manual input.  Despite these advances, there is much progress yet to be made.  In particular, no results fulfill [3.].</p>
<p>In consideration of [3.], the primary objective of this proposal is to use the <a href="http://gate.ac.uk/">General Architecture for Text Engineering (GATE)</a> framework in order to automatically identify and annotate legislative rules from a corpus.  The annotation should support web-based applications and be consistent with semantic web mark ups for rules, e.g. <a href="http://ruleml.org/">RuleML</a>.  A subsidiary objective is to define an authoring template which can be used within existing authoring applications to manually annotate legislative rules.</p>
<p><strong>Benefits</strong></p>
<p>Attaining these objectives would:</p>
<ul>
<li>Support automated creation, maintenance, and distribution of rule books for compliance.
<li>Contribute to the development of a legislative processing tool.</li>
<li>Make legislative rules accessible for web-based applications.  For example, given other annotations, one could identify rules that apply with respect to particular individuals in an organisation along with relevant dates, locations, etc.</li>
<li>Enable further processing of the rules such as removing formatting, parsing the content of the rules, and representing them semantically.</li>
<li>Allow an inference engine to be applied over the formalised rule base.</li>
<li>Make legislation more transparent and communicable among interested parties such as government departments, EU governments, and citizenry.</li>
</ul>
<p><strong>Scope</strong></p>
<p>To attain the objectives, we propose the following phases, where the numbers represent weeks of effort:</p>
<ul>
<li>Create a relatively small sample corpus to scope the study.</li>
<li>Manually identify the forms of legislative rules within the corpus.</li>
<li>Develop or adapt an annotation scheme for rules.</li>
<li>Apply the analysis tools of GATE and annotate the rules.</li>
<li>Validate that GATE annotates the rules as intended.</li>
<li>Apply the annotation system to a larger corpus of documents.</li>
</ul>
<p>For each section, we would produce a summary of results, noting where difficulties are encountered and ways they might be addressed.</p>
<p><strong>Extending the work</strong></p>
<p>The work can be extended in a variety of ways:</p>
<ul>
<li>Apply the GATE rules to a larger corpus with more variety of rule forms.</li>
<li>Process the rules for semantic representation and inference.</li>
<li>Take into consideration defeasiblity and exceptions.</li>
<li>Develop semantic web applications for the rules.</li>
</ul>
<p>By Adam Wyner<br />
Distributed under the Creative Commons<br />
<a href=""http://creativecommons.org/licenses/by-nc-sa/2.0/uk/"">Attribution-Non-Commercial-Share Alike 2.0</a></p>
]]></content:encoded>
			<wfw:commentRss>http://wyner.info/LanguageLogicLawSoftware/index.php/2009/11/27/annotating-rules-in-legislation/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Instructions for GATE&#8217;s Onto Root Gazetteer</title>
		<link>http://wyner.info/LanguageLogicLawSoftware/index.php/2009/11/24/notes-on-onto-root-gazetteer/</link>
		<comments>http://wyner.info/LanguageLogicLawSoftware/index.php/2009/11/24/notes-on-onto-root-gazetteer/#comments</comments>
		<pubDate>Tue, 24 Nov 2009 18:41:48 +0000</pubDate>
		<dc:creator>Adam Wyner</dc:creator>
				<category><![CDATA[GATE]]></category>
		<category><![CDATA[RDF/XML]]></category>
		<category><![CDATA[legal knowledge engineering]]></category>
		<category><![CDATA[ontology]]></category>
		<category><![CDATA[taxonomy]]></category>
		<category><![CDATA[text mining]]></category>

		<guid isPermaLink="false">http://wyner.info/LanguageLogicLawSoftware/?p=355</guid>
		<description><![CDATA[In this post, I present User Manual notes for GATE&#8217;s Onto Root Gazetteer (ORG) and references to ORG. In Discussion of GATE&#8217;s Onto Root Gazetteer, I discuss aspects of Onto Root Gazetteer which I found interesting or problematic. These notes and discussion may be of use to those researchers in legal informatics who are interested [...]]]></description>
			<content:encoded><![CDATA[<p>In this post, I present User Manual notes for GATE&#8217;s <em>Onto Root Gazetteer</em> (ORG) and references to ORG.  In <a href="http://wyner.info/LanguageLogicLawSoftware/index.php/2009/11/24/discussion-of-gates-onto-root-gazetteer/?preview=true&#038;preview_id=498&#038;preview_nonce=d412a16361">Discussion of GATE&#8217;s Onto Root Gazetteer</a>, I discuss aspects of Onto Root Gazetteer which I found interesting or problematic.  These notes and discussion may be of use to those researchers in legal informatics who are interested in text mining and annotation for the semantic web.</p>
<p>Thanks to Diana Maynard, Danica Damljanovic, Phil Gooch, and the GATE User Manual for comments and materials which I have liberally used.  Errors rest with me (and please tell me where they are so I can fix them!).</p>
<p><strong>Purpose</strong></p>
<p>Onto Root Gazetteer links text to an ontology by creating Lookup annotations which come from the ontology rather than a default gazetteer.  The ontology is preprocessed to produce a flexible, dynamic gazetteer; that is, it is a gazetteer which takes into account alternative morphological forms and can be added to.  An important advantage is that text can be annotated as an individual of the ontology, thus facilitating the population of the ontology.</p>
<p>Besides being flexible and dynamic, some advantages of ORG over other gazetteers:</p>
<ul>
<li>It is more richly structured (see it as a gazetteer containing other gazetteers)</li>
<li>It allows one to relate textual and ontological information by adding instances.</li>
<li>It gives one richer annotations that can be used for further processes.</li>
</ul>
<p>In the following, we present the step by step instructions for &#8216;rolling your own&#8217;, then show the results of the &#8216;prepackaged&#8217; example that comes with the plugin.</p>
<p><strong>Setup</strong></p>
<p>Step 1.  Add (if not already used) the Onto Root Gazetteer plugin to GATE following the usual plugin instructions.</p>
<p>Step 2.  Add (if not already used) the Ontology Tools (OWLIM Ontology LR, OntoGazetteer, GATE Ontology Editor, OAT) plugin.  ORG uses ontologies, so one must have these tools to load them as language resources.</p>
<p>Step 3.  Create (or load) an ontology with OWLIM (see the instructions on the ontologies).  This is the ontology that is the language resource that is then used by Onto Root Gazetteer.  Suppose this ontology is called myOntology.  It is important to note that OWLIM can only use OWL-Lite ontologies (see the documentation about this).  Also, I succeeded in loading an ontology only from the resources folder of the Ontology_Tools plugin (rather than from another drive); I don&#8217;t know if this is significant.</p>
<p>Step 4.  In GATE, create processing resources with default parameters:</p>
<ul>
<li>Document Reset PR</li>
<li>RegEx Sentence Splitter (or ANNIE Sentence Splitter, but that one is likely to run slower</li>
<li>ANNIE English Tokeniser</li>
<li>ANNIE POS Tagger</li>
<li>GATE Morphological Analyser</li>
</ul>
<p>Step 5.  When all these PRs are loaded, create a Onto Root Gazetteer PR and set the initial parameters as follows. Mandatory ones are as follows (though some are set as defaults):</p>
<ul>
<li>Ontology: select previously created myOntology</li>
<li>Tokeniser: select previously created Tokeniser</li>
<li>POSTagger: select previously created POS Tagger</li>
<li>Morpher: select previously created Morpher.</li>
</ul>
<p>Step 6. Create another PR which is a Flexible Gazetteer. At the initial parameters, it is mandatory to select previously created OntoRootGazetteer for gazetteerInst.  For another parameter, inputFeatureNames, click on the button on the right and when prompt with a window, add &#8216;Token.root&#8217; in the provided text box, then click Add button. Click OK, give name to the new PR (optional) and then click OK.</p>
<p>Step 7.  To create an application, right click on Application, New &#8211;> Pipeline (or Corpus Pipeline).  Add the following PRS to the application in this order:</p>
<ul>
<li>Document Reset PR</li>
<li>RegEx Sentence Splitter</li>
<li>ANNIE English Tokeniser</li>
<li>ANNIE POS Tagger</li>
<li>GATE Morphological Analyser</li>
<li>Flexible Gazetteer</li>
</ul>
<p>Step 8.  Run the application over the selected corpus.</p>
<p>Step 9.  Inspect the results.  Look at the Annotation Set with Lookup and also the Annotation List to see how the annotations appear.</p>
<p><strong>Small Example</strong></p>
<p>The ORG plugin comes with a demo application which not only sets up all the PRs and LRs (the text, corpus, and ontology), but also the application ready to run.  This is the file <em>exampleApp.xgapp</em>, which is in resource folder of the plugin (Ontology_Based_Gazetteer).  To start this, start GATE with a clean slate (no other PRs, LRs, or applications), then Applications, then right click to Restore application from file, then load the file from the folder just given.</p>
<p>The ontology which is used for an illustration is for GATE itself, giving the classes, subclasses, and instances of the system.  While the ontology is loaded along with the application, one can find it <a href="http://gate.ac.uk/ns/gate-kb">here</a>.  The text is simple (and comes with the application):  <em>language resources and parameters</em>.</p>
<p>FIGURE 1 (missing at the moment)</p>
<p>FIGURE 2 (missing at the moment)</p>
<p>One can see that the token &#8220;language resources&#8221; is annotated with respect to the class LanguageResource, &#8220;resources&#8221; is annotated with GATEResource, and &#8220;parameters&#8221; is annotated with ResourceParameter.  We discuss this further below.</p>
<p>One further aspect is important and useful.  Since the ontology tools have been loaded and a particular ontology has been used, one can not only see the ontology (open the OAT tab in the window with the text), but <em>one can annotate the text with respect to the ontology</em> &#8212; highlight some text and a popup menu allows one to select how to annotate the text.  With this, one can add instances (or classes) to the ontology.</p>
<p><strong>Documentation</strong></p>
<p>One can consult the following for further information about how the gazetteer is made, among other topics:</p>
<ul>
<li>GATE User Manual on <a href="http://gate.ac.uk/sale/tao/splitch13.html#x18-31400013.8">Onto Root Gazetteer</a>.</li>
<li>See section 3.1. of <a href="http://gate.ac.uk/sale/lrec2008/clone-ql/clone-ql-paper.pdf">this paper</a>.</li>
<li>See section 4 of <a href="http://www.tao-project.eu/resources/publicdeliverables/d3-1.pdf">TAO Deliverables</a>.</li>
<li>See Peters and Maynard in <a href="http://www.neon-project.org/web-content/images/Publications/neon_2009_d1042.pdf">A GATE-way into NeON</a>.
</ul>
<p><strong>Discussion</strong></p>
<p>See the related post <a href="http://wyner.info/LanguageLogicLawSoftware/index.php/2009/11/24/discussion-of-gates-onto-root-gazetteer/">Discussion of GATE&#8217;s Onto Root Gazetteer</a>.</p>
<p>By Adam Wyner<br />
Distributed under the Creative Commons<br />
<a href=""http://creativecommons.org/licenses/by-nc-sa/2.0/uk/"">Attribution-Non-Commercial-Share Alike 2.0</a></p>
]]></content:encoded>
			<wfw:commentRss>http://wyner.info/LanguageLogicLawSoftware/index.php/2009/11/24/notes-on-onto-root-gazetteer/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>London&#8217;s DataStore Workshop</title>
		<link>http://wyner.info/LanguageLogicLawSoftware/index.php/2009/10/24/londons-datastore-workshop/</link>
		<comments>http://wyner.info/LanguageLogicLawSoftware/index.php/2009/10/24/londons-datastore-workshop/#comments</comments>
		<pubDate>Sat, 24 Oct 2009 17:14:56 +0000</pubDate>
		<dc:creator>Adam Wyner</dc:creator>
				<category><![CDATA[RDF/XML]]></category>
		<category><![CDATA[e-Government]]></category>

		<guid isPermaLink="false">http://wyner.info/LanguageLogicLawSoftware/?p=344</guid>
		<description><![CDATA[Today I attended a workshop organised by the Greater London Authority (GLA), which is the citywide government for London. The workshop was held at City Hall on the top floor where we had a splendid view over the Thames, of Tower Bridge, and the Tower of London. The GLA is in the process of scoping [...]]]></description>
			<content:encoded><![CDATA[<p>Today I attended a workshop organised by the <a href="http://www.london.gov.uk/gla/index.jsp">Greater London Authority (GLA)</a>, which is the citywide government for London.  The workshop was held at City Hall on the top floor where we had a splendid view over the Thames, of Tower Bridge, and the Tower of London.</p>
<p>The GLA is in the process of scoping a datastore for information London.  The objective is to begin to encourage development of &#8220;government 2.0&#8243; using open government data along the lines of what has been done in San Franscisco in the US (see an article on <a href="http://www.guardian.co.uk/technology/2009/oct/14/san-francisco-open-city-data">DataSF</a> and a post by <a href="http://mashable.com/2009/10/21/san-francisco-government/">San Francisco Mayor Gavin Newsom</a>).  The principle idea is that by putting data of public interest into the public domain, the government can provide the basis for development of applications and services for the government, business community, and public.  For example, using police data, one can generate crime maps.</p>
<p>At the GLA meeting, the objective was to meet with the developer community to get ideas and feedback on what and how the data should be released as well as how best to encourage applications in the near future.</p>
<p>Clearly, the GLA meeting is along the lines of what is happening elsewhere in the UK government (see <a href="http://blogs.cabinetoffice.gov.uk/digitalengagement/">Digital Engagement at the Cabinet Office</a>, the <a href="http://www.opsi.gov.uk/">Office of Public Sector Information</a>, and <a href="http://www.tso.co.uk/">The Stationery Office</a>).</p>
<p>There were some 70 participants at the meeting, and we can look forward to further information coming from the organisers at the GLA.  Some very useful suggestions where made about where to get further information such as the <a href="http://www.innovateuk.org/">Technology Strategy Board</a> which supports technology development in the UK.</p>
<p>Among the topics of discussion where:</p>
<ul>
<li>What sort of data should be released and in what form?  There were those who wanted it raw and those who wanted it structured.  Likely releasing it both forms will occur.</li>
<li>How to get licensing for the data?  There are a host of difficult issues here, as most of the data is owned or copyrighted by a range of organisations, each of whom wants to control the flow of information, profit from it, or has concerns about security/liability.  Moreover, the government contracts information service providers, which process the data, may have some legal claim.  Such providers may be required to make their data open.</li>
<li>How would the data be used?  There were many suggestions about data reuse and mash up, mostly along the lines of existing applications such as mapping data to physical maps in order to get ideas about what is happening where in neighborhoods, transportation assistance, information access in a local area, and so on.</li>
<li>Who would develop the applications and how would development be funded?  Clearly there is an issue about funding, but some of the ways around it are to leverage funding between academic, government, and business communities.</li>
<li>Who to consult about applications?  A range of parties might be consulted about what they would find useful, from the person on the street, to members of service organisations (police, licensing, etc), to higher level government organisations.  Alternative, the GLA or similar organisations might develop applications which they thought would be useful, then provide them to the public.  Here, the focus would be on small, manageable, pilot projects to show proof of concept.</li>
</ul>
<p>The discussion was very nicely organised and led &#8212; several large tables around a circle, several large monitors, several boards for writing, an MC who kept things moving along, and a good overall atmosphere.  However, missing were a list of participants, contact information, and a short (three sentence) statement of interest; hopefully, all this will appear soon.</p>
<p>This is all very interesting and exciting, for things are just beginning to happen.  However, I have some concerns and realise I have a somewhat different focus.</p>
<ul>
<li>There were too few participants from government services and academia.  This is also reflected in the gap between the technology and the data, since it is highly unclear who is developing what for whom and what purpose?  It is hard to get a handle on how data should be served without some sense of goals.  Nonetheless, likely there will be another meet-up at which this more substantive discussion will happen.</li>
<li>There was only passing mention of ontologies, annotation, information extraction, and the semantic web.  The absence of semantic web concepts suggests that &#8220;reasoning&#8221; and complex information management is not high on the agenda.  This is consistent with the family of application ideas (graphs, maps, local information).  While I was told that there were ontologies via OPSI, this is not what I understood from John Sheridan in my recent discussion, so I will be eager to see exactly what this is.</li>
<li>Similarly, there was some discussion about whether there should be or could be standards and schemas for the data.  These are always compatible (make them both available), but I see standards are essential for any communication across agencies and localities.  There are drawbacks to standards development, but the issue will arise sooner or later in any case.</li>
<li>There is, as yet, no <em>the pitch</em>.  In other words, what is the incentive for anyone to make their data available or for organisations to otherwise cooperate with this endeavour?  The only mentions of incentive were government obligation, but this is perhaps the most heavy handed way to make headway.  Rather, there should be positive incentives.  In addition, the eGovernment agenda should be pushed (e.g. transparency, support for government, participation, efficiency, cost reduction, consistency&#8230;.).</li>
<li>There was little discussion of exactly which technologies would be used though RDF/XML and <a href="http://en.wikipedia.org/wiki/Representational_State_Transfer">REST</a> were mentioned.  These are generic and widespread; is the real hangup right now data access, or is there some technological issue?  If I wanted to know what I should need to know to program and provide a simple service, what would I have to know and do?</li>
<li>Despite the widespread interest in government 2.0, there is little vertical/horizontal integration or communication among the interested parties.  There is not, apparently, a coherent website or &#8216;state of the art&#8217; article with links to the relevant data/functionalities/support organisations.</li>
<li>There was no over-arching conception of design or context for applications.  Likely some sort of &#8216;apps&#8217; or plugins framework will emerge so that, for example, a local council would build none of its own applications or services, but these would be provided as plugins by independent providers, yet given a consistent style and structure.</li>
<li>Though there are claims that there have been consultations about government 2.0 with the various interested parties, there is no clear presentation of the results of those consultations.  A &#8216;brain-storming&#8217; site would be very useful.</li>
<li>It is unclear to me the extent to which the participants have the political/social context in mind.  While we were hosted by the GLA and discussed GLA data, the opportunities, limitations, requirements, and objectives of government seem to have entirely overlooked.  For example, government is successful (not always, but often) with making and monitoring standards for the public good; as elsewhere, why not here?  The requirements of government information provision are different than for commercial provision, especially since the government provides goods and services that would not otherwise be profitable.  The government does consult with the public and interested parties in making policy, but in some cases it is crucial that government <em>lead</em> and <em>direct</em> developments; the government is not simply another commerical provider of goods and services, driven by consumer interests; the government has a legislative role.  Keeping this in mind may change the sorts of proposals that come out from open GLA data</li>
<li>There were several discussions about why and how government data should be published.  The main points ought to be developed, discussed further, and summarised.  Yet, it ought also be pointed out that there is, in the UK, an abundance of information that the government holds about individuals; it is unclear how a &#8216;firewall&#8217; to protect and promote civil liberties will be set up and maintained; privacy and rights are in fact rather weak in the UK.  For example, the NHS is state funded and one might argue certain matters are in the public interest, so open information issues may arise here:  will we have &#8216;disease&#8217; mashups such as there are for broken lamp-posts, but in this case for drug addicts, HIV carriers, swine flu, etc?</li>
<li>I am particularly interested in legal reasoning, but this is not something on the agenda with respect to this data.
</ul>
<p>In any case, there is much of interest here and much to look forward to.</p>
<p>Cheers,<br />
Adam Wyner</p>
<p>Copyright © 2009 Adam Wyner</p>
]]></content:encoded>
			<wfw:commentRss>http://wyner.info/LanguageLogicLawSoftware/index.php/2009/10/24/londons-datastore-workshop/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Article about RDFa and Open Government</title>
		<link>http://wyner.info/LanguageLogicLawSoftware/index.php/2009/08/14/article-about-rdfa-and-open-government/</link>
		<comments>http://wyner.info/LanguageLogicLawSoftware/index.php/2009/08/14/article-about-rdfa-and-open-government/#comments</comments>
		<pubDate>Fri, 14 Aug 2009 17:12:45 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[RDF/XML]]></category>

		<guid isPermaLink="false">http://wyner.info/LanguageLogicLawSoftware/?p=284</guid>
		<description><![CDATA[This is a link to an article by Mark Birkbeck on RDFa and Open Government. It introduces the problems goverments have in managing their information and serving to the public, then outlines how RDFa, which introduces RDF into HTML documents, addresses these problem and provides for highly productive and readable web-documents. Copyright &#169; 2009 Adam [...]]]></description>
			<content:encoded><![CDATA[<p>This is a link to an article by <a href="http://blogs.talis.com/nodalities/2009/07/rdfa-and-linked-data-in-uk-government-web-sites.php">Mark Birkbeck</a> on RDFa and Open Government.  It introduces the problems goverments have in managing their information and serving to the public, then outlines how RDFa, which introduces RDF into HTML documents, addresses these problem and provides for highly productive and readable web-documents.</p>
<p>Copyright &copy; 2009 Adam Wyner</p>
]]></content:encoded>
			<wfw:commentRss>http://wyner.info/LanguageLogicLawSoftware/index.php/2009/08/14/article-about-rdfa-and-open-government/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

