<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Open Source Information Extraction:  Data, Lists, Rules, and Development Environment</title>
	<atom:link href="http://wyner.info/LanguageLogicLawSoftware/index.php/2010/01/06/open-source-information-extraction-data-lists-rules-and-development-environment/feed/" rel="self" type="application/rss+xml" />
	<link>http://wyner.info/LanguageLogicLawSoftware/index.php/2010/01/06/open-source-information-extraction-data-lists-rules-and-development-environment/</link>
	<description>Dr. Adam Wyner&#039;s blog on legal informatics for legal professionals</description>
	<lastBuildDate>Thu, 22 Sep 2011 17:47:40 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.4</generator>
	<item>
		<title>By: wim</title>
		<link>http://wyner.info/LanguageLogicLawSoftware/index.php/2010/01/06/open-source-information-extraction-data-lists-rules-and-development-environment/comment-page-1/#comment-51</link>
		<dc:creator>wim</dc:creator>
		<pubDate>Tue, 12 Jan 2010 12:15:47 +0000</pubDate>
		<guid isPermaLink="false">http://wyner.info/LanguageLogicLawSoftware/?p=432#comment-51</guid>
		<description>Hi Adam,

I largely agree with your stance on openness.
Where we differ inopinion is how you present this. In my opinion you are wrong to single out Gate the way you do as a special case of the lack iof openness you want to criticize.

Reproducability and openness of course very important in science and have always been firmly embedded in  GATE&#039;s philosophy.
I contend that the development of GATE applications around the world has fostered scientific progress and collaboration. The evidence is all over the web. Just look at the GATE web site to appreciate how large the GATE community out there is. Just read the GATE papers to encounter exactly the same opinions that you put forward. The very essence of GATE is that it is important for GATE that people work with its freely available code. That&#039;s why the architecture itself is open source, and new applications are constantly made publicly available. Solving the knowledge bottleneck is in the case of GATE a truly collaborative activity.

Most freely available tools come as executables, i.e. without source code. Increasingly tools are also offered as web services. This is general practise.

Depending on the rigidity of one&#039;s principle, opinions may vary about whether applications should be made available as open source code on the one hand or as executable/web service on the other.
In my view, what is crucial is the level of detail of the description of the tools&#039; functionality. In principle, as in other branches of science, reproducabilty entails independent verification of both methodology and results. Executables and web services provide the results, but hide the source code. Adequate description of the methods, e.g. coverage of grammars and statistical algorithms, should warrant reproducability. Description of scientific method, not the provision of scientific technology, is fundamental to scientific progress.
As analogy, in chemistry, results can be verified without the original author supplying laboratory equipment or analytical software.
It is therefore in perfect tandem with scientific method if, for instance, in NLP, the verification of noun phrase annotation performance involves the evaluation of the results of an NP annotator with a description of its functionality, but without the provision of the source code.

You imply that all Sheffield development of GATE should be made public. I reply that this is our principle, except when tools are still under development, or subjec tto legal restrictions imposed by the funder.

Best wishes,

Wim

Dr. W. Peters
Natural Language Processing group
Department of Computer Science
University of Sheffield
Regent Court
211 Portobello Street
Sheffield S1 4DP
tel: 00-44-114-2221902
fax: 00-44-114-2221810
email: w.peters@dcs.shef.ac.uk</description>
		<content:encoded><![CDATA[<p>Hi Adam,</p>
<p>I largely agree with your stance on openness.<br />
Where we differ inopinion is how you present this. In my opinion you are wrong to single out Gate the way you do as a special case of the lack iof openness you want to criticize.</p>
<p>Reproducability and openness of course very important in science and have always been firmly embedded in  GATE&#8217;s philosophy.<br />
I contend that the development of GATE applications around the world has fostered scientific progress and collaboration. The evidence is all over the web. Just look at the GATE web site to appreciate how large the GATE community out there is. Just read the GATE papers to encounter exactly the same opinions that you put forward. The very essence of GATE is that it is important for GATE that people work with its freely available code. That&#8217;s why the architecture itself is open source, and new applications are constantly made publicly available. Solving the knowledge bottleneck is in the case of GATE a truly collaborative activity.</p>
<p>Most freely available tools come as executables, i.e. without source code. Increasingly tools are also offered as web services. This is general practise.</p>
<p>Depending on the rigidity of one&#8217;s principle, opinions may vary about whether applications should be made available as open source code on the one hand or as executable/web service on the other.<br />
In my view, what is crucial is the level of detail of the description of the tools&#8217; functionality. In principle, as in other branches of science, reproducabilty entails independent verification of both methodology and results. Executables and web services provide the results, but hide the source code. Adequate description of the methods, e.g. coverage of grammars and statistical algorithms, should warrant reproducability. Description of scientific method, not the provision of scientific technology, is fundamental to scientific progress.<br />
As analogy, in chemistry, results can be verified without the original author supplying laboratory equipment or analytical software.<br />
It is therefore in perfect tandem with scientific method if, for instance, in NLP, the verification of noun phrase annotation performance involves the evaluation of the results of an NP annotator with a description of its functionality, but without the provision of the source code.</p>
<p>You imply that all Sheffield development of GATE should be made public. I reply that this is our principle, except when tools are still under development, or subjec tto legal restrictions imposed by the funder.</p>
<p>Best wishes,</p>
<p>Wim</p>
<p>Dr. W. Peters<br />
Natural Language Processing group<br />
Department of Computer Science<br />
University of Sheffield<br />
Regent Court<br />
211 Portobello Street<br />
Sheffield S1 4DP<br />
tel: 00-44-114-2221902<br />
fax: 00-44-114-2221810<br />
email: <a href="mailto:w.peters@dcs.shef.ac.uk">w.peters@dcs.shef.ac.uk</a></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Open Source Information Extraction: Data, Lists, Rules, and &#8230; Software Rss</title>
		<link>http://wyner.info/LanguageLogicLawSoftware/index.php/2010/01/06/open-source-information-extraction-data-lists-rules-and-development-environment/comment-page-1/#comment-49</link>
		<dc:creator>Open Source Information Extraction: Data, Lists, Rules, and &#8230; Software Rss</dc:creator>
		<pubDate>Wed, 06 Jan 2010 21:05:40 +0000</pubDate>
		<guid isPermaLink="false">http://wyner.info/LanguageLogicLawSoftware/?p=432#comment-49</guid>
		<description>[...] original post here:  Open Source Information Extraction: Data, Lists, Rules, and &#8230;          By admin &#124; category: language software &#124; tags: entries, legal-informatics, logic, wyner &#124;  [...]</description>
		<content:encoded><![CDATA[<p>[...] original post here:  Open Source Information Extraction: Data, Lists, Rules, and &#8230;          By admin | category: language software | tags: entries, legal-informatics, logic, wyner |  [...]</p>
]]></content:encoded>
	</item>
</channel>
</rss>

