<?xml version="1.0" encoding="UTF-8"?>
<!-- generator="wordpress/wordpress-mu-1.2.5" -->
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	>

<channel>
	<title>Xavier Llorà @ DITA &#187; Notes</title>
	<link>http://dita.ncsa.uiuc.edu/xllora</link>
	<description>There is never enough data</description>
	<pubDate>Sun, 18 May 2008 04:20:40 +0000</pubDate>
	<generator>http://wordpress.org/?v=wordpress-mu-1.2.5</generator>
	<language>en</language>
			<item>
		<title>Moving This Blog</title>
		<link>http://dita.ncsa.uiuc.edu/xllora/2008/05/17/moving-this-blog/</link>
		<comments>http://dita.ncsa.uiuc.edu/xllora/2008/05/17/moving-this-blog/#comments</comments>
		<pubDate>Sun, 18 May 2008 04:20:40 +0000</pubDate>
		<dc:creator>Xavier</dc:creator>
		
		<category><![CDATA[Notes]]></category>

		<guid isPermaLink="false">http://dita.ncsa.uiuc.edu/xllora/2008/05/17/moving-this-blog/</guid>
		<description><![CDATA[I am just working on unifying all my blogs, and this has been the first one. The new one is located here http://www.xavierllora.net/
Related PostsE2K blog has movedBlogging about NCSA PSPGeneric looping in Python]]></description>
			<content:encoded><![CDATA[<p>I am just working on unifying all my blogs, and this has been the first one. The new one is located here <a href="http://www.xavierllora.net/">http://www.xavierllora.net/</a></p>
<div class="aizatto_related_posts"><span class="aizatto_related_posts_header" >Related Posts</span><ul><li><span class="aizatto_related_posts_title" ><a href="http://dita.ncsa.uiuc.edu/xllora/2007/08/02/e2k-blog-has-moved/" rel="bookmark" title="Permanent Link: E2K blog has moved" >E2K blog has moved</a></span><div class="aizatto_related_posts_excerpt"></div></li><li><span class="aizatto_related_posts_title" ><a href="http://dita.ncsa.uiuc.edu/xllora/2007/05/22/blogging-about-ncsa-psp/" rel="bookmark" title="Permanent Link: Blogging about NCSA PSP" >Blogging about NCSA PSP</a></span><div class="aizatto_related_posts_excerpt"></div></li><li><span class="aizatto_related_posts_title" ><a href="http://dita.ncsa.uiuc.edu/xllora/2008/01/16/generic-looping-in-python/" rel="bookmark" title="Permanent Link: Generic looping in Python" >Generic looping in Python</a></span><div class="aizatto_related_posts_excerpt"></div></li></ul></div>]]></content:encoded>
			<wfw:commentRss>http://dita.ncsa.uiuc.edu/xllora/2008/05/17/moving-this-blog/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Meandre: Semantic-Driven Data-Intensive Flow Engine</title>
		<link>http://dita.ncsa.uiuc.edu/xllora/2008/04/18/meandre-semantic-driven-data-intensive-flow-engine/</link>
		<comments>http://dita.ncsa.uiuc.edu/xllora/2008/04/18/meandre-semantic-driven-data-intensive-flow-engine/#comments</comments>
		<pubDate>Sat, 19 Apr 2008 00:30:22 +0000</pubDate>
		<dc:creator>Xavier</dc:creator>
		
		<category><![CDATA[Notes]]></category>

		<guid isPermaLink="false">http://dita.ncsa.uiuc.edu/xllora/2008/04/18/meandre-semantic-driven-data-intensive-flow-engine/</guid>
		<description><![CDATA[Finally we have finished setting up the website for Meandre a semantic-driven data-intensive flow engine. Meandre provides basic infrastructure for data-intensive computation. It provides, among others, tools for creating components and flows, a high-level language to describe flows, and multicore and distributed execution environment based on a service-oriented paradigm. We are currently working on getting [...]]]></description>
			<content:encoded><![CDATA[<p>Finally we have finished setting up the website for Meandre a semantic-driven data-intensive flow engine. Meandre provides basic infrastructure for data-intensive computation. It provides, among others, tools for creating components and flows, a high-level language to describe flows, and multicore and distributed execution environment based on a service-oriented paradigm. We are currently working on getting gear up for a first alpha release. You can visit the Meandre site <a href="http://seasr.org/meandre">here</a>. I will be posting in the Meandre blog about our current steps toward getting the release out of the door. The Meandre infrastructure is being build to support the <a href="http://seasr.org">SEASR project</a></p>
<div class="aizatto_related_posts"><span class="aizatto_related_posts_header" >Related Posts</span><ul><li><span class="aizatto_related_posts_title" ><a href="http://dita.ncsa.uiuc.edu/xllora/2007/11/26/reasoning-for-the-semantic-web/" rel="bookmark" title="Permanent Link: Reasoning for the semantic web" >Reasoning for the semantic web</a></span><div class="aizatto_related_posts_excerpt"></div></li><li><span class="aizatto_related_posts_title" ><a href="http://dita.ncsa.uiuc.edu/xllora/2008/03/26/bdcsg2008-text-information-management-challenges-and-oportunities-chengxiang-zhai/" rel="bookmark" title="Permanent Link: [BDCSG2008] Text Information Management: Challenges and Oportunities (ChengXiang Zhai)" >[BDCSG2008] Text Information Management: Challenges and Oportunities (ChengXiang Zhai)</a></span><div class="aizatto_related_posts_excerpt"></div></li><li><span class="aizatto_related_posts_title" ><a href="http://dita.ncsa.uiuc.edu/xllora/2008/03/26/bdcsg2008-nsf-plans-for-supporting-data-intensive-computing-jeannette-wing-and-christophe-bisciglia/" rel="bookmark" title="Permanent Link: [BDCSG2008] NSF Plans for Supporting Data Intensive Computing (Jeannette Wing and Christophe Bisciglia)" >[BDCSG2008] NSF Plans for Supporting Data Intensive Computing (Jeannette Wing and Christophe Bisciglia)</a></span><div class="aizatto_related_posts_excerpt"></div></li></ul></div>]]></content:encoded>
			<wfw:commentRss>http://dita.ncsa.uiuc.edu/xllora/2008/04/18/meandre-semantic-driven-data-intensive-flow-engine/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Generic looping in Python</title>
		<link>http://dita.ncsa.uiuc.edu/xllora/2008/01/16/generic-looping-in-python/</link>
		<comments>http://dita.ncsa.uiuc.edu/xllora/2008/01/16/generic-looping-in-python/#comments</comments>
		<pubDate>Wed, 16 Jan 2008 16:00:12 +0000</pubDate>
		<dc:creator>Xavier</dc:creator>
		
		<category><![CDATA[Notes]]></category>

		<guid isPermaLink="false">http://dita.ncsa.uiuc.edu/xllora/2008/01/16/generic-looping-in-python/</guid>
		<description><![CDATA[I just posted on my IlliGAL blog how to implement a generic genetic algorithm (GA) main loop squeezing the dynamic behavior of Python. Pretty sleek, if you have tweaked GA main you  main find this interesting 
Related PostsProfiling Python CodeReasoning for the semantic webVisualizing content from metadata stores]]></description>
			<content:encoded><![CDATA[<p>I just posted on my <a href="http://www.illigal.uiuc.edu/web/xllora/">IlliGAL blog</a> <a href="http://www.illigal.uiuc.edu/web/xllora/2008/01/16/a-simple-and-flexible-ga-loop-in-python/">how to implement a generic genetic algorithm (GA) main loop</a> squeezing the dynamic behavior of Python. Pretty sleek, if you have tweaked GA main you  main find this interesting <img src='http://dita.ncsa.uiuc.edu/xllora/wp-includes/images/smilies/icon_wink.gif' alt=';)' class='wp-smiley' /></p>
<div class="aizatto_related_posts"><span class="aizatto_related_posts_header" >Related Posts</span><ul><li><span class="aizatto_related_posts_title" ><a href="http://dita.ncsa.uiuc.edu/xllora/2008/01/10/profiling-python-code/" rel="bookmark" title="Permanent Link: Profiling Python Code" >Profiling Python Code</a></span><div class="aizatto_related_posts_excerpt"></div></li><li><span class="aizatto_related_posts_title" ><a href="http://dita.ncsa.uiuc.edu/xllora/2007/11/26/reasoning-for-the-semantic-web/" rel="bookmark" title="Permanent Link: Reasoning for the semantic web" >Reasoning for the semantic web</a></span><div class="aizatto_related_posts_excerpt"></div></li><li><span class="aizatto_related_posts_title" ><a href="http://dita.ncsa.uiuc.edu/xllora/2007/04/14/visualizing-content-from-metadata-stores/" rel="bookmark" title="Permanent Link: Visualizing content from metadata stores" >Visualizing content from metadata stores</a></span><div class="aizatto_related_posts_excerpt"></div></li></ul></div>]]></content:encoded>
			<wfw:commentRss>http://dita.ncsa.uiuc.edu/xllora/2008/01/16/generic-looping-in-python/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Profiling Python Code</title>
		<link>http://dita.ncsa.uiuc.edu/xllora/2008/01/10/profiling-python-code/</link>
		<comments>http://dita.ncsa.uiuc.edu/xllora/2008/01/10/profiling-python-code/#comments</comments>
		<pubDate>Thu, 10 Jan 2008 15:36:48 +0000</pubDate>
		<dc:creator>Xavier</dc:creator>
		
		<category><![CDATA[Notes]]></category>

		<guid isPermaLink="false">http://dita.ncsa.uiuc.edu/xllora/2008/01/10/profiling-python-code/</guid>
		<description><![CDATA[For the first time in 9 years, this vacation break I have done absolutely nothing. Wow what a coach potato I have become! Well that is not totally true, just for fun I started going over Python and, as usual, for any new language I end writing a simple genetic algorithm. I like the flexibility [...]]]></description>
			<content:encoded><![CDATA[<p>For the first time in 9 years, this vacation break I have done absolutely nothing. Wow what a coach potato I have become! Well that is not totally true, just for fun I started going over Python and, as usual, for any new language I end writing a simple genetic algorithm. I like the flexibility and compactness of the code (no verbose at all). However when I fire my first run (yes, the good old OneMax problem), I realized that some of my assumptions about coding did not directly transfer. Yes, it was a bit slow. So I started digging for a profiler and, surprise!, it comes with the Python interpreter. </p>
<p>Here is an example on how to run the profiling capabilities</p>
<p><code><br />
import cProfile<br />
cProfile.run('main()')<br />
</code></p>
<p>The cProfile module is a profile based coded in C. If you do not have it in your install you could run the same code with the profile module instead (highly likely to be in your install). Also if you are using Python &lt; 2.5 you may also want to use the profile instead (I read somewhere there was a bug on the cProfile, but I could not recall where I saw it). Below you can read the output of the profiler.</p>
<p><code><br />
         1246109 function calls (1096109 primitive calls) in 1.428 CPU seconds<br />
   Ordered by: standard name<br />
   ncalls  tottime  percall  cumtime  percall filename:lineno(function)<br />
        1    0.000    0.000    1.428    1.428 :1()<br />
156000/6000    0.366    0.000    0.905    0.000 copy.py:144(deepcopy)<br />
    29953    0.008    0.000    0.008    0.000 copy.py:197(_deepcopy_atomic)<br />
     6000    0.154    0.000    0.535    0.000 copy.py:223(_deepcopy_list)<br />
     6000    0.034    0.000    0.740    0.000 copy.py:250(_deepcopy_dict)<br />
    47953    0.105    0.000    0.131    0.000 copy.py:260(_keep_alive)<br />
     6000    0.040    0.000    0.861    0.000 copy.py:276(_deepcopy_inst)<br />
     3000    0.180    0.000    0.258    0.000 crossovers.py:6(uniformCrossover)<br />
     6600    0.005    0.000    0.017    0.000 fitnesses.py:5(oneMax)<br />
     6000    0.006    0.000    0.006    0.000 ind_n_pop_classes.py:16(__init__)<br />
       11    0.023    0.002    0.040    0.004 ind_n_pop_classes.py:35(evaluate)<br />
       10    0.004    0.000    1.071    0.107 ind_n_pop_classes.py:63(selection)<br />
       10    0.026    0.003    0.317    0.032 ind_n_pop_classes.py:67(crossover)<br />
    18011    0.074    0.000    0.079    0.000 random.py:147(randrange)<br />
    18011    0.023    0.000    0.102    0.000 random.py:211(randint)<br />
       10    0.081    0.008    1.067    0.107 selections.py:7(tournamentSelection)<br />
        1    0.001    0.001    1.428    1.428 test.py:39(main)<br />
    24000    0.033    0.000    0.033    0.000 {hasattr}<br />
   227953    0.042    0.000    0.042    0.000 {id}<br />
    18038    0.004    0.000    0.004    0.000 {len}<br />
    12008    0.004    0.000    0.004    0.000 {method 'add' of 'set' objects}<br />
   293953    0.085    0.000    0.085    0.000 {method 'append' of 'list' objects}<br />
        1    0.000    0.000    0.000    0.000 {method 'disable' of '_lsprof.Profiler' objects}<br />
   203953    0.061    0.000    0.061    0.000 {method 'get' of 'dict' objects}<br />
     6000    0.002    0.000    0.002    0.000 {method 'iteritems' of 'dict' objects}<br />
   141011    0.032    0.000    0.032    0.000 {method 'random' of '_random.Random' objects}<br />
     6000    0.006    0.000    0.006    0.000 {method 'update' of 'dict' objects}<br />
       21    0.000    0.000    0.000    0.000 {range}<br />
     6600    0.012    0.000    0.012    0.000 {sum}<br />
     3000    0.017    0.000    0.017    0.000 {zip}<br />
</code></p>
<p>Yes, I used the deepcopy method because it was nice and make my live easy. Yup, big mistake. That force my selection to take almost 67% of the overall execution time. Quite unacceptable. Thanks to the profiler, now I knew were to look for slowness and more important, I learn what Python blanks in my knowledge need to be improved <img src='http://dita.ncsa.uiuc.edu/xllora/wp-includes/images/smilies/icon_biggrin.gif' alt=':D' class='wp-smiley' /></p>
<div class="aizatto_related_posts"><span class="aizatto_related_posts_header" >Related Posts</span><ul><li><span class="aizatto_related_posts_title" ><a href="http://dita.ncsa.uiuc.edu/xllora/2008/01/16/generic-looping-in-python/" rel="bookmark" title="Permanent Link: Generic looping in Python" >Generic looping in Python</a></span><div class="aizatto_related_posts_excerpt"></div></li><li><span class="aizatto_related_posts_title" ><a href="http://dita.ncsa.uiuc.edu/xllora/2007/04/24/principal-component-analysis-in-r/" rel="bookmark" title="Permanent Link: Principal Component Analysis in R" >Principal Component Analysis in R</a></span><div class="aizatto_related_posts_excerpt"></div></li><li><span class="aizatto_related_posts_title" ><a href="http://dita.ncsa.uiuc.edu/xllora/2007/05/11/uniform-sampling-of-a-data-set/" rel="bookmark" title="Permanent Link: Uniform sampling of a data set" >Uniform sampling of a data set</a></span><div class="aizatto_related_posts_excerpt"></div></li></ul></div>]]></content:encoded>
			<wfw:commentRss>http://dita.ncsa.uiuc.edu/xllora/2008/01/10/profiling-python-code/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Reasoning for the semantic web</title>
		<link>http://dita.ncsa.uiuc.edu/xllora/2007/11/26/reasoning-for-the-semantic-web/</link>
		<comments>http://dita.ncsa.uiuc.edu/xllora/2007/11/26/reasoning-for-the-semantic-web/#comments</comments>
		<pubDate>Mon, 26 Nov 2007 15:04:04 +0000</pubDate>
		<dc:creator>Xavier</dc:creator>
		
		<category><![CDATA[Notes]]></category>

		<guid isPermaLink="false">http://dita.ncsa.uiuc.edu/xllora/2007/11/26/reasoning-for-the-semantic-web/</guid>
		<description><![CDATA[Jena 2 Inference Support is a nice introduction to the inference engines provided by the Jena package. Besides standardized reasoning for RDF and a subset of OWL/Lite and OWL/All ontologies, it also provides the mechanisms for creating your own rule-based inference engine using the generic rule-based inference also provided.

Powered by ScribeFire.
Related PostsMeandre: Semantic-Driven Data-Intensive Flow [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://jena.sourceforge.net/inference/">Jena 2 Inference Support</a> is a nice introduction to the inference engines provided by the <a href="http://jena.sourceforge.net/">Jena</a> package. Besides standardized reasoning for RDF and a subset of OWL/Lite and OWL/All ontologies, it also provides the mechanisms for creating your own rule-based inference engine using the generic rule-based inference also provided.</p>
<p>
<p>Powered by <a href="http://scribefire.com/">ScribeFire</a>.</p>
<div class="aizatto_related_posts"><span class="aizatto_related_posts_header" >Related Posts</span><ul><li><span class="aizatto_related_posts_title" ><a href="http://dita.ncsa.uiuc.edu/xllora/2008/04/18/meandre-semantic-driven-data-intensive-flow-engine/" rel="bookmark" title="Permanent Link: Meandre: Semantic-Driven Data-Intensive Flow Engine" >Meandre: Semantic-Driven Data-Intensive Flow Engine</a></span><div class="aizatto_related_posts_excerpt"></div></li><li><span class="aizatto_related_posts_title" ><a href="http://dita.ncsa.uiuc.edu/xllora/2008/03/26/bdcsg2008-text-information-management-challenges-and-oportunities-chengxiang-zhai/" rel="bookmark" title="Permanent Link: [BDCSG2008] Text Information Management: Challenges and Oportunities (ChengXiang Zhai)" >[BDCSG2008] Text Information Management: Challenges and Oportunities (ChengXiang Zhai)</a></span><div class="aizatto_related_posts_excerpt"></div></li><li><span class="aizatto_related_posts_title" ><a href="http://dita.ncsa.uiuc.edu/xllora/2007/07/29/ditaalgdiscus-vast-contest-entry/" rel="bookmark" title="Permanent Link: DITA+ALG+DISCUS = VAST contest entry" >DITA+ALG+DISCUS = VAST contest entry</a></span><div class="aizatto_related_posts_excerpt"></div></li></ul></div>]]></content:encoded>
			<wfw:commentRss>http://dita.ncsa.uiuc.edu/xllora/2007/11/26/reasoning-for-the-semantic-web/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Embedded Jetty applications</title>
		<link>http://dita.ncsa.uiuc.edu/xllora/2007/09/19/embedded-jetty-applications/</link>
		<comments>http://dita.ncsa.uiuc.edu/xllora/2007/09/19/embedded-jetty-applications/#comments</comments>
		<pubDate>Wed, 19 Sep 2007 14:25:00 +0000</pubDate>
		<dc:creator>Xavier</dc:creator>
		
		<category><![CDATA[Notes]]></category>

		<guid isPermaLink="false">http://dita.ncsa.uiuc.edu/xllora/2007/09/19/embedded-jetty-applications/</guid>
		<description><![CDATA[I am currently working on an open source project (do not ask me which since it will surface soon, and I should not talk much about it till it does ;)) that required to provide web access to apps, services, and contents. From my days fighting with Mulgara descriptors I remembered that Jetty (full-featured web [...]]]></description>
			<content:encoded><![CDATA[<p>I am currently working on an open source project (do not ask me which since it will surface soon, and I should not talk much about it till it does ;)) that required to provide web access to apps, services, and contents. From my days fighting with <a href="http://www.mulgara.org/">Mulgara</a> descriptors I remembered that <a href="http://www.mortbay.org/">Jetty</a> (full-featured web server implemented entirely in Java) could be embedded into applications to provide such services. It has been two months now since I started using, and it is a nice, shiny, and slick piece of software. I used <a href="http://tomcat.apache.org/">Tomcat</a> for most of my stuff, but Jetty is definitely and amazing alternative Below I just pasted one of the ways you can embed Jetty in your app.<br />
<code><br />
Server server = new Server(8080);<br />
Context root = new Context(server,"/",Context.SESSIONS);<br />
root.addServlet(new ServletHolder(new HelloServlet("Hello World!")), "/*");<br />
server.start();<br />
server.join();<br />
</code><br />
Yes, that&#8217;s it. You can also embed full-fledge multiple web apps using<br />
<code><br />
Server server = new Server();<br />
XmlConfiguration configuration = new XmlConfiguration(new File("myJetty.xml").toURL()); //or use new XmlConfiguration(new FileInputStream("myJetty.xml"));<br />
configuration.configure(server);<br />
server.start();<br />
server.join();<br />
</code><br />
Oh, and one last cool thing. You can remove apps from the server without needing to restart it! That is pretty useful.</p>
<div class="aizatto_related_posts"><span class="aizatto_related_posts_header" >Related Posts</span><ul><li><span class="aizatto_related_posts_title" ><a href="http://dita.ncsa.uiuc.edu/xllora/about/" rel="bookmark" title="Permanent Link: About" >About</a></span><div class="aizatto_related_posts_excerpt"></div></li><li><span class="aizatto_related_posts_title" ><a href="http://dita.ncsa.uiuc.edu/xllora/2007/04/14/we-finally-have-a-group-name/" rel="bookmark" title="Permanent Link: We finally have a group name!" >We finally have a group name!</a></span><div class="aizatto_related_posts_excerpt"></div></li><li><span class="aizatto_related_posts_title" ><a href="http://dita.ncsa.uiuc.edu/xllora/2008/03/26/summary-of-bdcsg2008-blogging/" rel="bookmark" title="Permanent Link: Summary of BDCSG2008 blogging" >Summary of BDCSG2008 blogging</a></span><div class="aizatto_related_posts_excerpt"></div></li></ul></div>]]></content:encoded>
			<wfw:commentRss>http://dita.ncsa.uiuc.edu/xllora/2007/09/19/embedded-jetty-applications/feed/</wfw:commentRss>
		</item>
		<item>
		<title>E2K blog has moved</title>
		<link>http://dita.ncsa.uiuc.edu/xllora/2007/08/02/e2k-blog-has-moved/</link>
		<comments>http://dita.ncsa.uiuc.edu/xllora/2007/08/02/e2k-blog-has-moved/#comments</comments>
		<pubDate>Thu, 02 Aug 2007 19:40:55 +0000</pubDate>
		<dc:creator>Xavier</dc:creator>
		
		<category><![CDATA[Notes]]></category>

		<guid isPermaLink="false">http://dita.ncsa.uiuc.edu/xllora/2007/08/02/e2k-blog-has-moved/</guid>
		<description><![CDATA[The E2K blog has moved. You can reach it at
http://dita.ncsa.uiuc.edu/e2k/
Related PostsMoving This BlogBlogging about NCSA PSPGeneric looping in Python]]></description>
			<content:encoded><![CDATA[<p>The E2K blog has moved. You can reach it at</p>
<p><a href="http://dita.ncsa.uiuc.edu/e2k/">http://dita.ncsa.uiuc.edu/e2k/</a></p>
<div class="aizatto_related_posts"><span class="aizatto_related_posts_header" >Related Posts</span><ul><li><span class="aizatto_related_posts_title" ><a href="http://dita.ncsa.uiuc.edu/xllora/2008/05/17/moving-this-blog/" rel="bookmark" title="Permanent Link: Moving This Blog" >Moving This Blog</a></span><div class="aizatto_related_posts_excerpt"></div></li><li><span class="aizatto_related_posts_title" ><a href="http://dita.ncsa.uiuc.edu/xllora/2007/05/22/blogging-about-ncsa-psp/" rel="bookmark" title="Permanent Link: Blogging about NCSA PSP" >Blogging about NCSA PSP</a></span><div class="aizatto_related_posts_excerpt"></div></li><li><span class="aizatto_related_posts_title" ><a href="http://dita.ncsa.uiuc.edu/xllora/2008/01/16/generic-looping-in-python/" rel="bookmark" title="Permanent Link: Generic looping in Python" >Generic looping in Python</a></span><div class="aizatto_related_posts_excerpt"></div></li></ul></div>]]></content:encoded>
			<wfw:commentRss>http://dita.ncsa.uiuc.edu/xllora/2007/08/02/e2k-blog-has-moved/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Loading RDF/XML files into Virtuoso&#8217;s metadata store</title>
		<link>http://dita.ncsa.uiuc.edu/xllora/2007/07/20/loading-rdfxml-files-into-virtuosos-metadata-store-2/</link>
		<comments>http://dita.ncsa.uiuc.edu/xllora/2007/07/20/loading-rdfxml-files-into-virtuosos-metadata-store-2/#comments</comments>
		<pubDate>Fri, 20 Jul 2007 14:19:07 +0000</pubDate>
		<dc:creator>Xavier</dc:creator>
		
		<category><![CDATA[Notes]]></category>

		<guid isPermaLink="false">http://dita.ncsa.uiuc.edu/xllora/2007/07/20/loading-rdfxml-files-into-virtuosos-metadata-store-2/</guid>
		<description><![CDATA[Bernie just put together this beauty to load small RDF/XML files into Virtuoso&#8217;s metadata store (We are using testing the open source version).

DB.DBA.RDF_LOAD_RDFXML(http_get ('URI to the RDF/XML file'),'','Name of the graph in the store');

We have tested loading a 5Million triple RDF/XML and results are pretty nice (It took around 6 minutes to load into a [...]]]></description>
			<content:encoded><![CDATA[<p>Bernie just put together this beauty to load small RDF/XML files into <a href="http://virtuoso.openlinksw.com/">Virtuoso</a>&#8217;s metadata store (We are using testing the <a href="http://sourceforge.net/projects/virtuoso/">open source version</a>).<br />
<code><br />
DB.DBA.RDF_LOAD_RDFXML(http_get ('URI to the RDF/XML file'),'','Name of the graph in the store');</p>
<p></code><br />
We have tested loading a 5Million triple RDF/XML and results are pretty nice (It took around 6 minutes to load into a dual Pentium 4 extreme edition at 3GHz with 4GB of RAM on a slow 7500rpm ext3fs). When pushing to larger files, the stream version of this is a must to reduce memory consumption.&nbsp;</p>
<div class="aizatto_related_posts"><span class="aizatto_related_posts_header" >Related Posts</span><ul><li><span class="aizatto_related_posts_title" ><a href="http://dita.ncsa.uiuc.edu/xllora/2007/04/14/visualizing-content-from-metadata-stores/" rel="bookmark" title="Permanent Link: Visualizing content from metadata stores" >Visualizing content from metadata stores</a></span><div class="aizatto_related_posts_excerpt"></div></li><li><span class="aizatto_related_posts_title" ><a href="http://dita.ncsa.uiuc.edu/xllora/about/" rel="bookmark" title="Permanent Link: About" >About</a></span><div class="aizatto_related_posts_excerpt"></div></li><li><span class="aizatto_related_posts_title" ><a href="http://dita.ncsa.uiuc.edu/xllora/2008/03/26/bdcsg2008-scientific-applications-of-large-databases-alex-szalay/" rel="bookmark" title="Permanent Link: [BDCSG2008] Scientific Applications of Large Databases (Alex Szalay)" >[BDCSG2008] Scientific Applications of Large Databases (Alex Szalay)</a></span><div class="aizatto_related_posts_excerpt"></div></li></ul></div>]]></content:encoded>
			<wfw:commentRss>http://dita.ncsa.uiuc.edu/xllora/2007/07/20/loading-rdfxml-files-into-virtuosos-metadata-store-2/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Reset user&#8217;s password on a MediaWiki</title>
		<link>http://dita.ncsa.uiuc.edu/xllora/2007/05/14/reset-users-password-on-a-mediawiki/</link>
		<comments>http://dita.ncsa.uiuc.edu/xllora/2007/05/14/reset-users-password-on-a-mediawiki/#comments</comments>
		<pubDate>Mon, 14 May 2007 23:31:47 +0000</pubDate>
		<dc:creator>Xavier</dc:creator>
		
		<category><![CDATA[Notes]]></category>

		<guid isPermaLink="false">http://dita.ncsa.uiuc.edu/xllora/2007/05/14/reset-users-password-on-a-mediawiki/</guid>
		<description><![CDATA[I needed to reset the password for a user on a MediaWiki site. Luckily, I run into this post &#8220;Reset a user password on MediaWiki - Greg&#8217;s Postgres stuff&#8221; which helps you to do so. The five-cent summary for a MySQL powered site:
UPDATE user SET user_password = md5(CONCAT('123-',md5('newpassword'))) WHERE user_id=123;
Related Posts[BDCSG2008] Simplicity and Complexity in [...]]]></description>
			<content:encoded><![CDATA[<p>I needed to reset the password for a user on a MediaWiki site. Luckily, I run into this post <a href="http://people.planetpostgresql.org/greg/index.php?/archives/74-Reset-a-user-password-on-MediaWiki.html">&#8220;Reset a user password on MediaWiki - Greg&#8217;s Postgres stuff&#8221;</a> which helps you to do so. The five-cent summary for a <a href="http://www.mysql.com/">MySQL</a> powered site:
<pre><code>UPDATE user SET user_password = md5(CONCAT('123-',md5('newpassword'))) WHERE user_id=123;</code></pre>
<div class="aizatto_related_posts"><span class="aizatto_related_posts_header" >Related Posts</span><ul><li><span class="aizatto_related_posts_title" ><a href="http://dita.ncsa.uiuc.edu/xllora/2008/03/26/bdcsg2008-simplicity-and-complexity-in-data-systems-garth-gibson/" rel="bookmark" title="Permanent Link: [BDCSG2008] Simplicity and Complexity in Data Systems (Garth Gibson)" >[BDCSG2008] Simplicity and Complexity in Data Systems (Garth Gibson)</a></span><div class="aizatto_related_posts_excerpt"></div></li><li><span class="aizatto_related_posts_title" ><a href="http://dita.ncsa.uiuc.edu/xllora/2008/03/26/bdcsg2008-clouds-and-manycores-the-revolution-dan-reed/" rel="bookmark" title="Permanent Link: [BDCSG2008] Clouds and ManyCores: The Revolution (Dan Reed)" >[BDCSG2008] Clouds and ManyCores: The Revolution (Dan Reed)</a></span><div class="aizatto_related_posts_excerpt"></div></li><li><span class="aizatto_related_posts_title" ><a href="http://dita.ncsa.uiuc.edu/xllora/2008/03/26/bdcsg2008-algorithmic-perspectives-on-large-scale-social-network-data-jon-kleinberg/" rel="bookmark" title="Permanent Link: [BDCSG2008] Algorithmic Perspectives on Large-Scale Social Network Data (Jon Kleinberg)" >[BDCSG2008] Algorithmic Perspectives on Large-Scale Social Network Data (Jon Kleinberg)</a></span><div class="aizatto_related_posts_excerpt"></div></li></ul></div>]]></content:encoded>
			<wfw:commentRss>http://dita.ncsa.uiuc.edu/xllora/2007/05/14/reset-users-password-on-a-mediawiki/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Uniform sampling of a data set</title>
		<link>http://dita.ncsa.uiuc.edu/xllora/2007/05/11/uniform-sampling-of-a-data-set/</link>
		<comments>http://dita.ncsa.uiuc.edu/xllora/2007/05/11/uniform-sampling-of-a-data-set/#comments</comments>
		<pubDate>Fri, 11 May 2007 13:20:34 +0000</pubDate>
		<dc:creator>Xavier</dc:creator>
		
		<category><![CDATA[Notes]]></category>

		<guid isPermaLink="false">http://dita.ncsa.uiuc.edu/xllora/2007/05/11/uniform-sampling-of-a-data-set/</guid>
		<description><![CDATA[Sometimes you may need to sample a dataset. You may want to get a uniformly sampled subset out of a datatset stored in a file. The perlscript below does the job for you.


#!/usr/bin/perl -w
if ( $#ARGV!=1 ) {
        print "Wrong number of arguments\n\t".
      [...]]]></description>
			<content:encoded><![CDATA[<p>Sometimes you may need to sample a dataset. You may want to get a uniformly sampled subset out of a datatset stored in a file. The perlscript below does the job for you.</p>
<pre>
<code>
#!/usr/bin/perl -w
if ( $#ARGV!=1 ) {
        print "Wrong number of arguments\n\t".
                "uniform-sampler.pl &lt;file&gt; &lt;sample_proportion&gt;\n";
}
else {
        srand();
        open(FILE,$ARGV[0]) or die "File $ARGV[0] could not be open";
        while($line=&lt;FILE&gt;) {
                if ( rand()&lt;$ARGV[1] ) {
                                print $line;
                }
        }
        close FILE;
}
1;
</code>
</pre>
<div class="aizatto_related_posts"><span class="aizatto_related_posts_header" >Related Posts</span><ul><li><span class="aizatto_related_posts_title" ><a href="http://dita.ncsa.uiuc.edu/xllora/2008/03/26/bdcsg2008-mining-the-web-graph-marc-najork/" rel="bookmark" title="Permanent Link: [BDCSG2008] Mining the Web Graph (Marc Najork)" >[BDCSG2008] Mining the Web Graph (Marc Najork)</a></span><div class="aizatto_related_posts_excerpt"></div></li><li><span class="aizatto_related_posts_title" ><a href="http://dita.ncsa.uiuc.edu/xllora/2008/03/26/bdcsg2008-sherpa-cloud-computing-of-the-third-kind-raghu-ramakrishnan/" rel="bookmark" title="Permanent Link: [BDCSG2008] Sherpa: Cloud Computing of the Third Kind (Raghu Ramakrishnan)" >[BDCSG2008] Sherpa: Cloud Computing of the Third Kind (Raghu Ramakrishnan)</a></span><div class="aizatto_related_posts_excerpt"></div></li></ul></div>]]></content:encoded>
			<wfw:commentRss>http://dita.ncsa.uiuc.edu/xllora/2007/05/11/uniform-sampling-of-a-data-set/feed/</wfw:commentRss>
		</item>
	</channel>
</rss>
