<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: How to mirror Wikipedia</title>
	<atom:link href="http://www.igeek.co.za/2009/10/16/how-to-mirror-wikipedia/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.igeek.co.za/2009/10/16/how-to-mirror-wikipedia/</link>
	<description>A Capetonian geek living the life</description>
	<lastBuildDate>Thu, 02 Feb 2012 05:51:41 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
<xhtml:meta xmlns:xhtml="http://www.w3.org/1999/xhtml" name="robots" content="noindex" />
	<item>
		<title>By: pdh</title>
		<link>http://www.igeek.co.za/2009/10/16/how-to-mirror-wikipedia/comment-page-1/#comment-285045</link>
		<dc:creator>pdh</dc:creator>
		<pubDate>Tue, 17 Jan 2012 09:06:35 +0000</pubDate>
		<guid isPermaLink="false">http://www.igeek.co.za/?p=476#comment-285045</guid>
		<description>Is anybody hosting a wikipedia mirror that is accessible on-line?</description>
		<content:encoded><![CDATA[<p>Is anybody hosting a wikipedia mirror that is accessible on-line?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Nick</title>
		<link>http://www.igeek.co.za/2009/10/16/how-to-mirror-wikipedia/comment-page-1/#comment-284664</link>
		<dc:creator>Nick</dc:creator>
		<pubDate>Tue, 17 Jan 2012 01:59:23 +0000</pubDate>
		<guid isPermaLink="false">http://www.igeek.co.za/?p=476#comment-284664</guid>
		<description>For the two users above me, the images aren&#039;t available. Longer explanation here:
http://en.wikipedia.org/wiki/Wikipedia:Database_download#Where_are_images_and_uploaded_files</description>
		<content:encoded><![CDATA[<p>For the two users above me, the images aren&#8217;t available. Longer explanation here:<br />
<a href="http://en.wikipedia.org/wiki/Wikipedia:Database_download#Where_are_images_and_uploaded_files" rel="nofollow">http://en.wikipedia.org/wiki/Wikipedia:Database_download#Where_are_images_and_uploaded_files</a></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Matt</title>
		<link>http://www.igeek.co.za/2009/10/16/how-to-mirror-wikipedia/comment-page-1/#comment-272919</link>
		<dc:creator>Matt</dc:creator>
		<pubDate>Mon, 02 Jan 2012 07:19:10 +0000</pubDate>
		<guid isPermaLink="false">http://www.igeek.co.za/?p=476#comment-272919</guid>
		<description>unfortunately no..  i looked around for an answer to this problem but couldn&#039;t find one.. if you find a way to get all the images, please do share!







1</description>
		<content:encoded><![CDATA[<p>unfortunately no..  i looked around for an answer to this problem but couldn&#8217;t find one.. if you find a way to get all the images, please do share!</p>
<p>1</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Zach</title>
		<link>http://www.igeek.co.za/2009/10/16/how-to-mirror-wikipedia/comment-page-1/#comment-272735</link>
		<dc:creator>Zach</dc:creator>
		<pubDate>Mon, 02 Jan 2012 03:04:21 +0000</pubDate>
		<guid isPermaLink="false">http://www.igeek.co.za/?p=476#comment-272735</guid>
		<description>Does this include all the images as well?</description>
		<content:encoded><![CDATA[<p>Does this include all the images as well?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Matt</title>
		<link>http://www.igeek.co.za/2009/10/16/how-to-mirror-wikipedia/comment-page-1/#comment-233475</link>
		<dc:creator>Matt</dc:creator>
		<pubDate>Mon, 14 Nov 2011 22:34:20 +0000</pubDate>
		<guid isPermaLink="false">http://www.igeek.co.za/?p=476#comment-233475</guid>
		<description>excellent walk through, and the only definitive guide that i could find. thank you very much.

a pretty consistant link to download the latest wikipedia pages would be:
http://dumps.wikimedia.org/enwiki/latest/enwiki-latest-pages-articles.xml.bz2

in the end i had a little bit of trouble trying to get mwimport to be recognized/found.  make sure to use the whole file name &quot;mwimport.sh&quot;</description>
		<content:encoded><![CDATA[<p>excellent walk through, and the only definitive guide that i could find. thank you very much.</p>
<p>a pretty consistant link to download the latest wikipedia pages would be:<br />
<a href="http://dumps.wikimedia.org/enwiki/latest/enwiki-latest-pages-articles.xml.bz2" rel="nofollow">http://dumps.wikimedia.org/enwiki/latest/enwiki-latest-pages-articles.xml.bz2</a></p>
<p>in the end i had a little bit of trouble trying to get mwimport to be recognized/found.  make sure to use the whole file name &#8220;mwimport.sh&#8221;</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Ken D'Ambrosio</title>
		<link>http://www.igeek.co.za/2009/10/16/how-to-mirror-wikipedia/comment-page-1/#comment-52461</link>
		<dc:creator>Ken D'Ambrosio</dc:creator>
		<pubDate>Wed, 23 Feb 2011 21:30:28 +0000</pubDate>
		<guid isPermaLink="false">http://www.igeek.co.za/?p=476#comment-52461</guid>
		<description>Couple things:
1) Yes, a tweak to /etc/mysql/my.cnf was required; I changed max_packet_size to 128 MB.  (It was 16 MB.  128 was probably overkill... but hey -- better safe than sorry.)
2) No need to de-compress the .bz2 -- and I don&#039;t know if you could even *do* that with tar, since a .bz2 is a bzip&#039;d file, and not a tar archive.  Instead, I used the following:
bzcat enwiki-[...]-pages-articles.xml.bz2 &#124; mwimport &#124; mysql -p -f -u  
3) Note that you can&#039;t just do a &quot;wget&quot; on the mwimport link above -- that&#039;s a link to a mediawiki page that, in turn, has text you need to stuff into an executable, and then chmod +x on.</description>
		<content:encoded><![CDATA[<p>Couple things:<br />
1) Yes, a tweak to /etc/mysql/my.cnf was required; I changed max_packet_size to 128 MB.  (It was 16 MB.  128 was probably overkill&#8230; but hey &#8212; better safe than sorry.)<br />
2) No need to de-compress the .bz2 &#8212; and I don&#8217;t know if you could even *do* that with tar, since a .bz2 is a bzip&#8217;d file, and not a tar archive.  Instead, I used the following:<br />
bzcat enwiki-[...]-pages-articles.xml.bz2 | mwimport | mysql -p -f -u<br />
3) Note that you can&#8217;t just do a &#8220;wget&#8221; on the mwimport link above &#8212; that&#8217;s a link to a mediawiki page that, in turn, has text you need to stuff into an executable, and then chmod +x on.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Dan</title>
		<link>http://www.igeek.co.za/2009/10/16/how-to-mirror-wikipedia/comment-page-1/#comment-567</link>
		<dc:creator>Dan</dc:creator>
		<pubDate>Mon, 14 Dec 2009 04:30:17 +0000</pubDate>
		<guid isPermaLink="false">http://www.igeek.co.za/?p=476#comment-567</guid>
		<description>Just curious, how much memory was on the system that you did this on, and did you modify any of the settings for mySQL?</description>
		<content:encoded><![CDATA[<p>Just curious, how much memory was on the system that you did this on, and did you modify any of the settings for mySQL?</p>
]]></content:encoded>
	</item>
</channel>
</rss>

