<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	>
<channel>
	<title>Comments on: Design details of Audiogalaxy.com&#8217;s high performance MySQL search engine</title>
	<atom:link href="http://www.spiteful.com/2008/02/29/design-details-of-audiogalaxycoms-high-performance-mysql-search-engine/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.spiteful.com/2008/02/29/design-details-of-audiogalaxycoms-high-performance-mysql-search-engine/</link>
	<description></description>
	<pubDate>Wed, 23 Jul 2008 21:52:46 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.5</generator>
		<item>
		<title>By: Adding Search with Lucene &#124; Ajaxonomy</title>
		<link>http://www.spiteful.com/2008/02/29/design-details-of-audiogalaxycoms-high-performance-mysql-search-engine/#comment-1102</link>
		<dc:creator>Adding Search with Lucene &#124; Ajaxonomy</dc:creator>
		<pubDate>Mon, 30 Jun 2008 03:45:44 +0000</pubDate>
		<guid isPermaLink="false">http://www.spiteful.com/2008/02/29/design-details-of-audiogalaxycoms-high-performance-mysql-search-engine/#comment-1102</guid>
		<description>[...] need to build my own search engine.This is not really that hard to do. I would recommend you read some articles and then download Managing Gigabytes for Java. Those articles are by Tom from AudioGalaxy. [...]</description>
		<content:encoded><![CDATA[<p>[&#8230;] need to build my own search engine.This is not really that hard to do. I would recommend you read some articles and then download Managing Gigabytes for Java. Those articles are by Tom from AudioGalaxy. [&#8230;]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Ueberbach JP</title>
		<link>http://www.spiteful.com/2008/02/29/design-details-of-audiogalaxycoms-high-performance-mysql-search-engine/#comment-1086</link>
		<dc:creator>Ueberbach JP</dc:creator>
		<pubDate>Fri, 27 Jun 2008 01:19:09 +0000</pubDate>
		<guid isPermaLink="false">http://www.spiteful.com/2008/02/29/design-details-of-audiogalaxycoms-high-performance-mysql-search-engine/#comment-1086</guid>
		<description>I agree on the fact than when you want real speed you should be able to programm in C(++).  We have kinda similar structure on mercuriusguides datacenter with a diffence in the webserver cluster; our webserver only outputs xml. The rendering servers which render the xml with xsl to html are located all over the world on (paid) hosting servers. The xml to be rendered is outputted, level 9 compressed, from our datacenter. Just another way to keep costs down (power,cooling,bandwidth,rackspace, etc...) and be more efficient. We have currently about 80000 unique users per day, about 10 million page views per month. Cpu of mysql database always below 5%. Especially the plugin structure of mysql is very usefull to increase performance. Hoerai for that.</description>
		<content:encoded><![CDATA[<p>I agree on the fact than when you want real speed you should be able to programm in C(++).  We have kinda similar structure on mercuriusguides datacenter with a diffence in the webserver cluster; our webserver only outputs xml. The rendering servers which render the xml with xsl to html are located all over the world on (paid) hosting servers. The xml to be rendered is outputted, level 9 compressed, from our datacenter. Just another way to keep costs down (power,cooling,bandwidth,rackspace, etc&#8230;) and be more efficient. We have currently about 80000 unique users per day, about 10 million page views per month. Cpu of mysql database always below 5%. Especially the plugin structure of mysql is very usefull to increase performance. Hoerai for that.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Large Mysql Database</title>
		<link>http://www.spiteful.com/2008/02/29/design-details-of-audiogalaxycoms-high-performance-mysql-search-engine/#comment-261</link>
		<dc:creator>Large Mysql Database</dc:creator>
		<pubDate>Sun, 06 Apr 2008 15:47:44 +0000</pubDate>
		<guid isPermaLink="false">http://www.spiteful.com/2008/02/29/design-details-of-audiogalaxycoms-high-performance-mysql-search-engine/#comment-261</guid>
		<description>found your site on del.icio.us today and really liked it.. i bookmarked it and will be back to check it out some more later ..</description>
		<content:encoded><![CDATA[<p>found your site on del.icio.us today and really liked it.. i bookmarked it and will be back to check it out some more later ..</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Tom</title>
		<link>http://www.spiteful.com/2008/02/29/design-details-of-audiogalaxycoms-high-performance-mysql-search-engine/#comment-97</link>
		<dc:creator>Tom</dc:creator>
		<pubDate>Sat, 15 Mar 2008 17:48:02 +0000</pubDate>
		<guid isPermaLink="false">http://www.spiteful.com/2008/02/29/design-details-of-audiogalaxycoms-high-performance-mysql-search-engine/#comment-97</guid>
		<description>@Vijay:  Thanks for the pointer -- I'll dig through the Nutch code and see what I can find.</description>
		<content:encoded><![CDATA[<p>@Vijay:  Thanks for the pointer &#8212; I&#8217;ll dig through the Nutch code and see what I can find.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Vijay Chakravarthy</title>
		<link>http://www.spiteful.com/2008/02/29/design-details-of-audiogalaxycoms-high-performance-mysql-search-engine/#comment-82</link>
		<dc:creator>Vijay Chakravarthy</dc:creator>
		<pubDate>Sat, 15 Mar 2008 02:06:45 +0000</pubDate>
		<guid isPermaLink="false">http://www.spiteful.com/2008/02/29/design-details-of-audiogalaxycoms-high-performance-mysql-search-engine/#comment-82</guid>
		<description>Regarding the indexing for situations where the dictionary doesnt fit into memory, basically a standard approach is to index every nth entry of the dictionary, and then have a prefix based compressed representation of the n+1 to 2n th terms of the dictionary. Obviously this adds another disk seek to get the postings, but it works ok in practice especially if you cache the postings. 

I would look at some of the nutch source code, since it has scaled pretty well for large internet searches.</description>
		<content:encoded><![CDATA[<p>Regarding the indexing for situations where the dictionary doesnt fit into memory, basically a standard approach is to index every nth entry of the dictionary, and then have a prefix based compressed representation of the n+1 to 2n th terms of the dictionary. Obviously this adds another disk seek to get the postings, but it works ok in practice especially if you cache the postings. </p>
<p>I would look at some of the nutch source code, since it has scaled pretty well for large internet searches.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Lessons Learned Scaling the Audiogalaxy Search Engine &#124; Spiteful.com</title>
		<link>http://www.spiteful.com/2008/02/29/design-details-of-audiogalaxycoms-high-performance-mysql-search-engine/#comment-46</link>
		<dc:creator>Lessons Learned Scaling the Audiogalaxy Search Engine &#124; Spiteful.com</dc:creator>
		<pubDate>Sun, 09 Mar 2008 03:45:12 +0000</pubDate>
		<guid isPermaLink="false">http://www.spiteful.com/2008/02/29/design-details-of-audiogalaxycoms-high-performance-mysql-search-engine/#comment-46</guid>
		<description>[...] About           &#171; Design details of Audiogalaxy.com&#8217;s high performance MySQL search engine [...]</description>
		<content:encoded><![CDATA[<p>[&#8230;] About           &laquo; Design details of Audiogalaxy.com&#8217;s high performance MySQL search engine [&#8230;]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Tom</title>
		<link>http://www.spiteful.com/2008/02/29/design-details-of-audiogalaxycoms-high-performance-mysql-search-engine/#comment-35</link>
		<dc:creator>Tom</dc:creator>
		<pubDate>Thu, 06 Mar 2008 20:42:21 +0000</pubDate>
		<guid isPermaLink="false">http://www.spiteful.com/2008/02/29/design-details-of-audiogalaxycoms-high-performance-mysql-search-engine/#comment-35</guid>
		<description>@Jonathan:  Hey -- good to hear from you.  I won't call it a problem, but that was certainly a characteristic of AG (and possibly most start ups): stuff really wasn't documented.  In the time it took me to write that article I could have knocked out a few new features.</description>
		<content:encoded><![CDATA[<p>@Jonathan:  Hey &#8212; good to hear from you.  I won&#8217;t call it a problem, but that was certainly a characteristic of AG (and possibly most start ups): stuff really wasn&#8217;t documented.  In the time it took me to write that article I could have knocked out a few new features.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jonathan</title>
		<link>http://www.spiteful.com/2008/02/29/design-details-of-audiogalaxycoms-high-performance-mysql-search-engine/#comment-34</link>
		<dc:creator>Jonathan</dc:creator>
		<pubDate>Thu, 06 Mar 2008 15:27:14 +0000</pubDate>
		<guid isPermaLink="false">http://www.spiteful.com/2008/02/29/design-details-of-audiogalaxycoms-high-performance-mysql-search-engine/#comment-34</guid>
		<description>Nice write up Tom. I think I learned more about the design details of AG from this post than from working for AG for over a year.</description>
		<content:encoded><![CDATA[<p>Nice write up Tom. I think I learned more about the design details of AG from this post than from working for AG for over a year.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Tom</title>
		<link>http://www.spiteful.com/2008/02/29/design-details-of-audiogalaxycoms-high-performance-mysql-search-engine/#comment-33</link>
		<dc:creator>Tom</dc:creator>
		<pubDate>Tue, 04 Mar 2008 17:16:11 +0000</pubDate>
		<guid isPermaLink="false">http://www.spiteful.com/2008/02/29/design-details-of-audiogalaxycoms-high-performance-mysql-search-engine/#comment-33</guid>
		<description>I recently read through this and a few of your other posts.  It's always fun reading how others create hugely scalable systems.


"The lengths people will go instead of just using a scaleable PostgreSQL server…"

Haha, this was my thought!  Although managing to scale a PHP/MySQL solution to 70 million searches per day is quite amazing and certainly explains the need for a completely custom-coded C system and transfer protocol.  A technological achievement to be sure.</description>
		<content:encoded><![CDATA[<p>I recently read through this and a few of your other posts.  It&#8217;s always fun reading how others create hugely scalable systems.</p>
<p>&#8220;The lengths people will go instead of just using a scaleable PostgreSQL server…&#8221;</p>
<p>Haha, this was my thought!  Although managing to scale a PHP/MySQL solution to 70 million searches per day is quite amazing and certainly explains the need for a completely custom-coded C system and transfer protocol.  A technological achievement to be sure.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Uri L.</title>
		<link>http://www.spiteful.com/2008/02/29/design-details-of-audiogalaxycoms-high-performance-mysql-search-engine/#comment-28</link>
		<dc:creator>Uri L.</dc:creator>
		<pubDate>Sun, 02 Mar 2008 14:07:59 +0000</pubDate>
		<guid isPermaLink="false">http://www.spiteful.com/2008/02/29/design-details-of-audiogalaxycoms-high-performance-mysql-search-engine/#comment-28</guid>
		<description>thanks for sharing, was a great reading.</description>
		<content:encoded><![CDATA[<p>thanks for sharing, was a great reading.</p>
]]></content:encoded>
	</item>
</channel>
</rss>

<!-- Dynamic Page Served (once) in 0.361 seconds -->
