<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: &#8230; some paralel processing tests using Microsoft Accelerator</title>
	<atom:link href="http://sorin.serbans.net/blog/index.php/2009/05/07/some-paralel-processing-tests-using-microsoft-accelerator/feed/" rel="self" type="application/rss+xml" />
	<link>http://sorin.serbans.net/blog/index.php/2009/05/07/some-paralel-processing-tests-using-microsoft-accelerator/</link>
	<description>Look at the spoon ... There are no bugs</description>
	<lastBuildDate>Sun, 11 Jul 2010 22:42:32 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0</generator>
	<item>
		<title>By: admin</title>
		<link>http://sorin.serbans.net/blog/index.php/2009/05/07/some-paralel-processing-tests-using-microsoft-accelerator/comment-page-1/#comment-5082</link>
		<dc:creator>admin</dc:creator>
		<pubDate>Fri, 20 Nov 2009 13:39:22 +0000</pubDate>
		<guid isPermaLink="false">http://sorin.serbans.net/blog/?p=99#comment-5082</guid>
		<description>Hi JLafleur

Unfortunately I miss your point. The ideea was to have the same operations in the timed loops. I was not trying to also have the data avaialble somewhere. Maybe the operations should look like:

                for (int i = 0; i &lt; dimension; i++)
                {
                    float temp = matrix1[i] + matrix2[i]; 
                }
and
                for (int k = 0; k &lt; repetitions; k++)
                {
                    ParallelArrays.Add(pmatrix1, pmatrix2);
                }

I was pointing out just raw power not access latencies.</description>
		<content:encoded><![CDATA[<p>Hi JLafleur</p>
<p>Unfortunately I miss your point. The ideea was to have the same operations in the timed loops. I was not trying to also have the data avaialble somewhere. Maybe the operations should look like:</p>
<p>                for (int i = 0; i < dimension; i++)<br />
                {<br />
                    float temp = matrix1[i] + matrix2[i];<br />
                }<br />
and<br />
                for (int k = 0; k < repetitions; k++)<br />
                {<br />
                    ParallelArrays.Add(pmatrix1, pmatrix2);<br />
                }</p>
<p>I was pointing out just raw power not access latencies.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: JLafleur</title>
		<link>http://sorin.serbans.net/blog/index.php/2009/05/07/some-paralel-processing-tests-using-microsoft-accelerator/comment-page-1/#comment-5070</link>
		<dc:creator>JLafleur</dc:creator>
		<pubDate>Wed, 18 Nov 2009 20:16:49 +0000</pubDate>
		<guid isPermaLink="false">http://sorin.serbans.net/blog/?p=99#comment-5070</guid>
		<description>i think its not fair, do deklarate die ParallelArrays only once

 DisposableFloatParallelArray pmatrix1 = new DisposableFloatParallelArray(matrix1);
 DisposableFloatParallelArray pmatrix2 = new DisposableFloatParallelArray(matrix2);
 DisposableFloatParallelArray pmatrix3 = new DisposableFloatParallelArray(matrix3);
//....
for loop

and, after the calculation with the normal forLoop u have the result in the maxtrix3-Array

in the Gpu-calculation u DONT have the managed float array after calculation, so u cant use it to display it or so.

when u want to show, that one algorithm is faster than another, both have to do the same.
so put the array deklaration oft the ParralelArrays into the loop, and write the result back in a managed float Array.
and u ll see, that its much slower than the normal calculation

(sry, my english ist worse :) )</description>
		<content:encoded><![CDATA[<p>i think its not fair, do deklarate die ParallelArrays only once</p>
<p> DisposableFloatParallelArray pmatrix1 = new DisposableFloatParallelArray(matrix1);<br />
 DisposableFloatParallelArray pmatrix2 = new DisposableFloatParallelArray(matrix2);<br />
 DisposableFloatParallelArray pmatrix3 = new DisposableFloatParallelArray(matrix3);<br />
//&#8230;.<br />
for loop</p>
<p>and, after the calculation with the normal forLoop u have the result in the maxtrix3-Array</p>
<p>in the Gpu-calculation u DONT have the managed float array after calculation, so u cant use it to display it or so.</p>
<p>when u want to show, that one algorithm is faster than another, both have to do the same.<br />
so put the array deklaration oft the ParralelArrays into the loop, and write the result back in a managed float Array.<br />
and u ll see, that its much slower than the normal calculation</p>
<p>(sry, my english ist worse <img src='http://sorin.serbans.net/blog/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  )</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Ricardo Bartolomeu</title>
		<link>http://sorin.serbans.net/blog/index.php/2009/05/07/some-paralel-processing-tests-using-microsoft-accelerator/comment-page-1/#comment-3737</link>
		<dc:creator>Ricardo Bartolomeu</dc:creator>
		<pubDate>Sun, 05 Jul 2009 18:07:06 +0000</pubDate>
		<guid isPermaLink="false">http://sorin.serbans.net/blog/?p=99#comment-3737</guid>
		<description>Cheers ;)

Try matrix multiplication instead, (since it is computationaly harsher). ;)
I have a GeForce Go 7400 and my computation limit is bound to 4096 elements. 
Which is very low. =(</description>
		<content:encoded><![CDATA[<p>Cheers <img src='http://sorin.serbans.net/blog/wp-includes/images/smilies/icon_wink.gif' alt=';)' class='wp-smiley' /> </p>
<p>Try matrix multiplication instead, (since it is computationaly harsher). <img src='http://sorin.serbans.net/blog/wp-includes/images/smilies/icon_wink.gif' alt=';)' class='wp-smiley' /><br />
I have a GeForce Go 7400 and my computation limit is bound to 4096 elements.<br />
Which is very low. =(</p>
]]></content:encoded>
	</item>
</channel>
</rss>
