Groups | Search | Server Info | Keyboard shortcuts | Login | Register [http] [https] [nntp] [nntps]


Groups > comp.programming.threads > #1972

I have come to an interresting subject

Path csiph.com!v102.xanadu-bbs.net!xanadu-bbs.net!us.feeder.erje.net!feeder.erje.net!eu.feeder.erje.net!news.albasani.net!.POSTED!not-for-mail
From aminer <aminer@toto.net>
Newsgroups comp.programming.threads, comp.programming
Subject I have come to an interresting subject
Date Mon, 28 Oct 2013 21:19:05 -0700
Organization albasani.net
Lines 56
Message-ID <l4n2cd$2uk$1@news.albasani.net> (permalink)
Mime-Version 1.0
Content-Type text/plain; charset=ISO-8859-1; format=flowed
Content-Transfer-Encoding 7bit
X-Trace news.albasani.net WtGjtLI1J6zSAX6sxOkOi8bpY24n9lzOhQjeFu1xychgrxCoWL+04Cv4XHcObQMmkYGZRKDgyg1/PU1yqDC3yw==
NNTP-Posting-Date Tue, 29 Oct 2013 01:18:05 +0000 (UTC)
Injection-Info news.albasani.net; logging-data="g+XwPDOMXJZ9e3rmlGB67YbsTGrci1y4rN9u+AprEdRsEhUtPpyxtwQ/59iqf4Kj/BE4Wk4T1wW+ir3PsyP5phP9hZekiP0mWiHOfQFGXbGlKCJHgNzRTH/dvS03Fl/z"; mail-complaints-to="abuse@albasani.net"
User-Agent Mozilla/5.0 (Windows NT 6.0; WOW64; rv:24.0) Gecko/20100101 Thunderbird/24.0.1
Cancel-Lock sha1:XnWzxSP7cCulGgvlz24DViy+jRQ=
Xref csiph.com comp.programming.threads:1972 comp.programming:3988

Cross-posted to 2 groups.

Show key headers only | View raw


Hello,

I have come to an interresting subject, so be smart and follow with me
please...

I have tried to do a worst scalability prediction with an HDD hardisk
for my parallel archiver(you will find my parallel archiver here:
http://pages.videotron.com/aminer/)  with Parallel LZMA, and i think 
it's worst than what i have thought..

there is four things in my Parallel LZMA algorithm:

First we have to copy serially a stream from the hardisk to the memory
and this will take in average 0.9 second and in the compression method 
we have to copy a stream to the memory and this will take in average 
0.01 second and in the compression method you have to compress a stream 
to another stream in memory and this will take in average 3.1 seconds
and in the compression method you have to copy a compressed stream to a 
hardisk file and this will take in average 0.01 second.

So we have the serial part that is: 0.9 second + 0.01 second  + 0.01 second
and the parallel part will that is: 3.1 second


So the worst case scalability scenario using an HDD and using the Amdahl 
equation will
give us: 1 / 0.22% + (0.77%/N) (N: is the number of cores)

So this will scale up to: 4.54X , so as you have noticed with an HDD 
hardisk this not good..

So what can we do to scale more parallel archiver using parallel LZMA ?

You can for example use a RAID 10 with a base configuration of 4 HDD 
hardrives, so this will cut in 4 the 0.9 second and the 0.01 second , so 
  this will give a scalability of 16.9X and this is better.. but
to speed more the things we can use SSD hardrives that are 2X time 
faster than a HDD hardrives using  a RAID 10
configuration and this will give: 35X worst case scalability.

So as you have noticed if you are using only an HHD
with a multicore system you will not get more than 4.54X with my 
parallel archiver using parallel LZMA, so you have to use a RAID 10 with 
SSD drives to scale it up to 35X.


And this is why i have talked about RAID 10 etc.



Thank you,
Amine Moulay Ramdane.



Back to comp.programming.threads | Previous | Next | Find similar


Thread

I have come to an interresting subject aminer <aminer@toto.net> - 2013-10-28 21:19 -0700

csiph-web