FAQ Search Today's Posts Mark Forums Read
» Video Reviews

» Linux Archive

Linux-archive is a website aiming to archive linux email lists and to make them easily accessible for linux users/developers.


» Sponsor

» Partners

» Sponsor

Go Back   Linux Archive > Debian > Debian User

 
 
LinkBack Thread Tools
 
Old 12-22-2008, 10:37 AM
Tzafrir Cohen
 
Default Parallel GZIP

On Mon, Dec 22, 2008 at 01:10:04PM +0200, Volkan YAZICI wrote:
> Hi,
>
> Are there any compression tools in the official Debian GNU/Linux
> repositories that can take the benefit of multiple cores/cpus?

Could you please giva a usage scenario?

Is the task really CPU bound and not i/o bound?

--
Tzafrir Cohen | tzafrir@jabber.org | VIM is
http://tzafrir.org.il | | a Mutt's
tzafrir@cohens.org.il | | best
ICQ# 16849754 | | friend


--
To UNSUBSCRIBE, email to debian-user-REQUEST@lists.debian.org
with a subject of "unsubscribe". Trouble? Contact listmaster@lists.debian.org
 
Old 12-22-2008, 11:48 AM
Volkan YAZICI
 
Default Parallel GZIP

On Mon, 22 Dec 2008, Tzafrir Cohen <tzafrir@cohens.org.il> writes:
> Could you please giva a usage scenario?

Compressing a database backup file of size 300GB.

> Is the task really CPU bound and not i/o bound?

"top -d 1" tells me that CPU usage is 800%. (BTW, have you ever met with
an input file that is not CPU bound, but I/O bound -- discarding
specifically generated worst-case scenario inputs.)


Regards.


--
To UNSUBSCRIBE, email to debian-user-REQUEST@lists.debian.org
with a subject of "unsubscribe". Trouble? Contact listmaster@lists.debian.org
 
Old 12-22-2008, 11:53 AM
"Eugene V. Lyubimkin"
 
Default Parallel GZIP

Volkan YAZICI wrote:
> Hi,
>
> Are there any compression tools in the official Debian GNU/Linux
> repositories that can take the benefit of multiple cores/cpus?
pbzip2?

--
Eugene V. Lyubimkin aka JackYF, JID: jackyf.devel(maildog)gmail.com
Ukrainian C++ Developer, Debian Maintainer, APT contributor
 
Old 12-22-2008, 12:16 PM
Osamu Aoki
 
Default Parallel GZIP

Hi,

On Mon, Dec 22, 2008 at 02:48:41PM +0200, Volkan YAZICI wrote:
> On Mon, 22 Dec 2008, Tzafrir Cohen <tzafrir@cohens.org.il> writes:
> > Could you please giva a usage scenario?
>
> Compressing a database backup file of size 300GB.
>
> > Is the task really CPU bound and not i/o bound?
>
> "top -d 1" tells me that CPU usage is 800%. (BTW, have you ever met with
> an input file that is not CPU bound, but I/O bound -- discarding
> specifically generated worst-case scenario inputs.)

gzip, bzip2, and lzma seem quite CPU intensive as I saw too.

For my local archiving of not-so-big ata, I just tar/cpio them w/o
compression. Lazy me.

You may wish to try lzop which is optimized for speed.

Here are notable Debian compression tools.

http://people.debian.org/~osamu/pub/getwiki/html/ch11.en.html#archiveandcompressiontools

Osamu


--
To UNSUBSCRIBE, email to debian-user-REQUEST@lists.debian.org
with a subject of "unsubscribe". Trouble? Contact listmaster@lists.debian.org
 
Old 12-22-2008, 01:25 PM
"Douglas A. Tutty"
 
Default Parallel GZIP

On Mon, Dec 22, 2008 at 11:37:34AM +0000, Tzafrir Cohen wrote:
> On Mon, Dec 22, 2008 at 01:10:04PM +0200, Volkan YAZICI wrote:
> >
> > Are there any compression tools in the official Debian GNU/Linux
> > repositories that can take the benefit of multiple cores/cpus?
>
> Could you please giva a usage scenario?
>
> Is the task really CPU bound and not i/o bound?

Installing on a new dual PII-450 box from a local apt-cacher repository.
The box was an HP NetServer LPr with hardware RAID. Yes, the install
was compute bound according to top. It would have been nice if there
was a drop-in replacement for gzip that dpkg could have used that would
use both CPUs.

Doug.


--
To UNSUBSCRIBE, email to debian-user-REQUEST@lists.debian.org
with a subject of "unsubscribe". Trouble? Contact listmaster@lists.debian.org
 
Old 12-22-2008, 01:51 PM
"Thomas Preud'homme"
 
Default Parallel GZIP

The Monday 22 December 2008 13:53:43 Eugene V. Lyubimkin, you wrote*:
> Volkan YAZICI wrote:
> > Hi,
> >
> > Are there any compression tools in the official Debian GNU/Linux
> > repositories that can take the benefit of multiple cores/cpus?
>
> pbzip2?


I profit of this thread to ask you if you see amelioration with pbzip2. I made
2 tests on file of 50 MB and the result if approximately the same time as
bzip2 (a little much longer with pbzip2 !!!) but a CPU twice time more used.
I have a core 2 duo and time give me a usage of almost 200% with pbzip2

Anyone has better results ?

Regards,

Thomas Preud'homme

--
Why debian : http://www.debian.org/intro/why_debian
 
Old 12-22-2008, 02:13 PM
"Eugene V. Lyubimkin"
 
Default Parallel GZIP

Thomas Preud'homme wrote:
> The Monday 22 December 2008 13:53:43 Eugene V. Lyubimkin, you wrote :
>> Volkan YAZICI wrote:
>>> Hi,
>>>
>>> Are there any compression tools in the official Debian GNU/Linux
>>> repositories that can take the benefit of multiple cores/cpus?
>> pbzip2?
>
>
> I profit of this thread to ask you if you see amelioration with pbzip2. I made
> 2 tests on file of 50 MB and the result if approximately the same time as
> bzip2 (a little much longer with pbzip2 !!!) but a CPU twice time more used.
> I have a core 2 duo and time give me a usage of almost 200% with pbzip2
>
> Anyone has better results ?
Yes, see following:

jackyf@work:~/temp$ time tar -cjf temp1.tar.bz2 debian

real 0m18.830s
user 0m18.317s
sys 0m0.268s
jackyf@work:~/temp$ time bash -c "tar -cf - debian | pbzip2 -c > temp2.tar.bz2"

real 0m10.494s
user 0m19.557s
sys 0m0.488s

10 is much less than 18.

--
Eugene V. Lyubimkin aka JackYF, JID: jackyf.devel(maildog)gmail.com
Ukrainian C++ Developer, Debian Maintainer, APT contributor
 
Old 12-22-2008, 02:17 PM
Ron Johnson
 
Default Parallel GZIP

On 12/22/08 09:13, Eugene V. Lyubimkin wrote:

Thomas Preud'homme wrote:

The Monday 22 December 2008 13:53:43 Eugene V. Lyubimkin, you wrote :

Volkan YAZICI wrote:

Hi,

Are there any compression tools in the official Debian GNU/Linux
repositories that can take the benefit of multiple cores/cpus?

pbzip2?


I profit of this thread to ask you if you see amelioration with pbzip2. I made
2 tests on file of 50 MB and the result if approximately the same time as
bzip2 (a little much longer with pbzip2 !!!) but a CPU twice time more used.
I have a core 2 duo and time give me a usage of almost 200% with pbzip2


Anyone has better results ?

Yes, see following:

jackyf@work:~/temp$ time tar -cjf temp1.tar.bz2 debian

real 0m18.830s
user 0m18.317s
sys 0m0.268s
jackyf@work:~/temp$ time bash -c "tar -cf - debian | pbzip2 -c > temp2.tar.bz2"

real 0m10.494s
user 0m19.557s
sys 0m0.488s

10 is much less than 18.


That might be because much of the data is already cached. Each of
those commands should be run 3 times to get good numbers.


--
Ron Johnson, Jr.
Jefferson LA USA

I like my women like I like my coffee - purchased at above-market
rates from eco-friendly organic farming cooperatives in Latin America.


--
To UNSUBSCRIBE, email to debian-user-REQUEST@lists.debian.org
with a subject of "unsubscribe". Trouble? Contact listmaster@lists.debian.org
 
Old 12-22-2008, 02:18 PM
"Thomas Preud'homme"
 
Default Parallel GZIP

The Monday 22 December 2008 16:13:17 Eugene V. Lyubimkin, you wrote*:
> Thomas Preud'homme wrote:
> > The Monday 22 December 2008 13:53:43 Eugene V. Lyubimkin, you wrote :
> >> Volkan YAZICI wrote:
> >>> Hi,
> >>>
> >>> Are there any compression tools in the official Debian GNU/Linux
> >>> repositories that can take the benefit of multiple cores/cpus?
> >>
> >> pbzip2?
> >
> > I profit of this thread to ask you if you see amelioration with pbzip2. I
> > made 2 tests on file of 50 MB and the result if approximately the same
> > time as bzip2 (a little much longer with pbzip2 !!!) but a CPU twice time
> > more used. I have a core 2 duo and time give me a usage of almost 200%
> > with pbzip2
> >
> > Anyone has better results ?
>
> Yes, see following:
>
> jackyf@work:~/temp$ time tar -cjf temp1.tar.bz2 debian
>
> real 0m18.830s
> user 0m18.317s
> sys 0m0.268s
> jackyf@work:~/temp$ time bash -c "tar -cf - debian | pbzip2 -c >
> temp2.tar.bz2"
>
> real 0m10.494s
> user 0m19.557s
> sys 0m0.488s
>
> 10 is much less than 18.


Ok so sometime it works well. It would be great to know when it'll be quicker
and when it will not. What kind of files are in your debian directory ?

Regards,

Thomas Preud'homme
--
Why debian : http://www.debian.org/intro/why_debian
 

Thread Tools




All times are GMT. The time now is 09:11 AM.

VBulletin, Copyright ©2000 - 2013, Jelsoft Enterprises Ltd.
Content Relevant URLs by vBSEO ©2007, Crawlability, Inc.
Copyright ©2007 - 2008, www.linux-archive.org