FAQ Search Today's Posts Mark Forums Read
» Video Reviews

» Linux Archive

Linux-archive is a website aiming to archive linux email lists and to make them easily accessible for linux users/developers.


» Sponsor

» Partners

» Sponsor

Go Back   Linux Archive > Debian > Debian User

 
 
LinkBack Thread Tools
 
Old 08-15-2010, 06:00 AM
Borden Rhodes
 
Default Why is troubleshooting Linux so hard?

Good morning,

I'm going to list some of the frustrations I've been having with
troubleshooting Linux's quirks, crashes and problems in hopes that someone may
be able to help me (and the community) become better bug reporters and
troubleshooters. I'll make comparisons to Windows only because I am used to
fixing the same problems in Windows a certain way - maybe there are analogies
in Linux or maybe I'm approaching these problems the wrong way. I'm not
trying to troll or flame-bait. I'm using Debian Squeeze, by the way.

1) Is there a way to apply debugging symbols retroactively to a dump? A few
times I've had Linux crash on me and spit out a debugging dump. I do my best
to install debugging symbols for all 1400 packages I have on my system (when I
can find them) but this requires a huge amount of hard disk space and,
invariably, the odd dump is missing symbols. Recreating the crash isn't
always possible. Is there (or could someone invent) a way to save a dump
without the symbols, download the symbol tables and then regenerate the dump
with the symbols so it's useful to developers?

2) I find that the logs contain lots of facts but not a whole lot of useful
information (if any) when something goes wrong. I've had KDE go black-screen
on me, for example, and force a hard reboot but there's no mention whatsoever
(that I can find) in xorg.log, kdm.log, messages, syslog or dmesg. Windows
seems to be fairly good at making its last breath a stop error before it dies
which means when I get back into the system (or when I'm looking at a client's
computer days after) I can find that stop error, look it up and figure out what
went wrong. Are Linux's logs designed for troubleshooting or only for
monitoring? Are proper troubleshooting logs kept somewhere else or in a
special file? Is there a guide on how to read Linux's logs so I can make sense
out of them like I can Windows' logs?

3) Linux needs better troubleshooting and recovery systems. The answer I
usually get when I get an unexplained error is to run the program inside a dbg
or with valgrind. I'm not convinced that this is a practical way to
troubleshoot serious problems (like kernel panics) and it requires a certain
amount of foresight that a problem will occur. According to this logic, the
only way that someone can produce useful reports and feedback (or even get a
clue as to what happened) on the day-to-day crashes and bugs is to start Linux
and all of its sub process inside valgrind and/or gdb. This is obviously not
an intended use of these programs.

This is what would make it easier (at least for me) to troubleshoot Linux
problems. If these features exist, please let me know so I can start using
them (they should probably be documented in the man pages too).

1) Logs need to have useful information. When I look at a client's Windows
box days after they report something going wrong, the logs tell me at what
time the problem happened, which process failed and what error it threw just
before it blew. I can look those error codes up and (usually) fix the problem
within an hour. When something dies on Linux, the log entry (assuming it even
makes one) only tells me how many seconds into that particular boot the
problem occurred. I've never been able to go back a few days later and find the
log entries related to a particular crash - maybe because they've been purged.
I know that the Linux tradition is to identify processes only by ID but surely
there must be a way that it can print a file or package name or anything more
useful than memory addresses and registers so at least I know where to start
pointing fingers. Several people have told me that it's pointless trying to
debug a dump in the logs. What's the point of dumping it in the first place if
nobody can read it?

2) I wish error logs had simple codes or messages (which have documentation)
like Windows Stop errors so I can look them up and figure out why something
died. Often times I try to Google the whole error message and either get
directed to source code or totally irrelevant postings (since it seems that
many messages are reused for all kinds of problems). For example, 'segfault'
gets thrown so much that it only tells you that the program crashed -
something I already know.

3) Logs need better organisation. I'm looking at the most recent dump and
each message is printed on its own line. The problem is that interspersed in
those individual lines may be other entries from other events not related to
the problem in question. When I look at a Windows log, each event is entirely
contained in one entry. It doesn't make one entry for "Stop", another entry
for the Stop number, another 4 entries for the parameters and more entries for
whatever other information usually is in them - whilst having other entries
amid the list with what other things were doing at the time. I find Linux logs
very frustrating to read for that reason since I don't know when an event is
finished reporting or which items are relevant.

4) Logs need to focus on reporting on one thing and making sure it reports
that one thing well. Other than formatting, I can't see many differences
between syslog, dmesg and messages. Xorg.log is some help for troubleshooting
misconfigured xorg.conf files (which are depreciated anyway) but not very useful
if your X session burns down. kdm.log seems identical to Xorg.log except for
a few KDE-specific entries. I had to uninstall my firewall because it kept
writing firewall entries to messages (and stdout) and I couldn't figure out how
to get it to stop. Why isn't there one log that only deals with hardware
status and changes, another one that only deals with network status and
firewall logging, another one which only deals with dumps and crashes and so
on?

Maybe I just haven't found the right manual yet that has all of these answers
so I'd appreciate any direction.

With regards,

Borden Rhodes


--
To UNSUBSCRIBE, email to debian-user-REQUEST@lists.debian.org
with a subject of "unsubscribe". Trouble? Contact listmaster@lists.debian.org
Archive: 201008150200.52677.jrvp@bordenrhodes.com">http://lists.debian.org/201008150200.52677.jrvp@bordenrhodes.com
 
Old 08-15-2010, 06:27 AM
Kousik Maiti
 
Default Why is troubleshooting Linux so hard?

My suggestion, can't we create troubleshooting database??
On Sun, Aug 15, 2010 at 11:30 AM, Borden Rhodes <jrvp@bordenrhodes.com> wrote:


Good morning,



I'm going to list some of the frustrations I've been having with

troubleshooting Linux's quirks, crashes and problems in hopes that someone may

be able to help me (and the community) become better bug reporters and

troubleshooters. Â*I'll make comparisons to Windows only because I am used to

fixing the same problems in Windows a certain way - maybe there are analogies

in Linux or maybe I'm approaching these problems the wrong way. Â*I'm not

trying to troll or flame-bait. Â*I'm using Debian Squeeze, by the way.



1) Is there a way to apply debugging symbols retroactively to a dump? A few

times I've had Linux crash on me and spit out a debugging dump. Â*I do my best

to install debugging symbols for all 1400 packages I have on my system (when I

can find them) but this requires a huge amount of hard disk space and,

invariably, the odd dump is missing symbols. Â*Recreating the crash isn't

always possible. Â*Is there (or could someone invent) a way to save a dump

without the symbols, download the symbol tables and then regenerate the dump

with the symbols so it's useful to developers?



2) I find that the logs contain lots of facts but not a whole lot of useful

information (if any) when something goes wrong. Â*I've had KDE go black-screen

on me, for example, and force a hard reboot but there's no mention whatsoever

(that I can find) in xorg.log, kdm.log, messages, syslog or dmesg. Â*Windows

seems to be fairly good at making its last breath a stop error before it dies

which means when I get back into the system (or when I'm looking at a client's

computer days after) I can find that stop error, look it up and figure out what

went wrong. Â*Are Linux's logs designed for troubleshooting or only for

monitoring? Â*Are proper troubleshooting logs kept somewhere else or in a

special file? Is there a guide on how to read Linux's logs so I can make sense

out of them like I can Windows' logs?



3) Linux needs better troubleshooting and recovery systems. Â*The answer I

usually get when I get an unexplained error is to run the program inside a dbg

or with valgrind. Â*I'm not convinced that this is a practical way to

troubleshoot serious problems (like kernel panics) and it requires a certain

amount of foresight that a problem will occur. Â*According to this logic, the

only way that someone can produce useful reports and feedback (or even get a

clue as to what happened) on the day-to-day crashes and bugs is to start Linux

and all of its sub process inside valgrind and/or gdb. Â*This is obviously not

an intended use of these programs.



This is what would make it easier (at least for me) to troubleshoot Linux

problems. Â*If these features exist, please let me know so I can start using

them (they should probably be documented in the man pages too).



1) Logs need to have useful information. Â*When I look at a client's Windows

box days after they report something going wrong, the logs tell me at what

time the problem happened, which process failed and what error it threw just

before it blew. Â*I can look those error codes up and (usually) fix the problem

within an hour. Â*When something dies on Linux, the log entry (assuming it even

makes one) only tells me how many seconds into that particular boot the

problem occurred. I've never been able to go back a few days later and find the

log entries related to a particular crash - maybe because they've been purged.

I know that the Linux tradition is to identify processes only by ID but surely

there must be a way that it can print a file or package name or anything more

useful than memory addresses and registers so at least I know where to start

pointing fingers. Â*Several people have told me that it's pointless trying to

debug a dump in the logs. Â*What's the point of dumping it in the first place if

nobody can read it?



2) I wish error logs had simple codes or messages (which have documentation)

like Windows Stop errors so I can look them up and figure out why something

died. Â*Often times I try to Google the whole error message and either get

directed to source code or totally irrelevant postings (since it seems that

many messages are reused for all kinds of problems). Â*For example, 'segfault'

gets thrown so much that it only tells you that the program crashed -

something I already know.



3) Logs need better organisation. Â*I'm looking at the most recent dump and

each message is printed on its own line. Â*The problem is that interspersed in

those individual lines may be other entries from other events not related to

the problem in question. Â*When I look at a Windows log, each event is entirely

contained in one entry. Â*It doesn't make one entry for "Stop", another entry

for the Stop number, another 4 entries for the parameters and more entries for

whatever other information usually is in them - whilst having other entries

amid the list with what other things were doing at the time. Â*I find Linux logs

very frustrating to read for that reason since I don't know when an event is

finished reporting or which items are relevant.



4) Logs need to focus on reporting on one thing and making sure it reports

that one thing well. Â*Other than formatting, I can't see many differences

between syslog, dmesg and messages. Â*Xorg.log is some help for troubleshooting

misconfigured xorg.conf files (which are depreciated anyway) but not very useful

if your X session burns down. Â*kdm.log seems identical to Xorg.log except for

a few KDE-specific entries. Â*I had to uninstall my firewall because it kept

writing firewall entries to messages (and stdout) and I couldn't figure out how

to get it to stop. Â*Why isn't there one log that only deals with hardware

status and changes, another one that only deals with network status and

firewall logging, another one which only deals with dumps and crashes and so

on?



Maybe I just haven't found the right manual yet that has all of these answers

so I'd appreciate any direction.



With regards,



Borden Rhodes





--

To UNSUBSCRIBE, email to debian-user-REQUEST@lists.debian.org

with a subject of "unsubscribe". Trouble? Contact listmaster@lists.debian.org

Archive: http://lists.debian.org/201008150200.52677.jrvp@bordenrhodes.com





--
Wishing you the very best of everything, always!!!
Kousik Maiti(কৌশিক মাইতি)
Registered Linux User #474025
Registered Ubuntu User # 28654
 
Old 08-15-2010, 06:31 AM
"John A. Sullivan III"
 
Default Why is troubleshooting Linux so hard?

On Sun, 2010-08-15 at 02:00 -0400, Borden Rhodes wrote:
> Good morning,
>
> I'm going to list some of the frustrations I've been having with
> troubleshooting Linux's quirks, crashes and problems in hopes that someone may
> be able to help me (and the community) become better bug reporters and
> troubleshooters.<snip>
Very interesting and helpful post. Thank you. I've snipped most of it
out for the sake of those for whom long emails are a problem or
expensive.

As someone without deep troubleshooting experience, I'll be curious to
see the responses. The only thing I can help with is the old log
information. Linux usually rotates the logs. As I'd imagine you've
seen, you can find compressed versions of the logs, e.g., messages.1,
message.2.gz - John


--
To UNSUBSCRIBE, email to debian-user-REQUEST@lists.debian.org
with a subject of "unsubscribe". Trouble? Contact listmaster@lists.debian.org
Archive: 1281853909.24518.43.camel@denise.theartistscloset. com">http://lists.debian.org/1281853909.24518.43.camel@denise.theartistscloset. com
 
Old 08-15-2010, 07:19 AM
Robert Holtzman
 
Default Why is troubleshooting Linux so hard?

On Sun, Aug 15, 2010 at 02:00:52AM -0400, Borden Rhodes wrote:
> Good morning,
>
> I'm going to list some of the frustrations I've been having with
> troubleshooting Linux's quirks, crashes and problems in hopes that someone may
> be able to help me (and the community) become better bug reporters and
> troubleshooters. I'll make comparisons to Windows only because I am used to
> fixing the same problems in Windows a certain way - maybe there are analogies
> in Linux or maybe I'm approaching these problems the wrong way. I'm not
> trying to troll or flame-bait. I'm using Debian Squeeze, by the way.

..........snip..........

That's been my main complaint about Linux ever since I started using it
10 year or more ago (Redhat 6.0).

--
Bob Holtzman
Key ID: 8D549279
"If you think you're getting free lunch,
check the price of the beer"
 
Old 08-15-2010, 09:35 AM
"Boyd Stephen Smith Jr."
 
Default Why is troubleshooting Linux so hard?

In <201008150200.52677.jrvp@bordenrhodes.com>, Borden Rhodes wrote:
>1) Is there a way to apply debugging symbols retroactively to a dump? A few
>times I've had Linux crash on me and spit out a debugging dump. I do my
>best to install debugging symbols for all 1400 packages I have on my system
>(when I can find them) but this requires a huge amount of hard disk space
>and, invariably, the odd dump is missing symbols. Recreating the crash
>isn't always possible. Is there (or could someone invent) a way to save a
>dump without the symbols, download the symbol tables and then regenerate
>the dump with the symbols so it's useful to developers?

Yes, it is, sometimes. Ubuntu has a process to do it automatically, that
mostly gets it right.

Modern versions of "strip" et. al. allow you to save the debugging information
to a separate .so that just contains debugging information. gdb (et. al.) can
then use the debugging-info only .so to decorate an existing backtrace.

This is actually how a lot of distributions produce separate -dbg or -DEBUG
packages.

However, this debugging-info only files only match with the *same exact build*
of the real .so. Taking a random backtrace, determining which build it came
from and finding the appropriate -dbg packages is a bit difficult.

Also, things like prelink, that modify existing .so files result in the
debugging-info only .so not matching. This might also happen this some types
of hardening that reduces the impact of heap/stack overflow/underflow attacks.

Compounding this problem is the large number of programs that are being
written with parts in "scripting" languages, or otherwise non-C/C++ languages
where the path from a symbol in a ELF file to the problematic code is not as
simple.

In short, it can be done in some cases and there are programmers working on
making backtraces from Joe Sixpack or Jane Boxwine more useful. It does seem
to be like there may need to be more people working on this, but it is not
very "sexy" work. Most programmers would rather spend their time improving
the user experience when things are working; IME, that is where the user
spends most of their time.

>2) I find that the logs contain lots of facts but not a whole lot of useful
>information (if any) when something goes wrong. I've had KDE go
>black-screen on me, for example, and force a hard reboot but there's no
>mention whatsoever (that I can find) in xorg.log, kdm.log, messages, syslog
>or dmesg. Windows seems to be fairly good at making its last breath a stop
>error before it dies which means when I get back into the system (or when
>I'm looking at a client's computer days after) I can find that stop error,
>look it up and figure out what went wrong. Are Linux's logs designed for
>troubleshooting or only for monitoring? Are proper troubleshooting logs
>kept somewhere else or in a special file? Is there a guide on how to read
>Linux's logs so I can make sense out of them like I can Windows' logs?

In the case of a kernel crash, the last breath of the system is unfortunately
not writing to dmesg/syslog and sync()ing disks. Depending on the nature of
the crash, there are some good reasons not to do this, though. (E.g. is the
case of a PANIC(), the kernel developer is basically indicating that the
kernel image has been compromised -- doing FS operations with a compromised
kernel might cause [more] data loss.)

I think that logs in general are... dropping in quality. They seem to be less
focused around failed "sanity" checks, mis-configuration warnings, and I-was-
here before I called exit() message. They seem to more filled with I-didn't-
comment-this-out-before-our-release build debugging messages for random
developers. This is not true of kernel logs for the most part; I find them
informative, but it is rarely my kernel that causes me problems.

I speak as someone that has been working as a developer in some capacity for 8
years. Take that for what you will.

>3) Linux needs better troubleshooting and recovery systems. The answer I
>usually get when I get an unexplained error is to run the program inside a
>dbg or with valgrind. I'm not convinced that this is a practical way to
>troubleshoot serious problems (like kernel panics) and it requires a
>certain amount of foresight that a problem will occur. According to this
>logic, the only way that someone can produce useful reports and feedback
>(or even get a clue as to what happened) on the day-to-day crashes and bugs
>is to start Linux and all of its sub process inside valgrind and/or gdb.
>This is obviously not an intended use of these programs.

If we don't know how to reproduce the problem, we can't fix it. If we do know
how to reproduce the problem, the foresight needed to use gdb/valgrind is not
too much more. They shouldn't be your first tools, but they are necessary.

I've also had gdb/valgrind mask errors, which is truly unfortunate. Still, if
you know a way to make it crash every time EXCEPT when in gdb/valgrind, that
tells me something as a developer

NB: I've never had gdb/valgrind help with kernel errors, since they generally
live in user space.

Being able to reproduce the error is the *most important* step. IME, there
are very few problems that can't be fixed/worked-around in 8 man-hours once
you can reproduce the problem in under 15 minutes.

Also, if you have an unreproducable problem, I'm gonna blame the hardware or
cosmic radiation, not the code.

>1) Logs need to have useful information.

Agreed.

>When I look at a client's Windows
>box days after they report something going wrong, the logs tell me at what
>time the problem happened, which process failed and what error it threw just
>before it blew. I can look those error codes up and (usually) fix the
>problem within an hour.

As a less homogeneous environment, there's no ultimate table of error codes to
look at.

>When something dies on Linux, the log entry
>(assuming it even makes one) only tells me how many seconds into that
>particular boot the problem occurred. I've never been able to go back a few
>days later and find the log entries related to a particular crash - maybe
>because they've been purged.

I've still got logs from 2009 on my currently running desktop. They *have*
been archived, but they are still available. You should check your logrotate
settings to make sure your logs are being handled the way you'd like.

>I know that the Linux tradition is to identify
>processes only by ID but surely there must be a way that it can print a
>file or package name or anything more useful than memory addresses and
>registers so at least I know where to start pointing fingers.

The kernel doesn't know about packages. It does know about files, but once
the process is running, it doesn't identify the file using a pathname. As it
is dying is it difficult to extract accurate information, particularly if it
has already "eaten" it's own memory image.

>Several
>people have told me that it's pointless trying to debug a dump in the logs.
> What's the point of dumping it in the first place if nobody can read it?

It is a place to start, but it's not a very good one. A kdump or corefile is
usually much better. A backtrace tells you a set of functions to look at for
obvious errors, a kdump or corefile allows you to inspect local variables and
determine exactly which of your assumptions was violated.

>2) I wish error logs had simple codes or messages (which have documentation)
>like Windows Stop errors so I can look them up and figure out why something
>died. Often times I try to Google the whole error message and either get
>directed to source code or totally irrelevant postings (since it seems that
>many messages are reused for all kinds of problems). For example,
>'segfault' gets thrown so much that it only tells you that the program
>crashed - something I already know.

segfault is a very specific type of crash: A process attempted to access a
memory address that was either not mapped or was mapped without the required
permissions. (Trying to move the IP to a place that is mapped NOEXEC, trying
to write to a read-only mmap(), or even a simple dereference of a NULL
pointer.)

Unfortunately, it is the most common type of hard crash. It can be caused a
multitude of programming errors. If your program is not segfaulting, it can
likely recover in some meaningful way, or at least write a log message and
cleanly exit. If it is segfaulting, there is relatively little you can do; a
signal handler is C isn't allowed to call all of the library functions, and
returning from the SIGSEGV handler causes the program to terminate or
immediately get the signal again, so you can't set a flag.

Error codes and fixed error messages are established after the main body of
code is written, so they can be standardized throughout the body of the code
and documented. However, with release early, release often being the mantra
of many projects, that level of "freeze" never happens. New error messages
and conditions are added all the time, and (more often than not) old error
messages and conditions go way when recovery code is added.

>xorg.conf files (which are depreciated
>anyway)

It's not depreciated. xorg.conf is *the* correct place to configure your
Xorg. However, one of the goal of Xorg is to have enough auto-detection and
dynamic re-configuration that an empty (or missing) xorg.conf is enough for
everyone.

>Why isn't
>there one log that only deals with hardware status and changes, another one
>that only deals with network status and firewall logging, another one which
>only deals with dumps and crashes and so on?

There a a fixed number of "syslog" facilities, but they were designed in the
days of AT&T UNIX, so not all of them are entirely relevant. It seems like
Linux could probably add some more, but portable programs would use them.
Plus, a lot of programs don't log via syslog() anymore anyway.

Anyway, it could be a lot better, I agree. I seem to remember that Debian and
most upstream projects do accept volunteers.
--
Boyd Stephen Smith Jr. ,= ,-_-. =.
bss@iguanasuicide.net ((_/)o o(\_))
ICQ: 514984 YM/AIM: DaTwinkDaddy `-'(. .)`-'
http://iguanasuicide.net/ \_/
 
Old 08-15-2010, 05:03 PM
Camaleón
 
Default Why is troubleshooting Linux so hard?

On Sun, 15 Aug 2010 02:00:52 -0400, Borden Rhodes wrote:

> I'm going to list some of the frustrations I've been having with
> troubleshooting Linux's quirks, crashes and problems in hopes that
> someone may be able to help me (and the community) become better bug
> reporters and troubleshooters. I'll make comparisons to Windows only
> because I am used to fixing the same problems in Windows a certain way -
> maybe there are analogies in Linux or maybe I'm approaching these
> problems the wrong way. I'm not trying to troll or flame-bait. I'm
> using Debian Squeeze, by the way.
>
> 1) Is there a way to apply debugging symbols retroactively to a dump?

(...)

Dunno, but in all the time I was using linux, never had to install a "-
dbg" package to solve 99% of the problems :-)

Anyway, true is that every piece of code (kernel, gnome apps, kde apps,
mozilla apps...) have their own system for debugging problems.

> 2) I find that the logs contain lots of facts but not a whole lot of
> useful information (if any) when something goes wrong.

(...)

Ugh, in windows is even worst :-(

I find linux logging to be very useful (for finding "real" problems) and
highly configurable. Is not uncommon to have a verbose switch available
for every program that tends to provide very detailed information.

The problem here is knowing "what" (what program is involved in the
crash) and so "where" to look into. Expertise helps here.

> 3) Linux needs better troubleshooting and recovery systems.

Yes, I agree this point can be improved and automated. But -generally
speaking- I find linux debugging system to be far useful than Windows one.

Greetings,

--
Camaleón


--
To UNSUBSCRIBE, email to debian-user-REQUEST@lists.debian.org
with a subject of "unsubscribe". Trouble? Contact listmaster@lists.debian.org
Archive: pan.2010.08.15.17.03.04@gmail.com">http://lists.debian.org/pan.2010.08.15.17.03.04@gmail.com
 
Old 08-16-2010, 04:54 AM
Aaron Toponce
 
Default Why is troubleshooting Linux so hard?

On Sun, Aug 15, 2010 at 02:31:49AM -0400, John A. Sullivan III wrote:
> Very interesting and helpful post. Thank you. I've snipped most of it
> out for the sake of those for whom long emails are a problem or
> expensive.

You should ALWAYS trim your messages, cutting out the irrelevant cruft,
leaving only enough of the original message to which you're replying, so
others can make sense of your reply.

Thank you for trimming. Now if everyone else would learn that lesson.

--
. O . O . O . . O O . . . O .
. . O . O O O . O . O O . . O
O O O . O . . O O O O . O O O
 
Old 11-15-2010, 05:51 PM
Borden Rhodes
 
Default Why is troubleshooting Linux so hard?

(Sorry for the late reply to a thread started way back)

I'm pleased for all of the feedback and that I'm not the only person
who's frustrated. I tried proposing to debian-policy that it be
mandatory that all logs have timestamps
http://lists.debian.org/debian-policy/2010/02/msg00035.html but my
suggestion was dismissed because it was considered too hard to enforce.
I responded http://lists.debian.org/debian-policy/2010/08/msg00043.html
saying that it shouldn't be very difficult to enforce or implement at
all.

Linux already has a huge troubleshooting database: Google. The trouble
is that simply copying-and-pasting an error message into Google with a
program or package name (assuming you know whose fault it is) doesn't
generate very useful results. The most useful change, therefore, would
be to improve the quality of the error messages.

I just want to say that I like KDE's auto-crash popup and it would be
nice to have implemented Linux-wide. Of course, the trick is when you
don't have the appropriate debugging packages installed to install them
and regenerate the crash report before it goes away.

What would it take to get some error message standards in place so that
troubleshooting Linux is possible for those of us who aren't computer
science PhD candidates?


--
To UNSUBSCRIBE, email to debian-user-REQUEST@lists.debian.org
with a subject of "unsubscribe". Trouble? Contact listmaster@lists.debian.org
Archive: 1289847115.4309.9.camel@Firefly.bordenrhodes.com"> http://lists.debian.org/1289847115.4309.9.camel@Firefly.bordenrhodes.com
 
Old 11-15-2010, 06:25 PM
Camaleón
 
Default Why is troubleshooting Linux so hard?

On Mon, 15 Nov 2010 13:51:55 -0500, Borden Rhodes wrote:

> (Sorry for the late reply to a thread started way back)
>
> I'm pleased for all of the feedback and that I'm not the only person
> who's frustrated. I tried proposing to debian-policy that it be
> mandatory that all logs have timestamps
> http://lists.debian.org/debian-policy/2010/02/msg00035.html but my
> suggestion was dismissed because it was considered too hard to enforce.
> I responded http://lists.debian.org/debian-policy/2010/08/msg00043.html
> saying that it shouldn't be very difficult to enforce or implement at
> all.

I guess you posted your wish into the wrong list.

Debian Policy is about (sic) "Discussion and editing of the Debian Policy
Manual". Maybe you should have addressed your concern to "debian-
devel" (where meal is being coocked) or by directly filling a bug report.

But I'm afraid that feature will have to be addressed upstream or just in
case another distribution is already adding that option, you should have
pointed that it is already available.

> Linux already has a huge troubleshooting database: Google. The trouble
> is that simply copying-and-pasting an error message into Google with a
> program or package name (assuming you know whose fault it is) doesn't
> generate very useful results. The most useful change, therefore, would
> be to improve the quality of the error messages.

I'd say Google _and_ experience.

Becasue Google means nothing for people who does not know where to look
or how to search.

> I just want to say that I like KDE's auto-crash popup and it would be
> nice to have implemented Linux-wide. Of course, the trick is when you
> don't have the appropriate debugging packages installed to install them
> and regenerate the crash report before it goes away.

Every project provides its own bug tracking tool (GNOME -bugbuddy-, KDE,
kernel -kerneloops-, mozilla...). There are tools for handling crashes
but you have to install "debug" packages associates for each application.
But, OTOH, these bug crashes reports say not much for the plain user:
they are targeted to devels and won't help us to solve our "mere mortal"
problem ;-)

> What would it take to get some error message standards in place so that
> troubleshooting Linux is possible for those of us who aren't computer
> science PhD candidates?

Linux provides one of the best tools I've never seen for debugging erros:
understable logs. They're priceless :-)

Greetings,

--
Camaleón


--
To UNSUBSCRIBE, email to debian-user-REQUEST@lists.debian.org
with a subject of "unsubscribe". Trouble? Contact listmaster@lists.debian.org
Archive: pan.2010.11.15.19.25.13@gmail.com">http://lists.debian.org/pan.2010.11.15.19.25.13@gmail.com
 
Old 11-15-2010, 06:38 PM
Steve Kemp
 
Default Why is troubleshooting Linux so hard?

On Mon Nov 15, 2010 at 13:51:55 -0500, Borden Rhodes wrote:

> What would it take to get some error message standards in place so that
> troubleshooting Linux is possible for those of us who aren't computer
> science PhD candidates?

1. Make a list of all the programs which exist, but which do not
log "useful" information.

2. Persuade every single one of them that your suggestion to add
"useful" logging information is a good one.

3. Wait for them all to update.

This is the problem - There is no single place this change could
be made. Even if it were in KDE and all KDE applications go it you'd
be missing other things such as sudo, screen, less, GNOME, etc.

You suggest "troubleshooting linux", but what you really mean (I
guess) is troubleshooting any available program which just happens
to run upon Linux systems (and possibly others).

Even if you were to get a wide agreement I suspect you'd still
be frustrated - as what is useful information to me is probably
not useful information to you. As things stand you typically get
access to source code, documentation, and tutorials online. I suspect
the pragmatic thing to do if you encounter problems is to explore
that particular program/system thoroughly and learn in the process
the things you're missing.

No PhD candidacy required.

Steve
--
Let me steal your soul?
http://stolen-souls.com


--
To UNSUBSCRIBE, email to debian-user-REQUEST@lists.debian.org
with a subject of "unsubscribe". Trouble? Contact listmaster@lists.debian.org
Archive: 20101115193828.GA3250@steve.org.uk">http://lists.debian.org/20101115193828.GA3250@steve.org.uk
 

Thread Tools




All times are GMT. The time now is 01:37 AM.

VBulletin, Copyright ©2000 - 2013, Jelsoft Enterprises Ltd.
Content Relevant URLs by vBSEO ©2007, Crawlability, Inc.
Copyright ©2007 - 2008, www.linux-archive.org