Message ID | 1473482849-5706-1-git-send-email-zlang@redhat.com (mailing list archive) |
---|---|
State | Superseded, archived |
Headers | show |
On 9/9/16 11:47 PM, Zorro Lang wrote: > The man 8 xfs_repair said "xfs_repair run without the -n option will > always return a status code of 0". That's not correct. > > xfs_repair will return 2 if it find valuable metadata changes in log > which needs to be replayed, 1 if it can't fix the corruption or some > other errors happened and 0 if nothing wrong or all the corruptions > were fixed. > > Generally xfs_repair -L will always return 0, except it can't clear > the log. And I think that's an operational type error, not the result of a filesystem problem; more like an IO error, or a code bug, I *think* ... more below. > Signed-off-by: Zorro Lang <zlang@redhat.com> > --- > > Hi, > > I trusted the xfs_repair manpage, and thought xfs_repair will always return 0. > But recently I found it lies when I tried to review someone xfstests case. > > A correct manpage will help more people to write right cases, so I try to modify > the manpage, by search all exit/do_error in xfsprogs/repair. I'm not the best > one who learn about xfs_repair, so I just hope I did the right thing:-P Please > feel free to correct me. > > Thanks, > Zorro > > man/man8/xfs_repair.8 | 13 ++++++++++++- > 1 file changed, 12 insertions(+), 1 deletion(-) > > diff --git a/man/man8/xfs_repair.8 b/man/man8/xfs_repair.8 > index 1b4d9e3..1f8f13b 100644 > --- a/man/man8/xfs_repair.8 > +++ b/man/man8/xfs_repair.8 > @@ -504,12 +504,23 @@ that is known to be free. The entry is therefore invalid and is deleted. > This message refers to a large directory. > If the directory were small, the message would read "junking entry ...". > .SH EXIT STATUS > +.TP > .B xfs_repair \-n > (no modify node) > will return a status of 1 if filesystem corruption was detected and > 0 if no filesystem corruption was detected. > +.TP > .B xfs_repair > -run without the \-n option will always return a status code of 0. > +run without the \-n option will return a status code of 2 if it find the > +filesystem has valuable metadata changes in log which needs to be > +replayed, 1 if there's corruption left to be fixed I'm not sure that's the best description; from a quick look, I think those exit values of 1 result from do_error(), and in repair that's (usually?) due to something like a memory allocation failure, or an inconsistent state in the tool; more like hitting an ASSERT. That might leave corruption, but only as a follow-on effect. > + or can't find log head > +and tail or some other errors happened, Which is the same as above, I think - an internal error. > and 0 if nothing wrong or all the > +corruptions were fixed. > +.TP > +.B xfs_repair \-L > +(Force Log Zeroing) > +will return a status code of 1 if it can't clear the log, or will always > +return 0. How about something like this: .B xfs_repair \-n (no modify node) will return a status of 1 if filesystem corruption was detected and 0 if no filesystem corruption was detected. .TP .B xfs_repair run without the \-n option will return a status code of 2 if it finds a filesystem log which needs to be replayed (by a mount/umount cycle), 1 if a runtime error is encountered, and 0 in all other cases, whether or not filesystem corruption was detected. and I'd leave out the bit about xfs_repair -L; really that's just a runtime error - if we clear the log and then can't find the head/tail, something strange has gone wrong. Thanks, -Eric > .SH BUGS > The filesystem to be checked and repaired must have been > unmounted cleanly using normal system administration procedures >
On Mon, Sep 12, 2016 at 11:01:12AM -0500, Eric Sandeen wrote: > On 9/9/16 11:47 PM, Zorro Lang wrote: > > The man 8 xfs_repair said "xfs_repair run without the -n option will > > always return a status code of 0". That's not correct. > > > > xfs_repair will return 2 if it find valuable metadata changes in log > > which needs to be replayed, 1 if it can't fix the corruption or some > > other errors happened and 0 if nothing wrong or all the corruptions > > were fixed. > > > > Generally xfs_repair -L will always return 0, except it can't clear > > the log. > > And I think that's an operational type error, not the result > of a filesystem problem; more like an IO error, or a code bug, > I *think* ... more below. > > > > Signed-off-by: Zorro Lang <zlang@redhat.com> > > --- > > > > Hi, > > > > I trusted the xfs_repair manpage, and thought xfs_repair will always return 0. > > But recently I found it lies when I tried to review someone xfstests case. > > > > A correct manpage will help more people to write right cases, so I try to modify > > the manpage, by search all exit/do_error in xfsprogs/repair. I'm not the best > > one who learn about xfs_repair, so I just hope I did the right thing:-P Please > > feel free to correct me. > > > > Thanks, > > Zorro > > > > man/man8/xfs_repair.8 | 13 ++++++++++++- > > 1 file changed, 12 insertions(+), 1 deletion(-) > > > > diff --git a/man/man8/xfs_repair.8 b/man/man8/xfs_repair.8 > > index 1b4d9e3..1f8f13b 100644 > > --- a/man/man8/xfs_repair.8 > > +++ b/man/man8/xfs_repair.8 > > @@ -504,12 +504,23 @@ that is known to be free. The entry is therefore invalid and is deleted. > > This message refers to a large directory. > > If the directory were small, the message would read "junking entry ...". > > .SH EXIT STATUS > > +.TP > > .B xfs_repair \-n > > (no modify node) > > will return a status of 1 if filesystem corruption was detected and > > 0 if no filesystem corruption was detected. > > +.TP > > .B xfs_repair > > -run without the \-n option will always return a status code of 0. > > +run without the \-n option will return a status code of 2 if it find the > > +filesystem has valuable metadata changes in log which needs to be > > +replayed, 1 if there's corruption left to be fixed > > I'm not sure that's the best description; from a quick look, I think > those exit values of 1 result from do_error(), and in repair that's > (usually?) due to something like a memory allocation failure, or an > inconsistent state in the tool; more like hitting an ASSERT. That might > leave corruption, but only as a follow-on effect. Hi Eric, Many thanks for you can help to review this patch. I've check all code will exit(1), generally it caused by memory or disk errors. But some other situations likes: - No enough matching AGs or superblocks - Primary superblock bad after phase 1 - Sector size on host filesystem larger than image sector size, when try to repair a file image ... will exit(1) too. But yes, they're all belong to runtime error:) There're too many situations can return 1. But only one place can return 2, so we can say except return 0 and 2, others will return 1 :-P > > > + or can't find log head > > +and tail or some other errors happened, > > Which is the same as above, I think - an internal error. > > > and 0 if nothing wrong or all the > > +corruptions were fixed. > > +.TP > > +.B xfs_repair \-L > > +(Force Log Zeroing) > > +will return a status code of 1 if it can't clear the log, or will always > > +return 0. > > > How about something like this: > > .B xfs_repair \-n > (no modify node) > will return a status of 1 if filesystem corruption was detected and > 0 if no filesystem corruption was detected. > .TP > .B xfs_repair > run without the \-n option will return a status code of 2 if it finds a > filesystem log which needs to be replayed (by a mount/umount cycle), 1 if > a runtime error is encountered, and 0 in all other cases, whether or not > filesystem corruption was detected. Your patch(xfs_repair: exit with status 2 if log dirtiness is unknown) will make xfs_repair return 2, when it can't find log head/tail. I think xfs_repair won't think the log needs to be replayed if it can't find the log tail/head. So how about "return a status code of 2 if it finds filesystem log needs to be replayed or cleared"? Thanks, Zorro > > and I'd leave out the bit about xfs_repair -L; really that's just a runtime > error - if we clear the log and then can't find the head/tail, something > strange has gone wrong. > > Thanks, > > -Eric > > > .SH BUGS > > The filesystem to be checked and repaired must have been > > unmounted cleanly using normal system administration procedures > > > > _______________________________________________ > xfs mailing list > xfs@oss.sgi.com > http://oss.sgi.com/mailman/listinfo/xfs
On 9/13/16 9:44 AM, Zorro Lang wrote: > On Mon, Sep 12, 2016 at 11:01:12AM -0500, Eric Sandeen wrote: >> On 9/9/16 11:47 PM, Zorro Lang wrote: >>> The man 8 xfs_repair said "xfs_repair run without the -n option will >>> always return a status code of 0". That's not correct. >>> >>> xfs_repair will return 2 if it find valuable metadata changes in log >>> which needs to be replayed, 1 if it can't fix the corruption or some >>> other errors happened and 0 if nothing wrong or all the corruptions >>> were fixed. >>> >>> Generally xfs_repair -L will always return 0, except it can't clear >>> the log. >> >> And I think that's an operational type error, not the result >> of a filesystem problem; more like an IO error, or a code bug, >> I *think* ... more below. >> >> >>> Signed-off-by: Zorro Lang <zlang@redhat.com> >>> --- >>> >>> Hi, >>> >>> I trusted the xfs_repair manpage, and thought xfs_repair will always return 0. >>> But recently I found it lies when I tried to review someone xfstests case. >>> >>> A correct manpage will help more people to write right cases, so I try to modify >>> the manpage, by search all exit/do_error in xfsprogs/repair. I'm not the best >>> one who learn about xfs_repair, so I just hope I did the right thing:-P Please >>> feel free to correct me. >>> >>> Thanks, >>> Zorro >>> >>> man/man8/xfs_repair.8 | 13 ++++++++++++- >>> 1 file changed, 12 insertions(+), 1 deletion(-) >>> >>> diff --git a/man/man8/xfs_repair.8 b/man/man8/xfs_repair.8 >>> index 1b4d9e3..1f8f13b 100644 >>> --- a/man/man8/xfs_repair.8 >>> +++ b/man/man8/xfs_repair.8 >>> @@ -504,12 +504,23 @@ that is known to be free. The entry is therefore invalid and is deleted. >>> This message refers to a large directory. >>> If the directory were small, the message would read "junking entry ...". >>> .SH EXIT STATUS >>> +.TP >>> .B xfs_repair \-n >>> (no modify node) >>> will return a status of 1 if filesystem corruption was detected and >>> 0 if no filesystem corruption was detected. >>> +.TP >>> .B xfs_repair >>> -run without the \-n option will always return a status code of 0. >>> +run without the \-n option will return a status code of 2 if it find the >>> +filesystem has valuable metadata changes in log which needs to be >>> +replayed, 1 if there's corruption left to be fixed >> >> I'm not sure that's the best description; from a quick look, I think >> those exit values of 1 result from do_error(), and in repair that's >> (usually?) due to something like a memory allocation failure, or an >> inconsistent state in the tool; more like hitting an ASSERT. That might >> leave corruption, but only as a follow-on effect. > > Hi Eric, > > Many thanks for you can help to review this patch. > > I've check all code will exit(1), generally it caused by memory or disk > errors. But some other situations likes: > - No enough matching AGs or superblocks > - Primary superblock bad after phase 1 > - Sector size on host filesystem larger than image sector size, when try > to repair a file image > ... > > will exit(1) too. Sigh, ok. I guess the exit(1) has proliferated a lot. :( > But yes, they're all belong to runtime error:) There're too many situations > can return 1. But only one place can return 2, so we can say except return 0 > and 2, others will return 1 :-P > > >> >>> + or can't find log head >>> +and tail or some other errors happened, >> >> Which is the same as above, I think - an internal error. >> >>> and 0 if nothing wrong or all the >>> +corruptions were fixed. >>> +.TP >>> +.B xfs_repair \-L >>> +(Force Log Zeroing) >>> +will return a status code of 1 if it can't clear the log, or will always >>> +return 0. >> >> >> How about something like this: >> >> .B xfs_repair \-n >> (no modify node) >> will return a status of 1 if filesystem corruption was detected and >> 0 if no filesystem corruption was detected. >> .TP >> .B xfs_repair >> run without the \-n option will return a status code of 2 if it finds a >> filesystem log which needs to be replayed (by a mount/umount cycle), 1 if >> a runtime error is encountered, and 0 in all other cases, whether or not >> filesystem corruption was detected. > > Your patch(xfs_repair: exit with status 2 if log dirtiness is unknown) will > make xfs_repair return 2, when it can't find log head/tail. I think xfs_repair > won't think the log needs to be replayed if it can't find the log tail/head. > > So how about "return a status code of 2 if it finds filesystem log needs to be > replayed or cleared"? That seems reasonable... -Eric > Thanks, > Zorro > >> >> and I'd leave out the bit about xfs_repair -L; really that's just a runtime >> error - if we clear the log and then can't find the head/tail, something >> strange has gone wrong. >> >> Thanks, >> >> -Eric >> >>> .SH BUGS >>> The filesystem to be checked and repaired must have been >>> unmounted cleanly using normal system administration procedures >>> >> >> _______________________________________________ >> xfs mailing list >> xfs@oss.sgi.com >> http://oss.sgi.com/mailman/listinfo/xfs >
diff --git a/man/man8/xfs_repair.8 b/man/man8/xfs_repair.8 index 1b4d9e3..1f8f13b 100644 --- a/man/man8/xfs_repair.8 +++ b/man/man8/xfs_repair.8 @@ -504,12 +504,23 @@ that is known to be free. The entry is therefore invalid and is deleted. This message refers to a large directory. If the directory were small, the message would read "junking entry ...". .SH EXIT STATUS +.TP .B xfs_repair \-n (no modify node) will return a status of 1 if filesystem corruption was detected and 0 if no filesystem corruption was detected. +.TP .B xfs_repair -run without the \-n option will always return a status code of 0. +run without the \-n option will return a status code of 2 if it find the +filesystem has valuable metadata changes in log which needs to be +replayed, 1 if there's corruption left to be fixed or can't find log head +and tail or some other errors happened, and 0 if nothing wrong or all the +corruptions were fixed. +.TP +.B xfs_repair \-L +(Force Log Zeroing) +will return a status code of 1 if it can't clear the log, or will always +return 0. .SH BUGS The filesystem to be checked and repaired must have been unmounted cleanly using normal system administration procedures
The man 8 xfs_repair said "xfs_repair run without the -n option will always return a status code of 0". That's not correct. xfs_repair will return 2 if it find valuable metadata changes in log which needs to be replayed, 1 if it can't fix the corruption or some other errors happened and 0 if nothing wrong or all the corruptions were fixed. Generally xfs_repair -L will always return 0, except it can't clear the log. Signed-off-by: Zorro Lang <zlang@redhat.com> --- Hi, I trusted the xfs_repair manpage, and thought xfs_repair will always return 0. But recently I found it lies when I tried to review someone xfstests case. A correct manpage will help more people to write right cases, so I try to modify the manpage, by search all exit/do_error in xfsprogs/repair. I'm not the best one who learn about xfs_repair, so I just hope I did the right thing:-P Please feel free to correct me. Thanks, Zorro man/man8/xfs_repair.8 | 13 ++++++++++++- 1 file changed, 12 insertions(+), 1 deletion(-)