[V5] Btrfs: snapshot-aware defrag

Message ID	1358339768-2314-1-git-send-email-bo.li.liu@oracle.com (mailing list archive)
State	New, archived
Headers	show Return-Path: <linux-btrfs-owner@vger.kernel.org> From: Liu Bo <bo.li.liu@oracle.com> To: linux-btrfs@vger.kernel.org Cc: chris.mason@fusionio.com, JBacik@fusionio.com, dave@jikos.cz, mitch.harder@sabayonlinux.org, kitayama@cl.bb4u.ne.jp, miaox@cn.fujitsu.com Subject: [PATCH V5] Btrfs: snapshot-aware defrag Date: Wed, 16 Jan 2013 20:36:08 +0800 Message-Id: <1358339768-2314-1-git-send-email-bo.li.liu@oracle.com> Sender: linux-btrfs-owner@vger.kernel.org Precedence: bulk

Liu Bo Jan. 16, 2013, 12:36 p.m. UTC

This comes from one of btrfs's project ideas,
As we defragment files, we break any sharing from other snapshots.
The balancing code will preserve the sharing, and defrag needs to grow this
as well.

Now we're able to fill the blank with this patch, in which we make full use of
backref walking stuff.

Here is the basic idea,
o  set the writeback ranges started by defragment with flag EXTENT_DEFRAG
o  at endio, after we finish updating fs tree, we use backref walking to find
   all parents of the ranges and re-link them with the new COWed file layout by
   adding corresponding backrefs.

Signed-off-by: Li Zefan <lizf@cn.fujitsu.com>
Signed-off-by: Liu Bo <bo.li.liu@oracle.com>
---
v4->v5:
      - Clarify the comments for duplicated refs.
      - Clear defrag flag after we're ready to defrag.
      - Fix a bug on HOLE extent.
v3->v4:
      - Fix duplicated refs bugs detected by mounting with autodefrag, thanks
        for the bug report from Mitch and Chris.
v2->v3:
      - Rebase
v1->v2:
      - Address comments from David.

 fs/btrfs/inode.c |  644 ++++++++++++++++++++++++++++++++++++++++++++++++++++++
 1 files changed, 644 insertions(+), 0 deletions(-)

Mitch Harder Jan. 17, 2013, 2:42 p.m. UTC | #1

On Wed, Jan 16, 2013 at 6:36 AM, Liu Bo <bo.li.liu@oracle.com> wrote:
> This comes from one of btrfs's project ideas,
> As we defragment files, we break any sharing from other snapshots.
> The balancing code will preserve the sharing, and defrag needs to grow this
> as well.
>
> Now we're able to fill the blank with this patch, in which we make full use of
> backref walking stuff.
>
> Here is the basic idea,
> o  set the writeback ranges started by defragment with flag EXTENT_DEFRAG
> o  at endio, after we finish updating fs tree, we use backref walking to find
>    all parents of the ranges and re-link them with the new COWed file layout by
>    adding corresponding backrefs.
>
> Signed-off-by: Li Zefan <lizf@cn.fujitsu.com>
> Signed-off-by: Liu Bo <bo.li.liu@oracle.com>
> ---
> v4->v5:
>       - Clarify the comments for duplicated refs.
>       - Clear defrag flag after we're ready to defrag.
>       - Fix a bug on HOLE extent.
> v3->v4:
>       - Fix duplicated refs bugs detected by mounting with autodefrag, thanks
>         for the bug report from Mitch and Chris.
> v2->v3:
>       - Rebase
> v1->v2:
>       - Address comments from David.
>

I've been testing this patch on a 3.7.2 kernel merged with the
for-linus branch for the 3.8_rc kernels, and I'm seeing the following
error:

[16028.159400] general protection fault: 0000 [#1] SMP
[16028.159461] Modules linked in: ipv6 snd_hda_codec_analog
snd_hda_intel snd_hda_codec tg3 snd_hwdep snd_pcm snd_page_alloc
snd_timer snd sr_mod ppdev parport_pc parport microcode iTCO_wdt
iTCO_vendor_support floppy lpc_ich i2c_i801 serio_raw pcspkr
ablk_helper cryptd lrw xts gf128mul aes_x86_64 sha256_generic fuse xfs
nfs lockd sunrpc reiserfs btrfs zlib_deflate ext4 jbd2 ext3 jbd ext2
mbcache sl811_hcd hid_generic xhci_hcd ohci_hcd uhci_hcd ehci_hcd
[16028.159952] CPU 0
[16028.159975] Pid: 4420, comm: btrfs-cleaner Not tainted 3.7.2-sad+
#4 Dell Inc.                 OptiPlex 745                 /0WF810
[16028.160002] RIP: 0010:[<ffffffffa017b4f2>]  [<ffffffffa017b4f2>]
btrfs_clean_old_snapshots+0xa6/0x12c [btrfs]
[16028.160002] RSP: 0000:ffff880078609e38  EFLAGS: 00010282
[16028.160002] RAX: dead000000200200 RBX: ffff880000000000 RCX: 0000000000018e20
[16028.160002] RDX: dead000000100100 RSI: 000000000000001b RDI: 000000000000001b
[16028.160002] RBP: ffff880078609e78 R08: 00000000001c001b R09: ffffffffa015aa01
[16028.160002] R10: ffffffffa016bbbd R11: ffff8800183a4800 R12: 0000160000000000
[16028.160002] R13: ffff880078609e38 R14: ffff8800183a4800 R15: ffff8800183a4c38
[16028.160002] FS:  0000000000000000(0000) GS:ffff88007f200000(0000)
knlGS:0000000000000000
[16028.160002] CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
[16028.160002] CR2: 00007f64f5214d96 CR3: 0000000011ef2000 CR4: 00000000000007f0
[16028.160002] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[16028.160002] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
[16028.160002] Process btrfs-cleaner (pid: 4420, threadinfo
ffff880078608000, task ffff88007ca62120)
[16028.160002] Stack:
[16028.160002]  ffff8800183a4c38 ffff8800020e3c38 ffff880078609e48
ffff88007921b800
[16028.160002]  ffff88007ca62120 ffff88007ca62120 ffff88007ca62120
0000000000000000
[16028.160002]  ffff880078609eb8 ffffffffa0173f68 ffff88007921b800
0000000000000000
[16028.160002] Call Trace:
[16028.160002]  [<ffffffffa0173f68>] cleaner_kthread+0x5a/0xe6 [btrfs]
[16028.160002]  [<ffffffffa0173f0e>] ? transaction_kthread+0x1a0/0x1a0 [btrfs]
[16028.160002]  [<ffffffff8104c9c3>] kthread+0xba/0xc2
[16028.160002]  [<ffffffff8104c909>] ? kthread_freezable_should_stop+0x52/0x52
[16028.160002]  [<ffffffff815f9d9c>] ret_from_fork+0x7c/0xb0
[16028.160002]  [<ffffffff8104c909>] ? kthread_freezable_should_stop+0x52/0x52
[16028.160002] Code: 49 bc 00 00 00 00 00 16 00 00 48 bb 00 00 00 00
00 88 ff ff eb 7d 4d 8d b7 c8 fb ff ff 4d 85 ff 75 02 0f 0b 49 8b 17
49 8b 47 08 <48> 89 42 08 48 89 10 48 be 00 01 10 00 00 00 ad de 49 89
37 48
[16028.160002] RIP  [<ffffffffa017b4f2>]
btrfs_clean_old_snapshots+0xa6/0x12c [btrfs]
[16028.160002]  RSP <ffff880078609e38>
[16028.170584] ---[ end trace 4034e68ac40e6c2b ]---

Using gdb to identify the location of the GPF gives me the following:

(gdb) list *(btrfs_clean_old_snapshots+0xa6)
0x2a4f2 is in btrfs_clean_old_snapshots (include/linux/list.h:88).
83       * This is only for internal list manipulation where we know
84       * the prev/next entries already!
85       */
86      static inline void __list_del(struct list_head * prev, struct
list_head * next)
87      {
88              next->prev = prev;
89              prev->next = next;
90      }
91
92      /**

I've tried to trap the error with a BUG_ON prior to deleting the list,
but my attempt isn't catching the error:

@@ -1769,6 +1769,7 @@ int btrfs_clean_old_snapshots(struct btrfs_root *root)
                int ret;

                root = list_entry(list.next, struct btrfs_root, root_list);
+               BUG_ON(&root->root_list == NULL);
                list_del(&root->root_list);

                btrfs_kill_all_delayed_nodes(root);
--
To unsubscribe from this list: send the line "unsubscribe linux-btrfs" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Liu Bo Jan. 18, 2013, 12:53 a.m. UTC | #2

On Thu, Jan 17, 2013 at 08:42:46AM -0600, Mitch Harder wrote:
> On Wed, Jan 16, 2013 at 6:36 AM, Liu Bo <bo.li.liu@oracle.com> wrote:
> > This comes from one of btrfs's project ideas,
> > As we defragment files, we break any sharing from other snapshots.
> > The balancing code will preserve the sharing, and defrag needs to grow this
> > as well.
> >
> > Now we're able to fill the blank with this patch, in which we make full use of
> > backref walking stuff.
> >
> > Here is the basic idea,
> > o  set the writeback ranges started by defragment with flag EXTENT_DEFRAG
> > o  at endio, after we finish updating fs tree, we use backref walking to find
> >    all parents of the ranges and re-link them with the new COWed file layout by
> >    adding corresponding backrefs.
> >
> > Signed-off-by: Li Zefan <lizf@cn.fujitsu.com>
> > Signed-off-by: Liu Bo <bo.li.liu@oracle.com>
> > ---
> > v4->v5:
> >       - Clarify the comments for duplicated refs.
> >       - Clear defrag flag after we're ready to defrag.
> >       - Fix a bug on HOLE extent.
> > v3->v4:
> >       - Fix duplicated refs bugs detected by mounting with autodefrag, thanks
> >         for the bug report from Mitch and Chris.
> > v2->v3:
> >       - Rebase
> > v1->v2:
> >       - Address comments from David.
> >
> 
> I've been testing this patch on a 3.7.2 kernel merged with the
> for-linus branch for the 3.8_rc kernels, and I'm seeing the following
> error:

Hi Mitch,

Insteresting!  I don't even change the snapshot code ever.

Is it reproducable stably from your side?  Still with the
snapshot-test-pub scripts?

thanks,
liubo

> 
> [16028.159400] general protection fault: 0000 [#1] SMP
> [16028.159461] Modules linked in: ipv6 snd_hda_codec_analog
> snd_hda_intel snd_hda_codec tg3 snd_hwdep snd_pcm snd_page_alloc
> snd_timer snd sr_mod ppdev parport_pc parport microcode iTCO_wdt
> iTCO_vendor_support floppy lpc_ich i2c_i801 serio_raw pcspkr
> ablk_helper cryptd lrw xts gf128mul aes_x86_64 sha256_generic fuse xfs
> nfs lockd sunrpc reiserfs btrfs zlib_deflate ext4 jbd2 ext3 jbd ext2
> mbcache sl811_hcd hid_generic xhci_hcd ohci_hcd uhci_hcd ehci_hcd
> [16028.159952] CPU 0
> [16028.159975] Pid: 4420, comm: btrfs-cleaner Not tainted 3.7.2-sad+
> #4 Dell Inc.                 OptiPlex 745                 /0WF810
> [16028.160002] RIP: 0010:[<ffffffffa017b4f2>]  [<ffffffffa017b4f2>]
> btrfs_clean_old_snapshots+0xa6/0x12c [btrfs]
> [16028.160002] RSP: 0000:ffff880078609e38  EFLAGS: 00010282
> [16028.160002] RAX: dead000000200200 RBX: ffff880000000000 RCX: 0000000000018e20
> [16028.160002] RDX: dead000000100100 RSI: 000000000000001b RDI: 000000000000001b
> [16028.160002] RBP: ffff880078609e78 R08: 00000000001c001b R09: ffffffffa015aa01
> [16028.160002] R10: ffffffffa016bbbd R11: ffff8800183a4800 R12: 0000160000000000
> [16028.160002] R13: ffff880078609e38 R14: ffff8800183a4800 R15: ffff8800183a4c38
> [16028.160002] FS:  0000000000000000(0000) GS:ffff88007f200000(0000)
> knlGS:0000000000000000
> [16028.160002] CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
> [16028.160002] CR2: 00007f64f5214d96 CR3: 0000000011ef2000 CR4: 00000000000007f0
> [16028.160002] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
> [16028.160002] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
> [16028.160002] Process btrfs-cleaner (pid: 4420, threadinfo
> ffff880078608000, task ffff88007ca62120)
> [16028.160002] Stack:
> [16028.160002]  ffff8800183a4c38 ffff8800020e3c38 ffff880078609e48
> ffff88007921b800
> [16028.160002]  ffff88007ca62120 ffff88007ca62120 ffff88007ca62120
> 0000000000000000
> [16028.160002]  ffff880078609eb8 ffffffffa0173f68 ffff88007921b800
> 0000000000000000
> [16028.160002] Call Trace:
> [16028.160002]  [<ffffffffa0173f68>] cleaner_kthread+0x5a/0xe6 [btrfs]
> [16028.160002]  [<ffffffffa0173f0e>] ? transaction_kthread+0x1a0/0x1a0 [btrfs]
> [16028.160002]  [<ffffffff8104c9c3>] kthread+0xba/0xc2
> [16028.160002]  [<ffffffff8104c909>] ? kthread_freezable_should_stop+0x52/0x52
> [16028.160002]  [<ffffffff815f9d9c>] ret_from_fork+0x7c/0xb0
> [16028.160002]  [<ffffffff8104c909>] ? kthread_freezable_should_stop+0x52/0x52
> [16028.160002] Code: 49 bc 00 00 00 00 00 16 00 00 48 bb 00 00 00 00
> 00 88 ff ff eb 7d 4d 8d b7 c8 fb ff ff 4d 85 ff 75 02 0f 0b 49 8b 17
> 49 8b 47 08 <48> 89 42 08 48 89 10 48 be 00 01 10 00 00 00 ad de 49 89
> 37 48
> [16028.160002] RIP  [<ffffffffa017b4f2>]
> btrfs_clean_old_snapshots+0xa6/0x12c [btrfs]
> [16028.160002]  RSP <ffff880078609e38>
> [16028.170584] ---[ end trace 4034e68ac40e6c2b ]---
> 
> Using gdb to identify the location of the GPF gives me the following:
> 
> (gdb) list *(btrfs_clean_old_snapshots+0xa6)
> 0x2a4f2 is in btrfs_clean_old_snapshots (include/linux/list.h:88).
> 83       * This is only for internal list manipulation where we know
> 84       * the prev/next entries already!
> 85       */
> 86      static inline void __list_del(struct list_head * prev, struct
> list_head * next)
> 87      {
> 88              next->prev = prev;
> 89              prev->next = next;
> 90      }
> 91
> 92      /**
> 
> I've tried to trap the error with a BUG_ON prior to deleting the list,
> but my attempt isn't catching the error:
> 
> @@ -1769,6 +1769,7 @@ int btrfs_clean_old_snapshots(struct btrfs_root *root)
>                 int ret;
> 
>                 root = list_entry(list.next, struct btrfs_root, root_list);
> +               BUG_ON(&root->root_list == NULL);
>                 list_del(&root->root_list);
> 
>                 btrfs_kill_all_delayed_nodes(root);
> --
> To unsubscribe from this list: send the line "unsubscribe linux-btrfs" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html
--
To unsubscribe from this list: send the line "unsubscribe linux-btrfs" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Mitch Harder Jan. 18, 2013, 5:23 a.m. UTC | #3

On Thu, Jan 17, 2013 at 6:53 PM, Liu Bo <bo.li.liu@oracle.com> wrote:
> On Thu, Jan 17, 2013 at 08:42:46AM -0600, Mitch Harder wrote:
>> On Wed, Jan 16, 2013 at 6:36 AM, Liu Bo <bo.li.liu@oracle.com> wrote:
>> > This comes from one of btrfs's project ideas,
>> > As we defragment files, we break any sharing from other snapshots.
>> > The balancing code will preserve the sharing, and defrag needs to grow this
>> > as well.
>> >
>> > Now we're able to fill the blank with this patch, in which we make full use of
>> > backref walking stuff.
>> >
>> > Here is the basic idea,
>> > o  set the writeback ranges started by defragment with flag EXTENT_DEFRAG
>> > o  at endio, after we finish updating fs tree, we use backref walking to find
>> >    all parents of the ranges and re-link them with the new COWed file layout by
>> >    adding corresponding backrefs.
>> >
>> > Signed-off-by: Li Zefan <lizf@cn.fujitsu.com>
>> > Signed-off-by: Liu Bo <bo.li.liu@oracle.com>
>> > ---
>> > v4->v5:
>> >       - Clarify the comments for duplicated refs.
>> >       - Clear defrag flag after we're ready to defrag.
>> >       - Fix a bug on HOLE extent.
>> > v3->v4:
>> >       - Fix duplicated refs bugs detected by mounting with autodefrag, thanks
>> >         for the bug report from Mitch and Chris.
>> > v2->v3:
>> >       - Rebase
>> > v1->v2:
>> >       - Address comments from David.
>> >
>>
>> I've been testing this patch on a 3.7.2 kernel merged with the
>> for-linus branch for the 3.8_rc kernels, and I'm seeing the following
>> error:
>
> Hi Mitch,
>
> Insteresting!  I don't even change the snapshot code ever.

Yes, this patch series has been excellent at tickling unrelated issues.

> Is it reproducable stably from your side?  Still with the
> snapshot-test-pub scripts?

I'm still using the same snapshot-test scripts, but they don't
reproduce reliably.  I have to run for a while after my script reaches
the point where it starts deleting snapshots to make space.

But, I've been able to hit this error four times with this script.

I'll try to keep playing with this to make a better reproducer, and to
isolate the problem with the parameter supplied to list_del.
--
To unsubscribe from this list: send the line "unsubscribe linux-btrfs" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

David Sterba Jan. 18, 2013, 12:19 p.m. UTC | #4

On Thu, Jan 17, 2013 at 08:42:46AM -0600, Mitch Harder wrote:
> [16028.160002] RAX: dead000000200200 RBX: ffff880000000000 RCX: 0000000000018e20
> [16028.160002] RDX: dead000000100100 RSI: 000000000000001b RDI: 000000000000001b

RAX: dead000000200200
RDX: dead000000100100

list_head poisons to mark deleted entries

> I've tried to trap the error with a BUG_ON prior to deleting the list,
> but my attempt isn't catching the error:
> 
> @@ -1769,6 +1769,7 @@ int btrfs_clean_old_snapshots(struct btrfs_root *root)
>                 int ret;
> 
>                 root = list_entry(list.next, struct btrfs_root, root_list);
> +               BUG_ON(&root->root_list == NULL);

You're taking an address and comparing it to NULL? This works, but in
under very limited conditions :)

If root is not null, then the structure is valid, but the root_list hook
is not valid anymore, ie. an inconsistency.

david
--
To unsubscribe from this list: send the line "unsubscribe linux-btrfs" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Mitch Harder Jan. 18, 2013, 10:01 p.m. UTC | #5

On Fri, Jan 18, 2013 at 6:19 AM, David Sterba <dsterba@suse.cz> wrote:
> On Thu, Jan 17, 2013 at 08:42:46AM -0600, Mitch Harder wrote:
>> [16028.160002] RAX: dead000000200200 RBX: ffff880000000000 RCX: 0000000000018e20
>> [16028.160002] RDX: dead000000100100 RSI: 000000000000001b RDI: 000000000000001b
>
> RAX: dead000000200200
> RDX: dead000000100100
>
> list_head poisons to mark deleted entries
>
>> I've tried to trap the error with a BUG_ON prior to deleting the list,
>> but my attempt isn't catching the error:
>>
>> @@ -1769,6 +1769,7 @@ int btrfs_clean_old_snapshots(struct btrfs_root *root)
>>                 int ret;
>>
>>                 root = list_entry(list.next, struct btrfs_root, root_list);
>> +               BUG_ON(&root->root_list == NULL);
>
> You're taking an address and comparing it to NULL? This works, but in
> under very limited conditions :)
>
> If root is not null, then the structure is valid, but the root_list hook
> is not valid anymore, ie. an inconsistency.

Thanks, your feedback is kind.

I wasn't thinking when I wrote that.
--
To unsubscribe from this list: send the line "unsubscribe linux-btrfs" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Mitch Harder Jan. 22, 2013, 5:41 p.m. UTC | #6

On Thu, Jan 17, 2013 at 8:42 AM, Mitch Harder
<mitch.harder@sabayonlinux.org> wrote:
> On Wed, Jan 16, 2013 at 6:36 AM, Liu Bo <bo.li.liu@oracle.com> wrote:
>> This comes from one of btrfs's project ideas,
>> As we defragment files, we break any sharing from other snapshots.
>> The balancing code will preserve the sharing, and defrag needs to grow this
>> as well.
>>
>> Now we're able to fill the blank with this patch, in which we make full use of
>> backref walking stuff.
>>
>> Here is the basic idea,
>> o  set the writeback ranges started by defragment with flag EXTENT_DEFRAG
>> o  at endio, after we finish updating fs tree, we use backref walking to find
>>    all parents of the ranges and re-link them with the new COWed file layout by
>>    adding corresponding backrefs.
>>
>> Signed-off-by: Li Zefan <lizf@cn.fujitsu.com>
>> Signed-off-by: Liu Bo <bo.li.liu@oracle.com>
>> ---
>> v4->v5:
>>       - Clarify the comments for duplicated refs.
>>       - Clear defrag flag after we're ready to defrag.
>>       - Fix a bug on HOLE extent.
>> v3->v4:
>>       - Fix duplicated refs bugs detected by mounting with autodefrag, thanks
>>         for the bug report from Mitch and Chris.
>> v2->v3:
>>       - Rebase
>> v1->v2:
>>       - Address comments from David.
>>
>
> I've been testing this patch on a 3.7.2 kernel merged with the
> for-linus branch for the 3.8_rc kernels, and I'm seeing the following
> error:
>

I've reproduced the error with CONFIG_DEBUG_LIST enabled, which shows
some problem with an entry in the list.

[59312.260441] ------------[ cut here ]------------
[59312.260454] WARNING: at lib/list_debug.c:62 __list_del_entry+0x8d/0x98()
[59312.260458] Hardware name: OptiPlex 745
[59312.260461] list_del corruption. next->prev should be
ffff88006511c438, but was dead000000200200
[59312.260464] Modules linked in: ipv6 snd_hda_codec_analog
snd_hda_intel i2c_i801 tg3 snd_hda_codec iTCO_wdt snd_hwdep snd_pcm
ppdev parport_pc sr_mod microcode floppy parport snd_page_alloc
snd_timer snd iTCO_vendor_support lpc_ich serio_raw pcspkr ablk_helper
cryptd lrw xts gf128mul aes_x86_64 sha256_generic fuse xfs nfs lockd
sunrpc reiserfs btrfs zlib_deflate ext4 jbd2 ext3 jbd ext2 mbcache
sl811_hcd hid_generic xhci_hcd ohci_hcd uhci_hcd ehci_hcd
[59312.260519] Pid: 20523, comm: btrfs-cleaner Not tainted 3.7.2-sad+ #1
[59312.260521] Call Trace:
[59312.260529]  [<ffffffff81030586>] warn_slowpath_common+0x83/0x9b
[59312.260549]  [<ffffffffa015aa01>] ? reada_for_balance+0x187/0x218 [btrfs]
[59312.260554]  [<ffffffff81030641>] warn_slowpath_fmt+0x46/0x48
[59312.260566]  [<ffffffffa015aa01>] ? reada_for_balance+0x187/0x218 [btrfs]
[59312.260570]  [<ffffffff812099e5>] __list_del_entry+0x8d/0x98
[59312.260574]  [<ffffffff812099fe>] list_del+0xe/0x2e
[59312.260590]  [<ffffffffa017b325>]
btrfs_clean_old_snapshots+0x101/0x168 [btrfs]
[59312.260605]  [<ffffffffa0173d99>] cleaner_kthread+0x5a/0xe6 [btrfs]
[59312.260619]  [<ffffffffa0173d3f>] ? transaction_kthread+0x1a0/0x1a0 [btrfs]
[59312.260624]  [<ffffffff8104c750>] kthread+0xba/0xc2
[59312.260629]  [<ffffffff8104c696>] ? kthread_freezable_should_stop+0x52/0x52
[59312.260634]  [<ffffffff815f2f1c>] ret_from_fork+0x7c/0xb0
[59312.260639]  [<ffffffff8104c696>] ? kthread_freezable_should_stop+0x52/0x52
[59312.260642] ---[ end trace 61b4cbd93690300f ]---
[59318.623735] ------------[ cut here ]------------
[59318.623751] WARNING: at lib/list_debug.c:53 __list_del_entry+0x8d/0x98()
[59318.623755] Hardware name: OptiPlex 745
[59318.623760] list_del corruption, ffff88006511c438->next is
LIST_POISON1 (dead000000100100)
[59318.623766] Modules linked in: ipv6 snd_hda_codec_analog
snd_hda_intel i2c_i801 tg3 snd_hda_codec iTCO_wdt snd_hwdep snd_pcm
ppdev parport_pc sr_mod microcode floppy parport snd_page_alloc
snd_timer snd iTCO_vendor_support lpc_ich serio_raw pcspkr ablk_helper
cryptd lrw xts gf128mul aes_x86_64 sha256_generic fuse xfs nfs lockd
sunrpc reiserfs btrfs zlib_deflate ext4 jbd2 ext3 jbd ext2 mbcache
sl811_hcd hid_generic xhci_hcd ohci_hcd uhci_hcd ehci_hcd
[59318.623840] Pid: 20523, comm: btrfs-cleaner Tainted: G        W
3.7.2-sad+ #1
[59318.623844] Call Trace:
[59318.623855]  [<ffffffff81030586>] warn_slowpath_common+0x83/0x9b
[59318.623878]  [<ffffffffa015aab9>] ? btrfs_free_path+0x27/0x2c [btrfs]
[59318.623885]  [<ffffffff81030641>] warn_slowpath_fmt+0x46/0x48
[59318.623901]  [<ffffffffa015aab9>] ? btrfs_free_path+0x27/0x2c [btrfs]
[59318.623907]  [<ffffffff812099e5>] __list_del_entry+0x8d/0x98
[59318.623912]  [<ffffffff812099fe>] list_del+0xe/0x2e
[59318.623935]  [<ffffffffa017b325>]
btrfs_clean_old_snapshots+0x101/0x168 [btrfs]
[59318.623955]  [<ffffffffa0173d99>] cleaner_kthread+0x5a/0xe6 [btrfs]
[59318.623975]  [<ffffffffa0173d3f>] ? transaction_kthread+0x1a0/0x1a0 [btrfs]
[59318.623981]  [<ffffffff8104c750>] kthread+0xba/0xc2
[59318.623988]  [<ffffffff8104c696>] ? kthread_freezable_should_stop+0x52/0x52
[59318.623994]  [<ffffffff815f2f1c>] ret_from_fork+0x7c/0xb0
[59318.624000]  [<ffffffff8104c696>] ? kthread_freezable_should_stop+0x52/0x52
[59318.624022] ---[ end trace 61b4cbd936903010 ]---
[59318.626394] general protection fault: 0000 [#1] SMP
[59318.626439] Modules linked in: ipv6 snd_hda_codec_analog
snd_hda_intel i2c_i801 tg3 snd_hda_codec iTCO_wdt snd_hwdep snd_pcm
ppdev parport_pc sr_mod microcode floppy parport snd_page_alloc
snd_timer snd iTCO_vendor_support lpc_ich serio_raw pcspkr ablk_helper
cryptd lrw xts gf128mul aes_x86_64 sha256_generic fuse xfs nfs lockd
sunrpc reiserfs btrfs zlib_deflate ext4 jbd2 ext3 jbd ext2 mbcache
sl811_hcd hid_generic xhci_hcd ohci_hcd uhci_hcd ehci_hcd
[59318.626832] CPU 0
[59318.626849] Pid: 20523, comm: btrfs-cleaner Tainted: G        W
3.7.2-sad+ #1 Dell Inc.                 OptiPlex 745
/0WF810
[59318.626926] RIP: 0010:[<ffffffffa017b349>]  [<ffffffffa017b349>]
btrfs_clean_old_snapshots+0x125/0x168 [btrfs]
[59318.627018] RSP: 0018:ffff880078f43e38  EFLAGS: 00010206
[59318.627054] RAX: 0005800000021000 RBX: ffff880000000000 RCX: 0000000000000008
[59318.627098] RDX: 0000000000000000 RSI: ffff880078f43d70 RDI: ffff88006511c470
[59318.627141] RBP: ffff880078f43e78 R08: 0000000000000000 R09: ffff88004b61c3f0
[59318.627184] R10: 0000000000000001 R11: 0000000000000000 R12: 0000160000000000
[59318.627228] R13: ffff880078f43e38 R14: ffff88006511c000 R15: ffff88006511c438
[59318.627272] FS:  0000000000000000(0000) GS:ffff88007f200000(0000)
knlGS:0000000000000000
[59318.627322] CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
[59318.627358] CR2: 00007ff8b6d33375 CR3: 00000000788a0000 CR4: 00000000000007f0
[59318.627402] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[59318.627445] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
[59318.627490] Process btrfs-cleaner (pid: 20523, threadinfo
ffff880078f42000, task ffff88007c999a80)
[59318.627543] Stack:
[59318.627557]  ffff88006511c438 ffff880002046438 ffff880078f43e48
ffff880028017800
[59318.627611]  ffff88007c999a80 ffff88007c999a80 ffff88007c999a80
0000000000000000
[59318.627663]  ffff880078f43eb8 ffffffffa0173d99 ffff880028017800
0000000000000000
[59318.627719] Call Trace:
[59318.627751]  [<ffffffffa0173d99>] cleaner_kthread+0x5a/0xe6 [btrfs]
[59318.627804]  [<ffffffffa0173d3f>] ? transaction_kthread+0x1a0/0x1a0 [btrfs]
[59318.627853]  [<ffffffff8104c750>] kthread+0xba/0xc2
[59318.627898]  [<ffffffff8104c696>] ? kthread_freezable_should_stop+0x52/0x52
[59318.627948]  [<ffffffff815f2f1c>] ret_from_fork+0x7c/0xb0
[59318.627986]  [<ffffffff8104c696>] ? kthread_freezable_should_stop+0x52/0x52
[59318.628033] Code: 89 ff e8 cb e6 08 e1 4c 89 f7 e8 22 f4 03 00 49
8b 87 c8 fb ff ff 48 8b 80 50 01 00 00 48 8b 00 4c 01 e0 48 c1 f8 06
48 c1 e0 0c <0f> b6 44 18 3f 31 c9 31 d2 85 c0 7e 05 ba 01 00 00 00 31
f6 4c
[59318.628279] RIP  [<ffffffffa017b349>]
btrfs_clean_old_snapshots+0x125/0x168 [btrfs]
[59318.628295]  RSP <ffff880078f43e38>
[59318.634447] ---[ end trace 61b4cbd936903011 ]---
--
To unsubscribe from this list: send the line "unsubscribe linux-btrfs" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Liu Bo Jan. 23, 2013, 7:51 a.m. UTC | #7

On Tue, Jan 22, 2013 at 11:41:19AM -0600, Mitch Harder wrote:
> On Thu, Jan 17, 2013 at 8:42 AM, Mitch Harder
> <mitch.harder@sabayonlinux.org> wrote:
> > On Wed, Jan 16, 2013 at 6:36 AM, Liu Bo <bo.li.liu@oracle.com> wrote:
> >> This comes from one of btrfs's project ideas,
> >> As we defragment files, we break any sharing from other snapshots.
> >> The balancing code will preserve the sharing, and defrag needs to grow this
> >> as well.
[...]
> >
> > I've been testing this patch on a 3.7.2 kernel merged with the
> > for-linus branch for the 3.8_rc kernels, and I'm seeing the following
> > error:
> >
> 
> I've reproduced the error with CONFIG_DEBUG_LIST enabled, which shows
> some problem with an entry in the list.
> 
> [59312.260441] ------------[ cut here ]------------
> [59312.260454] WARNING: at lib/list_debug.c:62 __list_del_entry+0x8d/0x98()
> [59312.260458] Hardware name: OptiPlex 745
> [59312.260461] list_del corruption. next->prev should be
> ffff88006511c438, but was dead000000200200

LIST_POISON2 -> (000000200200)
So we can know that the next one is deleted from the list even _earlier_
than the current one is.

Any other messages before this warning complains?

thanks,
liubo

> [59312.260464] Modules linked in: ipv6 snd_hda_codec_analog
> snd_hda_intel i2c_i801 tg3 snd_hda_codec iTCO_wdt snd_hwdep snd_pcm
> ppdev parport_pc sr_mod microcode floppy parport snd_page_alloc
> snd_timer snd iTCO_vendor_support lpc_ich serio_raw pcspkr ablk_helper
> cryptd lrw xts gf128mul aes_x86_64 sha256_generic fuse xfs nfs lockd
> sunrpc reiserfs btrfs zlib_deflate ext4 jbd2 ext3 jbd ext2 mbcache
> sl811_hcd hid_generic xhci_hcd ohci_hcd uhci_hcd ehci_hcd
> [59312.260519] Pid: 20523, comm: btrfs-cleaner Not tainted 3.7.2-sad+ #1
> [59312.260521] Call Trace:
> [59312.260529]  [<ffffffff81030586>] warn_slowpath_common+0x83/0x9b
> [59312.260549]  [<ffffffffa015aa01>] ? reada_for_balance+0x187/0x218 [btrfs]
> [59312.260554]  [<ffffffff81030641>] warn_slowpath_fmt+0x46/0x48
> [59312.260566]  [<ffffffffa015aa01>] ? reada_for_balance+0x187/0x218 [btrfs]
> [59312.260570]  [<ffffffff812099e5>] __list_del_entry+0x8d/0x98
> [59312.260574]  [<ffffffff812099fe>] list_del+0xe/0x2e
> [59312.260590]  [<ffffffffa017b325>]
> btrfs_clean_old_snapshots+0x101/0x168 [btrfs]
> [59312.260605]  [<ffffffffa0173d99>] cleaner_kthread+0x5a/0xe6 [btrfs]
> [59312.260619]  [<ffffffffa0173d3f>] ? transaction_kthread+0x1a0/0x1a0 [btrfs]
> [59312.260624]  [<ffffffff8104c750>] kthread+0xba/0xc2
> [59312.260629]  [<ffffffff8104c696>] ? kthread_freezable_should_stop+0x52/0x52
> [59312.260634]  [<ffffffff815f2f1c>] ret_from_fork+0x7c/0xb0
> [59312.260639]  [<ffffffff8104c696>] ? kthread_freezable_should_stop+0x52/0x52
> [59312.260642] ---[ end trace 61b4cbd93690300f ]---
> [59318.623735] ------------[ cut here ]------------
> [59318.623751] WARNING: at lib/list_debug.c:53 __list_del_entry+0x8d/0x98()
> [59318.623755] Hardware name: OptiPlex 745
> [59318.623760] list_del corruption, ffff88006511c438->next is
> LIST_POISON1 (dead000000100100)
> [59318.623766] Modules linked in: ipv6 snd_hda_codec_analog
> snd_hda_intel i2c_i801 tg3 snd_hda_codec iTCO_wdt snd_hwdep snd_pcm
> ppdev parport_pc sr_mod microcode floppy parport snd_page_alloc
> snd_timer snd iTCO_vendor_support lpc_ich serio_raw pcspkr ablk_helper
> cryptd lrw xts gf128mul aes_x86_64 sha256_generic fuse xfs nfs lockd
> sunrpc reiserfs btrfs zlib_deflate ext4 jbd2 ext3 jbd ext2 mbcache
> sl811_hcd hid_generic xhci_hcd ohci_hcd uhci_hcd ehci_hcd
> [59318.623840] Pid: 20523, comm: btrfs-cleaner Tainted: G        W
> 3.7.2-sad+ #1
> [59318.623844] Call Trace:
> [59318.623855]  [<ffffffff81030586>] warn_slowpath_common+0x83/0x9b
> [59318.623878]  [<ffffffffa015aab9>] ? btrfs_free_path+0x27/0x2c [btrfs]
> [59318.623885]  [<ffffffff81030641>] warn_slowpath_fmt+0x46/0x48
> [59318.623901]  [<ffffffffa015aab9>] ? btrfs_free_path+0x27/0x2c [btrfs]
> [59318.623907]  [<ffffffff812099e5>] __list_del_entry+0x8d/0x98
> [59318.623912]  [<ffffffff812099fe>] list_del+0xe/0x2e
> [59318.623935]  [<ffffffffa017b325>]
> btrfs_clean_old_snapshots+0x101/0x168 [btrfs]
> [59318.623955]  [<ffffffffa0173d99>] cleaner_kthread+0x5a/0xe6 [btrfs]
> [59318.623975]  [<ffffffffa0173d3f>] ? transaction_kthread+0x1a0/0x1a0 [btrfs]
> [59318.623981]  [<ffffffff8104c750>] kthread+0xba/0xc2
> [59318.623988]  [<ffffffff8104c696>] ? kthread_freezable_should_stop+0x52/0x52
> [59318.623994]  [<ffffffff815f2f1c>] ret_from_fork+0x7c/0xb0
> [59318.624000]  [<ffffffff8104c696>] ? kthread_freezable_should_stop+0x52/0x52
> [59318.624022] ---[ end trace 61b4cbd936903010 ]---
> [59318.626394] general protection fault: 0000 [#1] SMP
> [59318.626439] Modules linked in: ipv6 snd_hda_codec_analog
> snd_hda_intel i2c_i801 tg3 snd_hda_codec iTCO_wdt snd_hwdep snd_pcm
> ppdev parport_pc sr_mod microcode floppy parport snd_page_alloc
> snd_timer snd iTCO_vendor_support lpc_ich serio_raw pcspkr ablk_helper
> cryptd lrw xts gf128mul aes_x86_64 sha256_generic fuse xfs nfs lockd
> sunrpc reiserfs btrfs zlib_deflate ext4 jbd2 ext3 jbd ext2 mbcache
> sl811_hcd hid_generic xhci_hcd ohci_hcd uhci_hcd ehci_hcd
> [59318.626832] CPU 0
> [59318.626849] Pid: 20523, comm: btrfs-cleaner Tainted: G        W
> 3.7.2-sad+ #1 Dell Inc.                 OptiPlex 745
> /0WF810
> [59318.626926] RIP: 0010:[<ffffffffa017b349>]  [<ffffffffa017b349>]
> btrfs_clean_old_snapshots+0x125/0x168 [btrfs]
> [59318.627018] RSP: 0018:ffff880078f43e38  EFLAGS: 00010206
> [59318.627054] RAX: 0005800000021000 RBX: ffff880000000000 RCX: 0000000000000008
> [59318.627098] RDX: 0000000000000000 RSI: ffff880078f43d70 RDI: ffff88006511c470
> [59318.627141] RBP: ffff880078f43e78 R08: 0000000000000000 R09: ffff88004b61c3f0
> [59318.627184] R10: 0000000000000001 R11: 0000000000000000 R12: 0000160000000000
> [59318.627228] R13: ffff880078f43e38 R14: ffff88006511c000 R15: ffff88006511c438
> [59318.627272] FS:  0000000000000000(0000) GS:ffff88007f200000(0000)
> knlGS:0000000000000000
> [59318.627322] CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
> [59318.627358] CR2: 00007ff8b6d33375 CR3: 00000000788a0000 CR4: 00000000000007f0
> [59318.627402] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
> [59318.627445] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
> [59318.627490] Process btrfs-cleaner (pid: 20523, threadinfo
> ffff880078f42000, task ffff88007c999a80)
> [59318.627543] Stack:
> [59318.627557]  ffff88006511c438 ffff880002046438 ffff880078f43e48
> ffff880028017800
> [59318.627611]  ffff88007c999a80 ffff88007c999a80 ffff88007c999a80
> 0000000000000000
> [59318.627663]  ffff880078f43eb8 ffffffffa0173d99 ffff880028017800
> 0000000000000000
> [59318.627719] Call Trace:
> [59318.627751]  [<ffffffffa0173d99>] cleaner_kthread+0x5a/0xe6 [btrfs]
> [59318.627804]  [<ffffffffa0173d3f>] ? transaction_kthread+0x1a0/0x1a0 [btrfs]
> [59318.627853]  [<ffffffff8104c750>] kthread+0xba/0xc2
> [59318.627898]  [<ffffffff8104c696>] ? kthread_freezable_should_stop+0x52/0x52
> [59318.627948]  [<ffffffff815f2f1c>] ret_from_fork+0x7c/0xb0
> [59318.627986]  [<ffffffff8104c696>] ? kthread_freezable_should_stop+0x52/0x52
> [59318.628033] Code: 89 ff e8 cb e6 08 e1 4c 89 f7 e8 22 f4 03 00 49
> 8b 87 c8 fb ff ff 48 8b 80 50 01 00 00 48 8b 00 4c 01 e0 48 c1 f8 06
> 48 c1 e0 0c <0f> b6 44 18 3f 31 c9 31 d2 85 c0 7e 05 ba 01 00 00 00 31
> f6 4c
> [59318.628279] RIP  [<ffffffffa017b349>]
> btrfs_clean_old_snapshots+0x125/0x168 [btrfs]
> [59318.628295]  RSP <ffff880078f43e38>
> [59318.634447] ---[ end trace 61b4cbd936903011 ]---
--
To unsubscribe from this list: send the line "unsubscribe linux-btrfs" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Mitch Harder Jan. 23, 2013, 4:05 p.m. UTC | #8

On Wed, Jan 23, 2013 at 1:51 AM, Liu Bo <bo.li.liu@oracle.com> wrote:
> On Tue, Jan 22, 2013 at 11:41:19AM -0600, Mitch Harder wrote:
>> On Thu, Jan 17, 2013 at 8:42 AM, Mitch Harder
>> <mitch.harder@sabayonlinux.org> wrote:
>> > On Wed, Jan 16, 2013 at 6:36 AM, Liu Bo <bo.li.liu@oracle.com> wrote:
>> >> This comes from one of btrfs's project ideas,
>> >> As we defragment files, we break any sharing from other snapshots.
>> >> The balancing code will preserve the sharing, and defrag needs to grow this
>> >> as well.
> [...]
>> >
>> > I've been testing this patch on a 3.7.2 kernel merged with the
>> > for-linus branch for the 3.8_rc kernels, and I'm seeing the following
>> > error:
>> >
>>
>> I've reproduced the error with CONFIG_DEBUG_LIST enabled, which shows
>> some problem with an entry in the list.
>>
>> [59312.260441] ------------[ cut here ]------------
>> [59312.260454] WARNING: at lib/list_debug.c:62 __list_del_entry+0x8d/0x98()
>> [59312.260458] Hardware name: OptiPlex 745
>> [59312.260461] list_del corruption. next->prev should be
>> ffff88006511c438, but was dead000000200200
>
> LIST_POISON2 -> (000000200200)
> So we can know that the next one is deleted from the list even _earlier_
> than the current one is.
>
> Any other messages before this warning complains?
>

Just some normal feedback from a metadata balance I had run.

[14057.193343] device fsid 28c688c5-7dbd-4071-b271-1bf6726d8835 devid
1 transid 4 /dev/sda7
[14057.194438] btrfs: force lzo compression
[14057.194446] btrfs: enabling auto defrag
[14057.194449] btrfs: disk space caching is enabled
[14057.194452] btrfs flagging fs with big metadata feature
[14057.194455] btrfs: lzo incompat flag set.
[57508.799193] btrfs: relocating block group 14516486144 flags 4
[57632.178797] btrfs: found 6775 extents
[57633.214701] btrfs: relocating block group 11832131584 flags 4
[57776.400102] btrfs: found 6480 extents
[57777.021175] btrfs: relocating block group 10489954304 flags 4
[57949.182725] btrfs: found 6681 extents
[59312.260441] ------------[ cut here ]------------
[59312.260454] WARNING: at lib/list_debug.c:62 __list_del_entry+0x8d/0x98()
[59312.260458] Hardware name: OptiPlex 745
...

I'm going to try to wrap some debugging around the section of code in
btrfs_clean_old_snapshots() where the dead_roots list is spliced onto
the root list being processed.  The double entry may be slipping in
here.

1764         spin_lock(&fs_info->trans_lock);
1765         list_splice_init(&fs_info->dead_roots, &list);
1766         spin_unlock(&fs_info->trans_lock);
--
To unsubscribe from this list: send the line "unsubscribe linux-btrfs" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Liu Bo Jan. 24, 2013, 12:52 a.m. UTC | #9

On Wed, Jan 23, 2013 at 10:05:04AM -0600, Mitch Harder wrote:
> On Wed, Jan 23, 2013 at 1:51 AM, Liu Bo <bo.li.liu@oracle.com> wrote:
> > On Tue, Jan 22, 2013 at 11:41:19AM -0600, Mitch Harder wrote:
> >> On Thu, Jan 17, 2013 at 8:42 AM, Mitch Harder
> >> <mitch.harder@sabayonlinux.org> wrote:
> >> > On Wed, Jan 16, 2013 at 6:36 AM, Liu Bo <bo.li.liu@oracle.com> wrote:
> >> >> This comes from one of btrfs's project ideas,
> >> >> As we defragment files, we break any sharing from other snapshots.
> >> >> The balancing code will preserve the sharing, and defrag needs to grow this
> >> >> as well.
> > [...]
> >> >
> >> > I've been testing this patch on a 3.7.2 kernel merged with the
> >> > for-linus branch for the 3.8_rc kernels, and I'm seeing the following
> >> > error:
> >> >
> >>
> >> I've reproduced the error with CONFIG_DEBUG_LIST enabled, which shows
> >> some problem with an entry in the list.
> >>
> >> [59312.260441] ------------[ cut here ]------------
> >> [59312.260454] WARNING: at lib/list_debug.c:62 __list_del_entry+0x8d/0x98()
> >> [59312.260458] Hardware name: OptiPlex 745
> >> [59312.260461] list_del corruption. next->prev should be
> >> ffff88006511c438, but was dead000000200200
> >
> > LIST_POISON2 -> (000000200200)
> > So we can know that the next one is deleted from the list even _earlier_
> > than the current one is.
> >
> > Any other messages before this warning complains?
> >
> 
> Just some normal feedback from a metadata balance I had run.

Well, these do fit my expectation, since balance also involves with playing with
root_list, which may lead to the bad situation.

> 
> [14057.193343] device fsid 28c688c5-7dbd-4071-b271-1bf6726d8835 devid
> 1 transid 4 /dev/sda7
> [14057.194438] btrfs: force lzo compression
> [14057.194446] btrfs: enabling auto defrag
> [14057.194449] btrfs: disk space caching is enabled
> [14057.194452] btrfs flagging fs with big metadata feature
> [14057.194455] btrfs: lzo incompat flag set.
> [57508.799193] btrfs: relocating block group 14516486144 flags 4
> [57632.178797] btrfs: found 6775 extents
> [57633.214701] btrfs: relocating block group 11832131584 flags 4
> [57776.400102] btrfs: found 6480 extents
> [57777.021175] btrfs: relocating block group 10489954304 flags 4
> [57949.182725] btrfs: found 6681 extents
> [59312.260441] ------------[ cut here ]------------
> [59312.260454] WARNING: at lib/list_debug.c:62 __list_del_entry+0x8d/0x98()
> [59312.260458] Hardware name: OptiPlex 745
> ...
> 
> I'm going to try to wrap some debugging around the section of code in
> btrfs_clean_old_snapshots() where the dead_roots list is spliced onto
> the root list being processed.  The double entry may be slipping in
> here.
> 
> 1764         spin_lock(&fs_info->trans_lock);
> 1765         list_splice_init(&fs_info->dead_roots, &list);
> 1766         spin_unlock(&fs_info->trans_lock);

hmm, I don't think there is anything wrong in this code.  But you can
give it a shot anyway :)

thanks,
liubo
--
To unsubscribe from this list: send the line "unsubscribe linux-btrfs" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Mitch Harder Jan. 25, 2013, 2:55 p.m. UTC | #10

On Wed, Jan 23, 2013 at 6:52 PM, Liu Bo <bo.li.liu@oracle.com> wrote:
> On Wed, Jan 23, 2013 at 10:05:04AM -0600, Mitch Harder wrote:
>> On Wed, Jan 23, 2013 at 1:51 AM, Liu Bo <bo.li.liu@oracle.com> wrote:
>> > On Tue, Jan 22, 2013 at 11:41:19AM -0600, Mitch Harder wrote:
>> >> On Thu, Jan 17, 2013 at 8:42 AM, Mitch Harder
>> >> <mitch.harder@sabayonlinux.org> wrote:
>> >> > On Wed, Jan 16, 2013 at 6:36 AM, Liu Bo <bo.li.liu@oracle.com> wrote:
>> >> >> This comes from one of btrfs's project ideas,
>> >> >> As we defragment files, we break any sharing from other snapshots.
>> >> >> The balancing code will preserve the sharing, and defrag needs to grow this
>> >> >> as well.
>> > [...]
>> >> >
>> >> > I've been testing this patch on a 3.7.2 kernel merged with the
>> >> > for-linus branch for the 3.8_rc kernels, and I'm seeing the following
>> >> > error:
>> >> >
>> >>
>> >> I've reproduced the error with CONFIG_DEBUG_LIST enabled, which shows
>> >> some problem with an entry in the list.
>> >>
>> >> [59312.260441] ------------[ cut here ]------------
>> >> [59312.260454] WARNING: at lib/list_debug.c:62 __list_del_entry+0x8d/0x98()
>> >> [59312.260458] Hardware name: OptiPlex 745
>> >> [59312.260461] list_del corruption. next->prev should be
>> >> ffff88006511c438, but was dead000000200200
>> >
>> > LIST_POISON2 -> (000000200200)
>> > So we can know that the next one is deleted from the list even _earlier_
>> > than the current one is.
>> >
>> > Any other messages before this warning complains?
>> >
>>
>> Just some normal feedback from a metadata balance I had run.
>
> Well, these do fit my expectation, since balance also involves with playing with
> root_list, which may lead to the bad situation.
>
>>
>> [14057.193343] device fsid 28c688c5-7dbd-4071-b271-1bf6726d8835 devid
>> 1 transid 4 /dev/sda7
>> [14057.194438] btrfs: force lzo compression
>> [14057.194446] btrfs: enabling auto defrag
>> [14057.194449] btrfs: disk space caching is enabled
>> [14057.194452] btrfs flagging fs with big metadata feature
>> [14057.194455] btrfs: lzo incompat flag set.
>> [57508.799193] btrfs: relocating block group 14516486144 flags 4
>> [57632.178797] btrfs: found 6775 extents
>> [57633.214701] btrfs: relocating block group 11832131584 flags 4
>> [57776.400102] btrfs: found 6480 extents
>> [57777.021175] btrfs: relocating block group 10489954304 flags 4
>> [57949.182725] btrfs: found 6681 extents
>> [59312.260441] ------------[ cut here ]------------
>> [59312.260454] WARNING: at lib/list_debug.c:62 __list_del_entry+0x8d/0x98()
>> [59312.260458] Hardware name: OptiPlex 745
>> ...
>>
>> I'm going to try to wrap some debugging around the section of code in
>> btrfs_clean_old_snapshots() where the dead_roots list is spliced onto
>> the root list being processed.  The double entry may be slipping in
>> here.
>>
>> 1764         spin_lock(&fs_info->trans_lock);
>> 1765         list_splice_init(&fs_info->dead_roots, &list);
>> 1766         spin_unlock(&fs_info->trans_lock);
>
> hmm, I don't think there is anything wrong in this code.  But you can
> give it a shot anyway :)
>

I've changed up my reproducer to try some things that may hit the
issue quicker and more reliably.

It gave me a slightly different set of warnings in dmesg, which seem
to suggest issues in the dead_root list.

[43925.656065] device fsid a8f6fadb-3022-4c01-b369-f1f3f638c052 devid
1 transid 310 /dev/sda7
[43925.658062] btrfs: force lzo compression
[43925.658072] btrfs: enabling auto defrag
[43925.658075] btrfs: disk space caching is enabled
[43925.658078] btrfs: lzo incompat flag set.
[44503.421293] btrfs: unlinked 1 orphans
[44898.287365] btrfs: unlinked 1 orphans
[45080.641383] btrfs: unlinked 1 orphans
[45250.063773] btrfs: unlinked 1 orphans
[46223.387355] btrfs: unlinked 1 orphans
[46476.473944] btrfs: unlinked 1 orphans
[46499.665615] btrfs: unlinked 1 orphans
[46769.785454] ------------[ cut here ]------------
[46769.785471] WARNING: at lib/list_debug.c:36 __list_add+0x9d/0xba()
[46769.785474] Hardware name: OptiPlex 745
[46769.785478] list_add double add: new=ffff880050c27c38,
prev=ffff880078f3e720, next=ffff880050c27c38.
[46769.785480] Modules linked in: ipv6 snd_hda_codec_analog
snd_hda_intel snd_hda_codec snd_hwdep snd_pcm snd_page_alloc snd_timer
tg3 sr_mod snd i2c_i801 ppdev parport_pc iTCO_wdt iTCO_vendor_support
lpc_ich pcspkr parport floppy serio_raw microcode ablk_helper cryptd
lrw xts gf128mul aes_x86_64 sha256_generic fuse xfs nfs lockd sunrpc
reiserfs btrfs zlib_deflate ext4 jbd2 ext3 jbd ext2 mbcache sl811_hcd
hid_generic xhci_hcd ohci_hcd uhci_hcd ehci_hcd
[46769.785537] Pid: 18291, comm: btrfs-endio-wri Not tainted 3.7.4-sad-v1+ #3
[46769.785539] Call Trace:
[46769.785549]  [<ffffffff81030586>] warn_slowpath_common+0x83/0x9b
[46769.785553]  [<ffffffff81030641>] warn_slowpath_fmt+0x46/0x48
[46769.785558]  [<ffffffff8120987b>] __list_add+0x9d/0xba
[46769.785586]  [<ffffffffa0179dd6>] btrfs_add_dead_root+0x42/0x56 [btrfs]
[46769.785603]  [<ffffffffa0187b67>] btrfs_destroy_inode+0x227/0x25b [btrfs]
[46769.785611]  [<ffffffff8111393a>] destroy_inode+0x3b/0x54
[46769.785615]  [<ffffffff81113a9c>] evict+0x149/0x151
[46769.785619]  [<ffffffff81114322>] iput+0x12c/0x135
[46769.785636]  [<ffffffffa018455f>] relink_extent_backref+0x669/0x6af [btrfs]
[46769.785642]  [<ffffffff815e9849>] ? __slab_free+0x17c/0x21b
[46769.785658]  [<ffffffffa0184d15>] ?
btrfs_finish_ordered_io+0x770/0x827 [btrfs]
[46769.785674]  [<ffffffffa0184ce5>] btrfs_finish_ordered_io+0x740/0x827 [btrfs]
[46769.785691]  [<ffffffffa0184de1>] finish_ordered_fn+0x15/0x17 [btrfs]
[46769.785706]  [<ffffffffa019e5c9>] worker_loop+0x14c/0x493 [btrfs]
[46769.785722]  [<ffffffffa019e47d>] ? btrfs_queue_worker+0x258/0x258 [btrfs]
[46769.785728]  [<ffffffff8104c750>] kthread+0xba/0xc2
[46769.785732]  [<ffffffff8104c696>] ? kthread_freezable_should_stop+0x52/0x52
[46769.785737]  [<ffffffff815f301c>] ret_from_fork+0x7c/0xb0
[46769.785741]  [<ffffffff8104c696>] ? kthread_freezable_should_stop+0x52/0x52
[46769.785745] ---[ end trace 7528086f91b151b5 ]---
[46799.053062] ------------[ cut here ]------------
[46799.053078] WARNING: at lib/list_debug.c:62 __list_del_entry+0x8d/0x98()
[46799.053082] Hardware name: OptiPlex 745
[46799.053087] list_del corruption. next->prev should be
ffff880050c27c38, but was ffff8800057fde38
[46799.053090] Modules linked in: ipv6 snd_hda_codec_analog
snd_hda_intel snd_hda_codec snd_hwdep snd_pcm snd_page_alloc snd_timer
tg3 sr_mod snd i2c_i801 ppdev parport_pc iTCO_wdt iTCO_vendor_support
lpc_ich pcspkr parport floppy serio_raw microcode ablk_helper cryptd
lrw xts gf128mul aes_x86_64 sha256_generic fuse xfs nfs lockd sunrpc
reiserfs btrfs zlib_deflate ext4 jbd2 ext3 jbd ext2 mbcache sl811_hcd
hid_generic xhci_hcd ohci_hcd uhci_hcd ehci_hcd
[46799.053163] Pid: 18210, comm: btrfs-cleaner Tainted: G        W
3.7.4-sad-v1+ #3
[46799.053166] Call Trace:
[46799.053180]  [<ffffffff81030586>] warn_slowpath_common+0x83/0x9b
[46799.053184]  [<ffffffff81030641>] warn_slowpath_fmt+0x46/0x48
[46799.053190]  [<ffffffff810ab4e9>] ? __trace_bprintk+0x48/0x4a
[46799.053194]  [<ffffffff812097a5>] __list_del_entry+0x8d/0x98
[46799.053198]  [<ffffffff812097be>] list_del+0xe/0x2e
[46799.053220]  [<ffffffffa017b2f5>]
btrfs_clean_old_snapshots+0xed/0x150 [btrfs]
[46799.053235]  [<ffffffffa0173d7d>] cleaner_kthread+0x5a/0xe6 [btrfs]
[46799.053249]  [<ffffffffa0173d23>] ? transaction_kthread+0x1a0/0x1a0 [btrfs]
[46799.053254]  [<ffffffff8104c750>] kthread+0xba/0xc2
[46799.053259]  [<ffffffff8104c696>] ? kthread_freezable_should_stop+0x52/0x52
[46799.053264]  [<ffffffff815f301c>] ret_from_fork+0x7c/0xb0
[46799.053269]  [<ffffffff8104c696>] ? kthread_freezable_should_stop+0x52/0x52
[46799.053272] ---[ end trace 7528086f91b151b6 ]---
[46811.162649] ------------[ cut here ]------------
[46811.162665] WARNING: at lib/list_debug.c:53 __list_del_entry+0x8d/0x98()
[46811.162669] Hardware name: OptiPlex 745
[46811.162674] list_del corruption, ffff880050c27c38->next is
LIST_POISON1 (dead000000100100)
[46811.162678] Modules linked in: ipv6 snd_hda_codec_analog
snd_hda_intel snd_hda_codec snd_hwdep snd_pcm snd_page_alloc snd_timer
tg3 sr_mod snd i2c_i801 ppdev parport_pc iTCO_wdt iTCO_vendor_support
lpc_ich pcspkr parport floppy serio_raw microcode ablk_helper cryptd
lrw xts gf128mul aes_x86_64 sha256_generic fuse xfs nfs lockd sunrpc
reiserfs btrfs zlib_deflate ext4 jbd2 ext3 jbd ext2 mbcache sl811_hcd
hid_generic xhci_hcd ohci_hcd uhci_hcd ehci_hcd
[46811.162750] Pid: 18210, comm: btrfs-cleaner Tainted: G        W
3.7.4-sad-v1+ #3
[46811.162754] Call Trace:
[46811.162764]  [<ffffffff81030586>] warn_slowpath_common+0x83/0x9b
[46811.162771]  [<ffffffff81030641>] warn_slowpath_fmt+0x46/0x48
[46811.162779]  [<ffffffff810ab4e9>] ? __trace_bprintk+0x48/0x4a
[46811.162785]  [<ffffffff812097a5>] __list_del_entry+0x8d/0x98
[46811.162791]  [<ffffffff812097be>] list_del+0xe/0x2e
[46811.162820]  [<ffffffffa017b2f5>]
btrfs_clean_old_snapshots+0xed/0x150 [btrfs]
[46811.162841]  [<ffffffffa0173d7d>] cleaner_kthread+0x5a/0xe6 [btrfs]
[46811.162862]  [<ffffffffa0173d23>] ? transaction_kthread+0x1a0/0x1a0 [btrfs]
[46811.162869]  [<ffffffff8104c750>] kthread+0xba/0xc2
[46811.162875]  [<ffffffff8104c696>] ? kthread_freezable_should_stop+0x52/0x52
[46811.162882]  [<ffffffff815f301c>] ret_from_fork+0x7c/0xb0
[46811.162888]  [<ffffffff8104c696>] ? kthread_freezable_should_stop+0x52/0x52
[46811.162892] ---[ end trace 7528086f91b151b7 ]---
[46811.162904] BUG: unable to handle kernel paging request at 0000000047c5a000
[46811.163003] IP: [<ffffffffa017b30b>]
btrfs_clean_old_snapshots+0x103/0x150 [btrfs]
[46811.163003] PGD 0
[46811.163003] Oops: 0000 [#1] SMP
[46811.163003] Modules linked in: ipv6 snd_hda_codec_analog
snd_hda_intel snd_hda_codec snd_hwdep snd_pcm snd_page_alloc snd_timer
tg3 sr_mod snd i2c_i801 ppdev parport_pc iTCO_wdt iTCO_vendor_support
lpc_ich pcspkr parport floppy serio_raw microcode ablk_helper cryptd
lrw xts gf128mul aes_x86_64 sha256_generic fuse xfs nfs lockd sunrpc
reiserfs btrfs zlib_deflate ext4 jbd2 ext3 jbd ext2 mbcache sl811_hcd
hid_generic xhci_hcd ohci_hcd uhci_hcd ehci_hcd
[46811.163003] CPU 0
[46811.163003] Pid: 18210, comm: btrfs-cleaner Tainted: G        W
3.7.4-sad-v1+ #3 Dell Inc.                 OptiPlex 745
 /0WF810
[46811.163003] RIP: 0010:[<ffffffffa017b30b>]  [<ffffffffa017b30b>]
btrfs_clean_old_snapshots+0x103/0x150 [btrfs]
[46811.163003] RSP: 0018:ffff8800057fde38  EFLAGS: 00010296
[46811.163003] RAX: 0000000047c5a000 RBX: ffff880050c27800 RCX: 0000000000000008
[46811.163003] RDX: 0000000000000000 RSI: ffff8800057fdd70 RDI: ffff880050c27c70
[46811.163003] RBP: ffff8800057fde78 R08: 0000000000000000 R09: 0000000000000283
[46811.163003] R10: 0000000000000001 R11: 0000000000000000 R12: ffff880000000000
[46811.163003] R13: 0000160000000000 R14: ffff8800057fde38 R15: ffff880050c27c38
[46811.163003] FS:  0000000000000000(0000) GS:ffff88007f200000(0000)
knlGS:0000000000000000
[46811.163003] CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
[46811.163003] CR2: 0000000047c5a000 CR3: 000000003f270000 CR4: 00000000000007f0
[46811.163003] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[46811.163003] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
[46811.163003] Process btrfs-cleaner (pid: 18210, threadinfo
ffff8800057fc000, task ffff88007c030d40)
[46811.163003] Stack:
[46811.163003]  ffff880050c27c38 ffff88001f488438 ffff8800057fde48
ffff88002d15b800
[46811.163003]  ffff88007c030d40 ffff88007c030d40 ffff88007c030d40
0000000000000000
[46811.163003]  ffff8800057fdeb8 ffffffffa0173d7d ffff88002d15b800
0000000000000000
[46811.163003] Call Trace:
[46811.163003]  [<ffffffffa0173d7d>] cleaner_kthread+0x5a/0xe6 [btrfs]
[46811.163003]  [<ffffffffa0173d23>] ? transaction_kthread+0x1a0/0x1a0 [btrfs]
[46811.163003]  [<ffffffff8104c750>] kthread+0xba/0xc2
[46811.163003]  [<ffffffff8104c696>] ? kthread_freezable_should_stop+0x52/0x52
[46811.163003]  [<ffffffff815f301c>] ret_from_fork+0x7c/0xb0
[46811.163003]  [<ffffffff8104c696>] ? kthread_freezable_should_stop+0x52/0x52
[46811.163003] Code: c7 c7 d5 b2 17 a0 31 c0 e8 b4 01 f3 e0 4c 89 ff
e8 bb e4 08 e1 48 89 df e8 f2 f5 03 00 49 8b 87 c8 fb ff ff 48 8b 80
50 01 00 00 <48> 8b 00 4c 01 e8 48 c1 f8 06 48 c1 e0 0c 42 0f b6 44 20
3f 31
[46811.163003] RIP  [<ffffffffa017b30b>]
btrfs_clean_old_snapshots+0x103/0x150 [btrfs]
[46811.163003]  RSP <ffff8800057fde38>
[46811.163003] CR2: 0000000047c5a000
[46811.238512] ---[ end trace 7528086f91b151b8 ]---
--
To unsubscribe from this list: send the line "unsubscribe linux-btrfs" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Stefan Behrens Jan. 25, 2013, 3:40 p.m. UTC | #11

On Fri, 25 Jan 2013 08:55:58 -0600, Mitch Harder wrote:
> On Wed, Jan 23, 2013 at 6:52 PM, Liu Bo <bo.li.liu@oracle.com> wrote:
>> On Wed, Jan 23, 2013 at 10:05:04AM -0600, Mitch Harder wrote:
>>> On Wed, Jan 23, 2013 at 1:51 AM, Liu Bo <bo.li.liu@oracle.com> wrote:
>>>> On Tue, Jan 22, 2013 at 11:41:19AM -0600, Mitch Harder wrote:
>>>>> On Thu, Jan 17, 2013 at 8:42 AM, Mitch Harder
>>>>> <mitch.harder@sabayonlinux.org> wrote:
>>>>>> On Wed, Jan 16, 2013 at 6:36 AM, Liu Bo <bo.li.liu@oracle.com> wrote:
>>>>>>> This comes from one of btrfs's project ideas,
>>>>>>> As we defragment files, we break any sharing from other snapshots.
>>>>>>> The balancing code will preserve the sharing, and defrag needs to grow this
>>>>>>> as well.
>>>> [...]
>>>>>>
>>>>>> I've been testing this patch on a 3.7.2 kernel merged with the
>>>>>> for-linus branch for the 3.8_rc kernels, and I'm seeing the following
>>>>>> error:
[...]
> 
> I've changed up my reproducer to try some things that may hit the
> issue quicker and more reliably.
> 
> It gave me a slightly different set of warnings in dmesg, which seem
> to suggest issues in the dead_root list.
[...]
> [46769.785454] ------------[ cut here ]------------
> [46769.785471] WARNING: at lib/list_debug.c:36 __list_add+0x9d/0xba()
> [46769.785474] Hardware name: OptiPlex 745
> [46769.785478] list_add double add: new=ffff880050c27c38,
> prev=ffff880078f3e720, next=ffff880050c27c38.
> [46769.785480] Modules linked in: ipv6 snd_hda_codec_analog
> snd_hda_intel snd_hda_codec snd_hwdep snd_pcm snd_page_alloc snd_timer
> tg3 sr_mod snd i2c_i801 ppdev parport_pc iTCO_wdt iTCO_vendor_support
> lpc_ich pcspkr parport floppy serio_raw microcode ablk_helper cryptd
> lrw xts gf128mul aes_x86_64 sha256_generic fuse xfs nfs lockd sunrpc
> reiserfs btrfs zlib_deflate ext4 jbd2 ext3 jbd ext2 mbcache sl811_hcd
> hid_generic xhci_hcd ohci_hcd uhci_hcd ehci_hcd
> [46769.785537] Pid: 18291, comm: btrfs-endio-wri Not tainted 3.7.4-sad-v1+ #3
> [46769.785539] Call Trace:
> [46769.785549]  [<ffffffff81030586>] warn_slowpath_common+0x83/0x9b
> [46769.785553]  [<ffffffff81030641>] warn_slowpath_fmt+0x46/0x48
> [46769.785558]  [<ffffffff8120987b>] __list_add+0x9d/0xba
> [46769.785586]  [<ffffffffa0179dd6>] btrfs_add_dead_root+0x42/0x56 [btrfs]
> [46769.785603]  [<ffffffffa0187b67>] btrfs_destroy_inode+0x227/0x25b [btrfs]
> [46769.785611]  [<ffffffff8111393a>] destroy_inode+0x3b/0x54
> [46769.785615]  [<ffffffff81113a9c>] evict+0x149/0x151
> [46769.785619]  [<ffffffff81114322>] iput+0x12c/0x135
> [46769.785636]  [<ffffffffa018455f>] relink_extent_backref+0x669/0x6af [btrfs]
> [46769.785642]  [<ffffffff815e9849>] ? __slab_free+0x17c/0x21b
> [46769.785658]  [<ffffffffa0184d15>] ?
> btrfs_finish_ordered_io+0x770/0x827 [btrfs]
> [46769.785674]  [<ffffffffa0184ce5>] btrfs_finish_ordered_io+0x740/0x827 [btrfs]
> [46769.785691]  [<ffffffffa0184de1>] finish_ordered_fn+0x15/0x17 [btrfs]
> [46769.785706]  [<ffffffffa019e5c9>] worker_loop+0x14c/0x493 [btrfs]
> [46769.785722]  [<ffffffffa019e47d>] ? btrfs_queue_worker+0x258/0x258 [btrfs]
> [46769.785728]  [<ffffffff8104c750>] kthread+0xba/0xc2
> [46769.785732]  [<ffffffff8104c696>] ? kthread_freezable_should_stop+0x52/0x52
> [46769.785737]  [<ffffffff815f301c>] ret_from_fork+0x7c/0xb0
> [46769.785741]  [<ffffffff8104c696>] ? kthread_freezable_should_stop+0x52/0x52
> [46769.785745] ---[ end trace 7528086f91b151b5 ]---
> [46799.053062] ------------[ cut here ]------------

Well, the issue that I had reported on IRC some days ago which looks similar (the top part of the call trace is similar: iput -> evict -> destroy_inode -> btrfs_destroy_inode -> btrfs_add_dead_root -> list_add which warns in list_add in your case and crashes in my case) was without Liu Bo's "snapshot-aware defrag" patch. A 3.8.0-rc4 kernel and nothing else.

The reproducer was to create and destroy subvolumes and snapshots. I used btrfs-receive to fill them with data. The crash happened on umount. Every time.

del_fs_roots() is attempting to empty the dead_roots list, and via btrfs_destroy_inode() deeper in the call stack they are added back to the dead_roots list.

BUG: unable to handle kernel paging request at ffff88042503b830
IP: [<ffffffff814532b7>] __list_add+0x17/0xd0
PGD 1e0c063 PUD bf58e067 PMD bf6b7067 PTE 800000042503b160
Oops: 0000 [#1] PREEMPT SMP DEBUG_PAGEALLOC
Modules linked in: btrfs bonding raid1 mpt2sas scsi_transport_sas raid_class
CPU 2
Pid: 10259, comm: umount Not tainted 3.8.0-rc4+ #16 Supermicro X8SIL/X8SIL
RIP: 0010:[<ffffffff814532b7>]  [<ffffffff814532b7>] __list_add+0x17/0xd0
RSP: 0018:ffff8802f67a1bd8  EFLAGS: 00010286
RAX: ffff880425b7c560 RBX: ffff880423ca2828 RCX: 0000000000000001
RDX: ffff88042503b828 RSI: ffff8804257794c0 RDI: ffff880423ca2828
RBP: ffff8802f67a1bf8 R08: 0000000000077850 R09: 0000000000000000
R10: 0000000000000000 R11: 0000000000000001 R12: ffff880423ca2000
R13: ffff880423ca2898 R14: 0000000000000000 R15: ffff8802f67a1d30
FS:  00007f6e89bba740(0000) GS:ffff88042ea00000(0000) knlGS:0000000000000000
CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
CR2: ffff88042503b830 CR3: 000000029a56c000 CR4: 00000000000007e0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
Process umount (pid: 10259, threadinfo ffff8802f67a0000, task ffff880425b7c560)
Stack:
 ffffffffa00a414f ffff880423ca2000 ffff880423ca2000 ffff880423ca2898
 ffff8802f67a1c18 ffffffffa00a4170 ffff88042a60c1f8 ffff88042a60c1f8
 ffff8802f67a1c48 ffffffffa00b3180 ffff88042a60c1f8 ffff88042a60c280
Call Trace:
 [<ffffffffa00a414f>] ? btrfs_add_dead_root+0x1f/0x60 [btrfs]
 [<ffffffffa00a4170>] btrfs_add_dead_root+0x40/0x60 [btrfs]
 [<ffffffffa00b3180>] btrfs_destroy_inode+0x1d0/0x2d0 [btrfs]
 [<ffffffff811b5d17>] destroy_inode+0x37/0x60
 [<ffffffff811b5e4d>] evict+0x10d/0x1a0
 [<ffffffff811b65f5>] iput+0x105/0x190
 [<ffffffffa009bd68>] free_fs_root+0x18/0x90 [btrfs]
 [<ffffffffa009f1ab>] btrfs_free_fs_root+0x7b/0x90 [btrfs]
 [<ffffffffa009f26f>] del_fs_roots+0xaf/0xf0 [btrfs]
 [<ffffffffa00a0bc6>] close_ctree+0x1c6/0x300 [btrfs]
 [<ffffffff811b6a7c>] ? evict_inodes+0xec/0x100
 [<ffffffffa00763a4>] btrfs_put_super+0x14/0x20 [btrfs]
 [<ffffffff8119dfcc>] generic_shutdown_super+0x5c/0xe0
 [<ffffffff8119e0e1>] kill_anon_super+0x11/0x20
 [<ffffffffa007a3a5>] btrfs_kill_super+0x15/0x90 [btrfs]
 [<ffffffff8119f111>] ? deactivate_super+0x41/0x70
 [<ffffffff8119e4dd>] deactivate_locked_super+0x3d/0x70
 [<ffffffff8119f119>] deactivate_super+0x49/0x70
 [<ffffffff811ba772>] mntput_no_expire+0xd2/0x130
 [<ffffffff811bb621>] sys_umount+0x71/0x390
 [<ffffffff81983012>] system_call_fastpath+0x16/0x1b
Code: 48 83 c4 08 5b 5d c3 66 66 66 66 2e 0f 1f 84 00 00 00 00 00 55 48 89 e5 48 83 ec 20 48 89 5d e8 4c 89 65 f0 48 89 fb 4c 89 6d f8 <4c> 8b 42 08 49 89 f5 49 89 d4 49 39 f0 75 31 4d 8b 45 00 4d 39
RIP  [<ffffffff814532b7>] __list_add+0x17/0xd0
 RSP <ffff8802f67a1bd8>
CR2: ffff88042503b830
---[ end trace 5e44f1afc74751aa ]---

--
To unsubscribe from this list: send the line "unsubscribe linux-btrfs" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

[V5] Btrfs: snapshot-aware defrag

Commit Message

Comments

Patch