All the mail mirrored from lore.kernel.org
 help / color / mirror / Atom feed
* 2.6.25-rc8-mm2
@ 2008-04-11  3:33 Andrew Morton
  2008-04-11  6:28 ` 2.6.25-rc8-mm2: boot hang after "ACPI: using IOAPIC for interrupt routing" Alexey Dobriyan
                   ` (11 more replies)
  0 siblings, 12 replies; 71+ messages in thread
From: Andrew Morton @ 2008-04-11  3:33 UTC (permalink / raw
  To: linux-kernel


ftp://ftp.kernel.org/pub/linux/kernel/people/akpm/patches/2.6/2.6.25-rc8/2.6.25-rc8-mm2/

- Compilation is busted on mips due to the page-flags patches

- Compilation is busted on sparc64 due to the page-flags patches

- Compilation is partially busted on powerpc due to the page-flags patches. 
  It compiles for my g5 but allmodconfig fails.

- git-drm is dropped due to build errors

- git-xfs is dropped due to failure to get a clean git diff

- git-slub has been temporarily replaced by git-pekka.  Pekka is standing in
  while Christoph is away.

- git-kvm is dropped due to clashes with git-s390

- Many patches which weren't in 2.6.25-rc8-mm1's git-x86.patch and
  git-sched.patch have now (belatedly) been introduced.

  x86 works for me, but Rafael is reporting some crashes.



Boilerplate:

- See the `hot-fixes' directory for any important updates to this patchset.

- To fetch an -mm tree using git, use (for example)

  git-fetch git://git.kernel.org/pub/scm/linux/kernel/git/smurf/linux-trees.git tag v2.6.16-rc2-mm1
  git-checkout -b local-v2.6.16-rc2-mm1 v2.6.16-rc2-mm1

- -mm kernel commit activity can be reviewed by subscribing to the
  mm-commits mailing list.

        echo "subscribe mm-commits" | mail majordomo@vger.kernel.org

- If you hit a bug in -mm and it is not obvious which patch caused it, it is
  most valuable if you can perform a bisection search to identify which patch
  introduced the bug.  Instructions for this process are at

        http://www.zip.com.au/~akpm/linux/patches/stuff/bisecting-mm-trees.txt

  But beware that this process takes some time (around ten rebuilds and
  reboots), so consider reporting the bug first and if we cannot immediately
  identify the faulty patch, then perform the bisection search.

- When reporting bugs, please try to Cc: the relevant maintainer and mailing
  list on any email.

- When reporting bugs in this kernel via email, please also rewrite the
  email Subject: in some manner to reflect the nature of the bug.  Some
  developers filter by Subject: when looking for messages to read.

- Occasional snapshots of the -mm lineup are uploaded to
  ftp://ftp.kernel.org/pub/linux/kernel/people/akpm/mm/ and are announced on
  the mm-commits list.  These probably are at least compilable.

- More-than-daily -mm snapshots may be found at
  http://userweb.kernel.org/~akpm/mmotm/.  These are almost certainly not
  compileable.



Changes since 2.6.25-rc8-mm1:

 origin.patch
 git-acpi.patch
 git-x86.patch
 git-kgdb-light.patch
 git-alsa-tiwai.patch
 git-agpgart.patch
 git-arm.patch
 git-avr32.patch
 git-cifs.patch
 git-cpufreq.patch
 git-powerpc.patch
 git-dvb.patch
 git-hwmon.patch
 git-gfs2-nmw.patch
 git-dlm.patch
 git-hid.patch
 git-hrt.patch
 git-ia64.patch
 git-ieee1394.patch
 git-infiniband.patch
 git-input.patch
 git-jfs.patch
 git-leds.patch
 git-libata-all.patch
 git-async-tx.patch
 git-mips.patch
 git-mmc.patch
 git-mtd.patch
 git-ubi.patch
 git-udf.patch
 git-net.patch
 git-nfsd.patch
 git-ocfs2.patch
 git-parisc.patch
 git-selinux.patch
 git-security-testing.patch
 git-s390.patch
 git-sched.patch
 git-sh.patch
 git-scsi-misc.patch
 git-block.patch
 git-sparc64.patch
 git-unionfs.patch
 git-v9fs.patch
 git-vfs.patch
 git-watchdog.patch
 git-cryptodev.patch
 git-xtensa.patch
 git-pekka.patch
 git-semaphore.patch
 git-semaphore-vs-git-x86.patch
 git-semaphore-git-rejects.patch

 git trees

-kvm-provide-kvmh-for-all-architecture-fixes-headers_install.patch
-atmel_serial-avoid-stopping-pdc-during-transmission.patch
-atmel_serial-fix-uart-console-concurrent-access.patch
-alpha-fix-alsa-dma-mmap-crash.patch
-fix-typos-in-documentation-unaligned-memory-accesstxt.patch
-doc-fix-typo-in-submittingpatches.patch
-nbd-prevent-sock_xmit-from-attempting-to-use-a-null-socket.patch
-char-ip2-fix-sparse-warnings.patch
-char-rio-fix-sparse-warnings.patch
-block-fix-the-setting-of-the-bounce-limit.patch
-efs-update-error-msg-to-not-refer-to-deleted-read_inode.patch
-vmcoreinfo-add-the-symbol-phys_base.patch
-markers-use-synchronize_sched.patch
-alpha-get_current-dont-add-zero-to-current_thread_info-task.patch
-lib-swiotlbc-avoid-endless-loops-fix.patch
-git-x86-fix-printk-warning.patch
-x86-remove-redundant-display-of-free-swap-space-in-show_mem.patch
-x86_64-do-not-reserve-ramdisk-two-times.patch
-x86-handle_vm86_trap-cleanup.patch
-i386-arch-x86-math-emu-fpu_entryc-warning-fix.patch
-x86_64-ia32-ptrace-vs-enosys.patch
-x86_64-ptrace-vs-enosys.patch
-x86_64-ia32-ptrace-vs-enosys-sysenter-syscall.patch
-x86_32-fix-visualws-and-voyager-kexec-build-failures.patch
-pm-remove-legacy-pm-fix.patch
-scsi-add-iscsi-ibft-support.patch
-dvb-usb-leadtek-winfast-usb-dtv-dongle-support.patch
-radio-cadet-wrap-pnp-probe-code-in-ifdef-config_pnp.patch
-video-test-cmd-not-definition-in-decoder_command-drivers-media-video-zoran_devicec.patch
-bttv-bt832-fix-possible-null-pointer-deref.patch
-bttv-fix-missed-index-check.patch
-jdelvare-i2c-i2c-bfin-twi-04-driver-description.patch
-ia64-fix-getpid-and-set_tid_address-fast-system-calls-for-pid-namespaces.patch
-kprobes-kprobe-booster-for-ia64.patch
-ia64-make-iommu-respect-the-segment-boundary-limits.patch
-ia64-remove-redundant-display-of-free-swap-space-in-show_mem.patch
-arch-ia64-kernel-use-time_-macros.patch
-xpad-match-xbox-360-devices-with-interface-info.patch
-git-libata-fix-config_pci=n-build.patch
-libata-isolate-and-rework-cable-logic.patch
-ata-swncq-should-be-enabled-by-default.patch
-net-mac80211-debugfs_netdevc-use-of-bool-triggers-a-gcc-bug.patch
-lapb-use-the-shorter-list_head-form-for-brevity.patch
-bluetooth-replace-deprecated-rw_lock_unlocked-macros.patch
-usb-net-asix-does-not-really-need-10-100mbit.patch
-forcedeth-fix-locking-bug-with-netconsole.patch
-tulip-tulip_read_eeprom-fixes-for-bug-4420.patch
-execute-tasklets-in-the-same-order-they-were-queued.patch
-git-sched-schedc-needs-tickh.patch
-scsi-aic94xx-cleanups.patch
-drivers-scsi-hptiopc-fix-build-warning.patch
-scsi-chc-fix-sparse-shadowed-variable-warnings.patch
-sparc32-delete-bogus-js-rtc-records.patch
-gregkh-usb-ub-drop-bug-and-update-small-things.patch
-usb-gadget-dummy_hcdc-fix-nested-switch-statements.patch
-drivers-net-usb-pegasusc-remove-in_atomic-check.patch
-usb-mem-leak-fixes-for-amd-5536-udc-high-full-speed-usb-device-controller-driver.patch
-usb-u132-hcd-driver-style-clean-up.patch
-usb-u132-hcd-driver-semaphore-to-mutex.patch
-usb-ohci-fix-2-timers-to-fire-at-jiffies-1s.patch
-rusty-module-fix_verify_export_symbols.patch
-mac80211-fix-deadlocks-in-debugfs_netdevc.patch
-r8169-fix-irqx-nobody-cared-for-shared-irq-with-intx.patch
-r8169-fix-irqx-nobody-cared-for-shared-irq-with-intx-checkpatch-fixes.patch
-arch-um-os-linux-mainc-fix-memory-leaks.patch
-arch-um-os-linux-mainc-fix-memory-leaks-checkpatch-fixes.patch
-asm-generic-add-node_to_cpumask_ptr-macro.patch
-rcu_batches_completed-prototype-cleanup.patch
-cgroups-add-cgroup-support-for-enabling-controllers-at-boot-time.patch
-cgroups-add-cgroup-support-for-enabling-controllers-at-boot-time-fix-boot-option-parsing.patch
-memory-controller-make-memory-resource-control-aware-of-boot-options.patch
-ipmi-change-device-node-ordering-to-reflect-probe-order.patch
-make-module_sect_attrs-private-to-kernel-modulec-checkpatch-fixes.patch
-clocksource-keep-track-of-original-clocksource-frequency.patch
-clocksource-introduce-clock_monotonic_raw.patch
-clocksource-introduce-clock_monotonic_raw-fix.patch
-clocksource-introduce-clock_monotonic_raw-fix-checkpatch-fixes.patch
-i386-arch-x86-math-emu-reg_ld_strc-fix-warning.patch

 Merged into mainline or a subsystem tree

+cgroups-include-hierarchy-ids-in-proc-pid-cgroup.patch
+rtc-rtc-s35390ac-needs-the-bitreverse-library.patch
+eventfd-kaio-integration-fix.patch
+i2c-fix-platform-driver-hotplug-coldplug.patch
+spi-fix-platform-driver-hotplug-coldplug.patch
+usb-gadget-fix-platform-driver-hotplug-coldplug.patch
+usb-host-fix-platform-driver-hotplug-coldplug.patch
+watchdog-fix-platform-driver-hotplug-coldplug.patch
+rtc-fix-platform-driver-hotplug-coldplug.patch
+cciss-error-implicit-declaration-of-function-sg_init_table.patch
+hfs-fix-unlink-of-links.patch
+md-close-a-livelock-window-in-handle_parity_checks5.patch
+pnp-increase-number-of-devices-supported-per-protocol.patch
+signalfd-fix-for-incorrect-si_queue-user-data-reporting.patch
+lzo-fix-typo-in-decompresor.patch
+generic_file_splice_read-fix-lockups.patch

 2.6.25 queue (some already merged)

+acpi-unneccessary-to-scan-the-pci-bus-already-scanned.patch
+revert-x86-fix-call-to-set_cyc2ns_scale-from-time_cpufreq_notifier.patch

 git-acpi.patch

+git-acpi-revert-suspend-wakeup-code-in-c.patch

 Revert this so that git-x86 applies

+acpi-fix-fadt-parsing.patch
+acpi-adjust-register-handling.patch
+acpi-adjust-_acpi_modulefunction_name-definitions.patch
+acpi-use-__init-on-everything-in-tables-tbfadtc.patch

 ACPI stuff

+revert-include-asm-x86-i387h-checkpatch-cleanups-formatting-only.patch
+git-x86-revert-x86-fpu-lazy-allocation-of-fpu-area-v5.patch
+git-x86-revert-x86-fpu-split-fpu-state-from-task-struct-v5.patch
+arch-x86-mm-ioremapc-fix-printk-warnings.patch
+git-x86-fix-warning-in-arch-x86-kernel-vsmp_64c.patch

 Fix git-x86

+x86_64-do-not-reserve-ramdisk-two-times.patch

 x86 fix
 
+git-x86-hack-around-to-make-git-kgdb-build.patch
+git-x86-vs-git-kgdb-lite.patch

 Fix git-kgdb-light.patch

+git-alsa-tiwai-hda_codec-fix-locking.patch

 Fix git-alsa-tiwai.patch

+es1968-fix-jitter-on-some-maestro-cards.patch
+es1968-fix-jitter-on-some-maestro-cards-checkpatch-fixes.patch
+sound-pci-rme9652-hdspmc-stop-inlining-largish-static-functions.patch

 alsa things

+gregkh-driver-kobject-replace-list_for_each-with-list_for_each_entry.patch
+gregkh-driver-pm-remove-legacy-pm.patch
+gregkh-driver-firmware-add-iscsi-ibft-support.patch
-gregkh-driver-driver-core-remove-no-longer-used-struct-class_device.patch

 driver tree updates

+sysfs-refill-attribute-buffer-when-reading-from-offset-0.patch
+pm-introduce-new-top-level-suspend-and-hibernation-callbacks-rev-7.patch
+pm-introduce-new-top-level-suspend-and-hibernation-callbacks-rev-7-fix.patch
+pm-new-suspend-and-hibernation-callbacks-for-platform-bus-type-rev-3.patch
+pm-new-suspend-and-hibernation-callbacks-for-pci-bus-type-rev-3.patch
+pm-new-suspend-and-hibernation-callbacks-for-pci-bus-type-rev-3-fix.patch

 driver/PM updates

+v4l-common-replace-remaining-__function__-occurences.patch
+v4l-dvb-replace-remaining-__function__-occurences.patch
+v4l-video-replace-remaining-__function__-occurences.patch

 v4l things

+jdelvare-i2c-i2c-superh-mobile-i2c-bus-controller-v5.patch

 I2C tree update

+git-dlm-vs-git-gfs2-nwm.patch

 Fix git-dlm.patch

+time-close-small-window-for-vsyscall-time-inconsistencies.patch
+time-close-small-window-for-vsyscall-time-inconsistencies-checkpatch-fixes.patch

 stuff for git-hrt

+gregkh-driver-ib-convert-struct-class_device-to-struct-device-vs-git-infiniband.patch

 Fix infiniband tree versus the driver tree

-git-kvm-fixup.patch
-git-kvm-vs-git-s390.patch

 git-kvm.patch is dropped

+git-libata-all-unbork-drivers-ata-sata_sx4c.patch

 Fix git-libata-all.

-pata_amd-fix-sparse-warning.patch

 Folded into pata_atiixp-simplex-clear.patch

+pata_atiixp-simplex-clear-fix.patch

 Unbreak pata_atiixp-simplex-clear.patch

+ide-mm-ide-make-32-bit-io-support-for-ide-vlb-available-only-when-it-makes-sense.patch
+ide-mm-ide-add-struct-ide_io_ports.patch
+ide-mm-ide-tape-improve-buffer-allocation-strategy.patch
+ide-mm-ide-tape-mv-tape-stage_size-tape-buffer_size.patch
+ide-mm-ide-tape-mv-tape-pages_per_stage-tape-pages_per_buffer.patch
+ide-mm-ide-tape-improve-buffer-pages-freeing-strategy.patch
+ide-mm-ide-remove-_proc_ide_ali.patch
+ide-mm-ide-remove-now-unused-ide_pci_create_host_proc.patch
+ide-mm-ide-tape-make-__idetape_discard_read_pipeline-of-type-void.patch
+ide-mm-ide-tape-mv-idetape_discard_read_pipeline-ide_tape_discard_merge_buffer.patch
+ide-mm-ide-tape-mv-idetape_empty_write_pipeline-ide_tape_flush_merge_buffer.patch
+ide-mm-ide-tape-mv-tape-merge_stage_size-tape-merge_bh_size.patch
+ide-mm-ide-tape-remove-tape-merge_stage.patch
+ide-mm-ide-tape-fix-mem-leak.patch
+ide-mm-falconide-q40ide-add-atapi_-put_bytes-and-ata_-put_data-methods-take-2.patch
+ide-mm-ide-merge-atapi_-put_bytes-and-ata_-put_data-methods.patch
+ide-mm-scc_pata-add-in-out-put_data-methods.patch
+ide-mm-au1xxx-ide-add-in-out-put_data-methods.patch
+ide-mm-ide-h8300-add-in-out-put_data-methods.patch
+ide-mm-ide-add-ide_hflag_mmio-host-flag-take-2.patch
+ide-mm-ide-remove-inswl-and-outswl-methods.patch
+ide-mm-siimage-do-clocking-register-posting-earlier-in-setup_mmio_siimage.patch
+ide-mm-siimage-add-sil_-io-ops.patch
+ide-mm-siimage-remove-proc_reports_siimage.patch

 IDE tree updates

+jffs2-summary-allocation-dont-use-vmalloc.patch

 jffs2 fix (not mergeable)

+sc92031-use-net_device-stats.patch
+ehea-fix-dlpar-memory-add-support.patch
+ehea-fix-dlpar-memory-add-support-fix.patch
+sundance-set-carrier-status-on-link-change-events.patch
+sundance-set-carrier-status-on-link-change-events-checkpatch-fixes.patch
+forcedeth-new-backoff-implementation.patch
+forcedeth-new-backoff-implementation-update.patch

 netdev stuff

+gregkh-pci-pci-update-via-cx700-quirk.patch
+gregkh-pci-pci-aerdrv_acpic-remove-unneeded-null-check.patch
+gregkh-pci-pci-clean-up-resource-alignment-management.patch

 PCI tree updates

-git-s390-git-rejects.patch

 Unneeded

+sh-arch-sh-kernel-traps_32c-needs-asm-fpuh.patch
+sh-export-empty_zero_page.patch

 sh fixes

+cfq-iosched-do-not-leak-ioc_data-across-iosched-switches.patch

 CFQ fix

+sparc32-export-empty_zero_page.patch

 sparc32 fix

+gregkh-usb-usb-add-usb_dt_cs_radio_control-define-to-ch9h.patch
+gregkh-usb-usb-serial-sierra-clean-up.patch
+gregkh-usb-usb-serial-sierra-c597-fix.patch
+gregkh-usb-u132-hcd-fix-checkpatch-style-issues.patch
+gregkh-usb-u132-hdc-semaphore-to-mutex.patch
+gregkh-usb-usb-fix-comments-of-2-functions-in-hcdc.patch
+gregkh-usb-usb-serial-remove-obsolete-contact-addresses.patch
+gregkh-usb-usb-test-for-null-return-from-platform_get_resource-in-ohci_hcd_sm501_drv_remove.patch
+gregkh-usb-usb-g_printer-bugfixes.patch
+gregkh-usb-usb-usb-storage-use-adaptive-dma-mask.patch
+gregkh-usb-usb-mem-leak-fixes-for-amd-5536-udc-high-full-speed-usb-device-controller-driver.patch
+gregkh-usb-usb-gadget-dummy_hcdc-fix-nested-switch-statements.patch
+gregkh-usb-wusb-add-authenticated-bit-to-usb-dev.patch
+gregkh-usb-usb-add-usb_dev_reset_delayed.patch
+gregkh-usb-wusb-add-link-wusb-usb-dev.patch
+gregkh-usb-wusb-teach-choose-address-about-wireless-devices.patch
+gregkh-usb-wusb-devices-dont-use-a-set-address.patch
+gregkh-usb-wusb-make-ep0-reinit-available.patch
+gregkh-usb-wusb-update-devnum.patch
+gregkh-usb-wusb-documentation.patch
+gregkh-usb-bitmap-add-bitmap_copy_le.patch
+gregkh-usb-wusb-add-usb-specific-wusb-include-files.patch
+gregkh-usb-wusb-add-the-uwb-include-files.patch

 USB tree updates

+usb-asix-add-buffalo-lua-u2-gt-10-100-1000.patch

 USB device support

+mm-only-b43-rewrite-lo-calibration.patch

 b43 testing patch

-git-cryptodev-fixup.patch

 Unneeded

+git-cryptodev-misplaced-hunk.patch
+git-cryptodev-fix-conflict.patch
+git-cryptodev-fix-conflict-hack.patch

 git-cryptodev repairs

+rusty-virtio-broken-virtqueues.patch
+rusty-module-find_symbol-cleanup.patch
+rusty-module-use-unused_gpl_crcs.patch
+rusty-module-more-checking-in-verify_export_symbols.patch
+rusty-module-markers_support_for_proprierary_s.patch
+rusty-lguest-avoid-nr_cpus-for-number-of-cpus.patch
+rusty-lguest-remove-bogus-null-cpu-check.patch
+rusty-virtio-fix_sparse_return_void_valued_expression_warnings.patch

 Rustytree updates

+make-module_sect_attrs-private-to-kernel-modulec-checkpatch-fixes.patch

 Some stranded patch

-git-slub.patch

 Dropped

-git-slub-git-rejects.patch

 Unneeded

+git-pekka.patch

 New slub tree

+git-semaphore-vs-git-x86.patch

 Repair git-semaphore.patch

+ext4-is-busted-on-m68k.patch

 ext4 doesn't build on m68k

+acpi-bus-check-once-more-for-an-empty-list-after-locking-it.patch
+acpi-thermal-trip-points-increased-to-12.patch
+sc92031-sysfs-link-missing.patch
+usb-fix-interface-deregistration-during-unbind-operations.patch

 Fixes for 2.6.25 (I think)

-mm-make-mem_map-allocation-continuous.patch
-mm-make-mem_map-allocation-continuous-checkpatch-fixes.patch
+mm-make-mem_map-allocation-continuous-v2.patch

 Replaced p[atch

-mm-allocate-section_map-for-sparse_init.patch
-mm-allocate-section_map-for-sparse_init-update.patch
-mm-allocate-section_map-for-sparse_init-update-fix.patch

 These got moved into mm-make-mem_map-allocation-continuous-v2.patch

+mm-get-rid-of-__zone_count.patch
+pageflags_extended-and-separate-page-flags-for-head-and-tail.patch

 Update page-flags patches in -mm.

-s390-remove-struct-page-entries-for-dcss-memory-segments.patch

 Accidentally dropped.  Will come back.

+mempolicy-rename-mpol_free-to-mpol_put.patch
+mempolicy-rename-mpol_copy-to-mpol_dup.patch
+mempolicy-write-lock-mmap_sem-while-changing-task-mempolicy.patch
+mempolicy-fixup-fallback-for-default-shmem-policy.patch
+mempolicy-rename-struct-mempolicy-policy-member-to-mode.patch
+mempolicy-mark-shared-policies-for-unref.patch
+mempolicy-document-setget_policy-vm_ops-apis.patch
+mempolicy-rework-mempolicy-reference-counting.patch
+mempolicy-use-mpol_preferred-for-system-wide-default-policy.patch
+mempolicy-mpol_preferred-cleanups-for-local-allocation.patch
+mempolicy-use-mpol_f_local-to-indicate-preferred-local-policy.patch
+mempolicy-clean-up-mpol-to-str-mempolicy-formatting.patch
+mempolicy-rework-shmem-mpol-parsing-and-display.patch
+mempolicy-support-mpol=local-tmpfs-mount-option.patch
+mempolicy-use-struct-mempolicy-pointer-in-shmem_sb_info.patch
+mempolicy-use-struct-mempolicy-pointer-in-shmem_sb_info-fix.patch
+mempolicy-use-struct-mempolicy-pointer-in-shmem_sb_info-fix-fix.patch
+mempolicy-use-struct-mempolicy-pointer-in-shmem_sb_info-fix-fix-fix.patch

 mempolicy updates

+hugetlbfs-architecture-header-cleanup.patch
+hugetlbfs-add-missing-tlb-flush-to-hugetlb_cow.patch
+hugetlbfs-common-code-update-for-s390.patch

 hugetlbfs updates

+memory-hotplug-register-section-node-id-to-free.patch
+memory-hotplug-align-memmap-to-page-size.patch
+memory-hotplug-make-alloc_bootmem_section.patch
+memory-hotplugallocate-usemap-on-the-section-with-pgdat.patch
+memory-hotplug-free-memmaps-allocated-by-bootmem.patch

 memory hotplug updates

+oom_kill-remove-unused-parameter-in-badness.patch

 cleanup

+frv-handle-update_mmu_cache-being-called-when-current-mm-is-null.patch
+frv-move-stack_top_max-up.patch
+frv-add-support-for-emulation-of-userspace-atomic-ops.patch
+frv-make-nommu-mode-work-with-base-addresses-other-than-0xc0000000.patch
+frv-dont-make-smp_r-w-mb-interpolate-membar-when-config_smp=n.patch

 FRV updates

+gen_initramfs_listsh-fix-unportability.patch
+update-checkpatchpl-to-version-017.patch
+swiotlb-use-iommu_is_span_boundary-helper-function.patch

 Misc

+spi-spi_imx-updates.patch

 SPI updates

+ecryptfs-add-missing-lock-around-notify_change.patch

 ecryptfs updates

+gpiochip_reserve-fix-2.patch

 Fix gpiochip_reserve.patch some more

+gxfb-add-power-management-functionality-update.patch

 Update gxfb-add-power-management-functionality-fix.patch

+pxafb-introduce-lcd_readwritel-to-wrap-the-__raw_readwritel.patch
+pxafb-use-completion-for-lcd-disable-wait-code.patch
+pxafb-move-parallel-lcd-timing-setup-into-dedicate-function.patch
+pxafb-preliminary-smart-panel-interface-support.patch
+pxafb-preliminary-smart-panel-interface-support-update.patch
+lxfb-extend-pll-table-to-support-dotclocks-below-25-mhz.patch
+lxfb-add-power-management-functionality-update.patch
+lxfb-gxfb-when-blanking-with-fb_blank_powerdown-also-turn-off-the-crt.patch
+fbdev-metronomefb-bugfix-v4.patch
+fbdev-hecubafb-bugfix-v4.patch
+fbdev-platforming-metronomefb-and-am200epd-v4.patch
+fbdev-platforming-hecubafb-and-n411-v4.patch

 fbdev updates

+ext3-fix-hang-on-umount-with-quotas-when-journal-is-aborted.patch

 ext3 fix

+reiserfs-fix-hang-on-umount-with-quotas-when-journal-is-aborted.patch

 reiserfs fix

+docbook-some-kernel-locking-fixes.patch
+kernel-doc-detect-prevent-duplicate-doc-section-names.patch
+kernel-doc-detect-trailing-kernel-doc-line-trash.patch
+documentation-build-source-files-in-documentation-sub-dir.patch
+documentation-build-source-files-in-documentation-sub-dir-update.patch
+documentation-build-source-files-in-documentation-sub-dir-disable.patch

 Documentation updates

+cgroups-use-a-hash-table-for-css_set-finding.patch
+cgroups-simplify-init_subsys.patch
+cgroups-remove-the-css_set-linked-list.patch
+cgroups-introduce-cft-read_seq-v2.patch

 cgroups udpates

+memcg-remove-redundant-function-calls.patch
+memcgroup-make-the-memory-controller-more-desktop-responsive.patch

 memory controller updates

+signals-add-set_restore_sigmask.patch
+signals-set_restore_sigmask-tif_sigpending.patch
+signals-s390-renumber-tif_restore_sigmask.patch
+signals-ia64-renumber-tif_restore_sigmask.patch
+signals-use-have_set_restore_sigmask.patch
+signals-x86-ts_restore_sigmask.patch

 More signals work.

-ext4-is-broken.patch

 Dropped

+ext4-fix-hang-on-umount-with-quotas-when-journal-is-aborted.patch

 ext4 fix

+doc-fix-dma-api-function-parameters.patch

 DMA API update

+tty-serial-lay-the-foundations-for-the-next-set-of-reworks.patch
+usb-serial-more-fixes-and-groundwork-for-tty-changes.patch

 TTY updates

+proc-switch-proc-driver-ray_cs-ray_cs-to-seq_file-interface.patch
+proc-switch-proc-driver-radio-typhoon-to-seq_file-interface.patch
+proc-switch-proc-bus-zorro-devices-to-seq_file-interface.patch
+proc-switch-proc-apm-to-seq_file-interface.patch
+proc-remove-proc-mac_iop.patch
+proc-switch-proc-bus-ecard-devices-to-seq_file-interface.patch
+proc-switch-proc-excite-unit_id-to-seq_file-interface.patch
+proc-switch-proc-irda-irnet-to-seq_file-interface.patch
+proc-convert-proc-bus-nubus-to-seq_file-interface.patch
+proc-switch-proc-ip2mem-to-seq_file-interface.patch
+proc-switch-proc-scsi-device_info-to-seq_file-interface.patch
+proc-remove-get_info-infrastructure.patch

 procfs work

+nbd-delete-superfluous-test-for-__gnuc__.patch

 nbd cleanup

+printk-dont-read-beyond-string-arguments-terminating-zero.patch

 printk fixlet

+edac-remove-unneeded-functions-and-add-static-accessor.patch
+edac-fix-module-initialization-on-several-modules-2nd-time.patch

 EDAC updates

-linuxpps-core-support.patch
-pps-userland-header-file-for-pps-api.patch
-pps-documentation-programs-and-examples.patch
-pps-linuxpps-clients-support.patch
-pps-serial-clients-support.patch
-pps-example-program-to-enable-pps-support-on-serial-ports.patch
-pps-parallel-port-clients-support.patch

 Dropped, keeps being updated

+afs-the-afs-rpc-op-cbgetcapabilities-is-actually-cbtellmeaboutyourself.patch
+afs-the-afs-rpc-op-cbgetcapabilities-is-actually-cbtellmeaboutyourself-try-3.patch

 AFS updates

+modify-ramdisk-brd-device-to-be-able-to-manage-partitions.patch

 ramdisk fix

+sysv-fs-remove-superfluous-check-for-__gnuc__-compiler.patch

 sysvfs cleanup

+remove-__kernel__-tests-of-unexported-headers-under-asm-generic.patch
+drop-the-exporting-of-empty-linux-byteorder-generich.patch

 cleanups

+notify_change-callers-must-hold-i_mutex.patch

 debug check

+likely_prof-update-to-test_and_set_bit_lock-clear_bit_unlock.patch

 Update -mm's likeliness profiling patch.



5191 commits in 1682 patch files

All patches:

ftp://ftp.kernel.org/pub/linux/kernel/people/akpm/patches/2.6/2.6.25-rc8/2.6.25-rc8-mm2/patch-list



^ permalink raw reply	[flat|nested] 71+ messages in thread

* 2.6.25-rc8-mm2: boot hang after "ACPI: using IOAPIC for interrupt routing"
  2008-04-11  3:33 2.6.25-rc8-mm2 Andrew Morton
@ 2008-04-11  6:28 ` Alexey Dobriyan
  2008-04-11  6:29 ` Alexey Dobriyan
                   ` (10 subsequent siblings)
  11 siblings, 0 replies; 71+ messages in thread
From: Alexey Dobriyan @ 2008-04-11  6:28 UTC (permalink / raw
  To: Andrew Morton; +Cc: linux-kernel

On Thu, Apr 10, 2008 at 08:33:54PM -0700, Andrew Morton wrote:
> - git-slub has been temporarily replaced by git-pekka.  Pekka is standing in
>   while Christoph is away.

I bisected boot hang after "ACPI: using IOAPIC for interrupt routing"
down to git-pekka.

normal dmesg from 2.6.25-rc8-something and .config snippets for -mm

# CONFIG_SLUB_DEBUG is not set		(hey, I thought this was on!)
CONFIG_SLUB=y
# CONFIG_SLUB_STATS is not set

# CONFIG_DEBUG_DRIVER is not set
# CONFIG_DEBUG_DEVRES is not set
# CONFIG_DEBUG_FS is not set
CONFIG_DEBUG_KERNEL=y
CONFIG_DEBUG_SHIRQ=y
CONFIG_DEBUG_RT_MUTEXES=y
CONFIG_DEBUG_PI_LIST=y
CONFIG_DEBUG_SPINLOCK=y
CONFIG_DEBUG_MUTEXES=y
CONFIG_DEBUG_LOCK_ALLOC=y
# CONFIG_DEBUG_LOCKDEP is not set
CONFIG_DEBUG_SPINLOCK_SLEEP=y
# CONFIG_DEBUG_LOCKING_API_SELFTESTS is not set
# CONFIG_DEBUG_KOBJECT is not set
CONFIG_DEBUG_BUGVERBOSE=y
# CONFIG_DEBUG_INFO is not set
CONFIG_DEBUG_VM=y
CONFIG_DEBUG_WRITECOUNT=y
CONFIG_DEBUG_LIST=y
CONFIG_DEBUG_SG=y
# CONFIG_DEBUG_STACKOVERFLOW is not set
# CONFIG_DEBUG_STACK_USAGE is not set
CONFIG_DEBUG_PAGEALLOC=y
CONFIG_DEBUG_PER_CPU_MAPS=y
CONFIG_DEBUG_RODATA=y
# CONFIG_DEBUG_RODATA_TEST is not set
# CONFIG_DEBUG_NX_TEST is not set

P.S.: I now suspect that one bisection point was wrong, however,
      make-module_sect_attrs-private-to-kernel-modulec-checkpatch-fixes
      was definitely good.


[    0.000000] Linux version 2.6.25-rc8 (ad@martell) (gcc version 4.1.2 (Gentoo 4.1.2 p1.0.2)) #2 SMP Fri Apr 11 00:54:03 MSD 2008
[    0.000000] Command line: root=/dev/sda2 netconsole=@192.168.0.1/eth0,9353@192.168.0.42/00:1b:38:af:22:49 ignore_loglevel
[    0.000000] BIOS-provided physical RAM map:
[    0.000000]  BIOS-e820: 0000000000000000 - 000000000009fc00 (usable)
[    0.000000]  BIOS-e820: 000000000009fc00 - 00000000000a0000 (reserved)
[    0.000000]  BIOS-e820: 00000000000e4000 - 0000000000100000 (reserved)
[    0.000000]  BIOS-e820: 0000000000100000 - 000000007ff90000 (usable)
[    0.000000]  BIOS-e820: 000000007ff90000 - 000000007ff9e000 (ACPI data)
[    0.000000]  BIOS-e820: 000000007ff9e000 - 000000007ffe0000 (ACPI NVS)
[    0.000000]  BIOS-e820: 000000007ffe0000 - 0000000080000000 (reserved)
[    0.000000]  BIOS-e820: 00000000fee00000 - 00000000fee01000 (reserved)
[    0.000000]  BIOS-e820: 00000000ffb00000 - 0000000100000000 (reserved)
[    0.000000]  BIOS-e820: 0000000100000000 - 0000000180000000 (usable)
[    0.000000] debug: ignoring loglevel setting.
[    0.000000] Entering add_active_range(0, 0, 159) 0 entries of 256 used
[    0.000000] Entering add_active_range(0, 256, 524176) 1 entries of 256 used
[    0.000000] Entering add_active_range(0, 1048576, 1572864) 2 entries of 256 used
[    0.000000] end_pfn_map = 1572864
[    0.000000] DMI 2.4 present.
[    0.000000] ACPI: RSDP 000FA980, 0024 (r2 ACPIAM)
[    0.000000] ACPI: XSDT 7FF90100, 0054 (r1 KOZIRO FRONTIER  2000707 MSFT       97)
[    0.000000] ACPI: FACP 7FF90290, 00F4 (r3 MSTEST OEMFACP   2000707 MSFT       97)
[    0.000000] ACPI: DSDT 7FF905C0, 8FA9 (r1  A0637 A0637000        0 INTL 20060113)
[    0.000000] ACPI: FACS 7FF9E000, 0040
[    0.000000] ACPI: APIC 7FF90390, 006C (r1 MSTEST OEMAPIC   2000707 MSFT       97)
[    0.000000] ACPI: MCFG 7FF90400, 003C (r1 MSTEST OEMMCFG   2000707 MSFT       97)
[    0.000000] ACPI: SLIC 7FF90440, 0176 (r1 KOZIRO FRONTIER  2000707 MSFT       97)
[    0.000000] ACPI: OEMB 7FF9E040, 007B (r1 MSTEST AMI_OEM   2000707 MSFT       97)
[    0.000000] ACPI: HPET 7FF99570, 0038 (r1 MSTEST OEMHPET   2000707 MSFT       97)
[    0.000000] Entering add_active_range(0, 0, 159) 0 entries of 256 used
[    0.000000] Entering add_active_range(0, 256, 524176) 1 entries of 256 used
[    0.000000] Entering add_active_range(0, 1048576, 1572864) 2 entries of 256 used
[    0.000000] early res: 0 [0-fff] BIOS data page
[    0.000000] early res: 1 [6000-7fff] SMP_TRAMPOLINE
[    0.000000] early res: 2 [200000-5eead3] TEXT DATA BSS
[    0.000000] early res: 3 [9fc00-a0bff] EBDA
[    0.000000] early res: 4 [8000-dfff] PGTABLE
[    0.000000]  [ffffe20000000000-ffffe200001fffff] PMD ->ffff810001200000 on node 0
[    0.000000]  [ffffe20000200000-ffffe200003fffff] PMD ->ffff810001600000 on node 0
[    0.000000]  [ffffe20000400000-ffffe200005fffff] PMD ->ffff810001a00000 on node 0
[    0.000000]  [ffffe20000600000-ffffe200007fffff] PMD ->ffff810001e00000 on node 0
[    0.000000]  [ffffe20000800000-ffffe200009fffff] PMD ->ffff810002200000 on node 0
[    0.000000]  [ffffe20000a00000-ffffe20000bfffff] PMD ->ffff810002600000 on node 0
[    0.000000]  [ffffe20000c00000-ffffe20000dfffff] PMD ->ffff810002a00000 on node 0
[    0.000000]  [ffffe20000e00000-ffffe20000ffffff] PMD ->ffff810002e00000 on node 0
[    0.000000]  [ffffe20001000000-ffffe200011fffff] PMD ->ffff810003200000 on node 0
[    0.000000]  [ffffe20001200000-ffffe200013fffff] PMD ->ffff810003600000 on node 0
[    0.000000]  [ffffe20001400000-ffffe200015fffff] PMD ->ffff810003a00000 on node 0
[    0.000000]  [ffffe20001600000-ffffe200017fffff] PMD ->ffff810003e00000 on node 0
[    0.000000]  [ffffe20001800000-ffffe200019fffff] PMD ->ffff810004200000 on node 0
[    0.000000]  [ffffe20001a00000-ffffe20001bfffff] PMD ->ffff810004600000 on node 0
[    0.000000]  [ffffe20001c00000-ffffe20001dfffff] PMD ->ffff810004a00000 on node 0
[    0.000000]  [ffffe20001e00000-ffffe20001ffffff] PMD ->ffff810004e00000 on node 0
[    0.000000]  [ffffe20002000000-ffffe200021fffff] PMD ->ffff810005200000 on node 0
[    0.000000]  [ffffe20002200000-ffffe200023fffff] PMD ->ffff810005600000 on node 0
[    0.000000]  [ffffe20002400000-ffffe200025fffff] PMD ->ffff810005a00000 on node 0
[    0.000000]  [ffffe20002600000-ffffe200027fffff] PMD ->ffff810005e00000 on node 0
[    0.000000]  [ffffe20002800000-ffffe200029fffff] PMD ->ffff810006200000 on node 0
[    0.000000]  [ffffe20002a00000-ffffe20002bfffff] PMD ->ffff810006600000 on node 0
[    0.000000]  [ffffe20002c00000-ffffe20002dfffff] PMD ->ffff810006a00000 on node 0
[    0.000000]  [ffffe20002e00000-ffffe20002ffffff] PMD ->ffff810006e00000 on node 0
[    0.000000]  [ffffe20003000000-ffffe200031fffff] PMD ->ffff810007200000 on node 0
[    0.000000]  [ffffe20003200000-ffffe200033fffff] PMD ->ffff810007600000 on node 0
[    0.000000]  [ffffe20003400000-ffffe200035fffff] PMD ->ffff810007a00000 on node 0
[    0.000000]  [ffffe20003600000-ffffe200037fffff] PMD ->ffff810007e00000 on node 0
[    0.000000]  [ffffe20003800000-ffffe200039fffff] PMD ->ffff810008200000 on node 0
[    0.000000]  [ffffe20003a00000-ffffe20003bfffff] PMD ->ffff810008600000 on node 0
[    0.000000]  [ffffe20003c00000-ffffe20003dfffff] PMD ->ffff810008a00000 on node 0
[    0.000000]  [ffffe20003e00000-ffffe20003ffffff] PMD ->ffff810008e00000 on node 0
[    0.000000]  [ffffe20004000000-ffffe200041fffff] PMD ->ffff810009200000 on node 0
[    0.000000]  [ffffe20004200000-ffffe200043fffff] PMD ->ffff810009600000 on node 0
[    0.000000]  [ffffe20004400000-ffffe200045fffff] PMD ->ffff810009a00000 on node 0
[    0.000000]  [ffffe20004600000-ffffe200047fffff] PMD ->ffff810009e00000 on node 0
[    0.000000]  [ffffe20004800000-ffffe200049fffff] PMD ->ffff81000a200000 on node 0
[    0.000000]  [ffffe20004a00000-ffffe20004bfffff] PMD ->ffff81000a600000 on node 0
[    0.000000]  [ffffe20004c00000-ffffe20004dfffff] PMD ->ffff81000aa00000 on node 0
[    0.000000]  [ffffe20004e00000-ffffe20004ffffff] PMD ->ffff81000ae00000 on node 0
[    0.000000]  [ffffe20005000000-ffffe200051fffff] PMD ->ffff81000b200000 on node 0
[    0.000000]  [ffffe20005200000-ffffe200053fffff] PMD ->ffff81000b600000 on node 0
[    0.000000] Zone PFN ranges:
[    0.000000]   DMA             0 ->     4096
[    0.000000]   DMA32        4096 ->  1048576
[    0.000000]   Normal    1048576 ->  1572864
[    0.000000] Movable zone start PFN for each node
[    0.000000] early_node_map[3] active PFN ranges
[    0.000000]     0:        0 ->      159
[    0.000000]     0:      256 ->   524176
[    0.000000]     0:  1048576 ->  1572864
[    0.000000] On node 0 totalpages: 1048367
[    0.000000]   DMA zone: 56 pages used for memmap
[    0.000000]   DMA zone: 1018 pages reserved
[    0.000000]   DMA zone: 2925 pages, LIFO batch:0
[    0.000000]   DMA32 zone: 14280 pages used for memmap
[    0.000000]   DMA32 zone: 505800 pages, LIFO batch:31
[    0.000000]   Normal zone: 7168 pages used for memmap
[    0.000000]   Normal zone: 517120 pages, LIFO batch:31
[    0.000000]   Movable zone: 0 pages used for memmap
[    0.000000] ACPI: Local APIC address 0xfee00000
[    0.000000] ACPI: LAPIC (acpi_id[0x01] lapic_id[0x00] enabled)
[    0.000000] Processor #0 (Bootup-CPU)
[    0.000000] ACPI: LAPIC (acpi_id[0x02] lapic_id[0x01] enabled)
[    0.000000] Processor #1
[    0.000000] ACPI: LAPIC (acpi_id[0x03] lapic_id[0x82] disabled)
[    0.000000] ACPI: LAPIC (acpi_id[0x04] lapic_id[0x83] disabled)
[    0.000000] ACPI: IOAPIC (id[0x02] address[0xfec00000] gsi_base[0])
[    0.000000] IOAPIC[0]: apic_id 2, address 0xfec00000, GSI 0-23
[    0.000000] ACPI: INT_SRC_OVR (bus 0 bus_irq 0 global_irq 2 dfl dfl)
[    0.000000] ACPI: INT_SRC_OVR (bus 0 bus_irq 9 global_irq 9 high level)
[    0.000000] ACPI: IRQ0 used by override.
[    0.000000] ACPI: IRQ2 used by override.
[    0.000000] ACPI: IRQ9 used by override.
[    0.000000] Setting APIC routing to flat
[    0.000000] ACPI: HPET id: 0x8086a202 base: 0xfed00000
[    0.000000] Using ACPI (MADT) for SMP configuration information
[    0.000000] Allocating PCI resources starting at 88000000 (gap: 80000000:7ee00000)
[    0.000000] PERCPU: Allocating 32200 bytes of per cpu data
[    0.000000] Built 1 zonelists in Zone order, mobility grouping on.  Total pages: 1025845
[    0.000000] Kernel command line: root=/dev/sda2 netconsole=@192.168.0.1/eth0,9353@192.168.0.42/00:1b:38:af:22:49 ignore_loglevel
[    0.000000] Initializing CPU#0
[    0.000000] PID hash table entries: 4096 (order: 12, 32768 bytes)
[    0.000000] Extended CMOS year: 2000
[    0.000000] TSC calibrated against PIT
[    0.000000] time.c: Detected 2135.096 MHz processor.
[    0.000000] Console: colour VGA+ 80x25
[    0.000000] console [tty0] enabled
[    0.000000] Dentry cache hash table entries: 524288 (order: 10, 4194304 bytes)
[    0.000000] Inode-cache hash table entries: 262144 (order: 9, 2097152 bytes)
[    0.000000] Checking aperture...
[    0.000000] PCI-DMA: Using software bounce buffering for IO (SWIOTLB)
[    0.000000] Placing software IO TLB between 0xbe27000 - 0xfe27000
[    0.000000] Memory: 4030128k/6291456k available (2283k kernel code, 162620k reserved, 1113k data, 228k init)
[    0.000000] CPA: page pool initialized 1 of 1 pages preallocated
[    0.000000] SLUB: Genslabs=12, HWalign=64, Order=0-1, MinObjects=4, CPUs=2, Nodes=1
[    0.000000] hpet clockevent registered
[    0.150000] Calibrating delay using timer specific routine.. 4272.54 BogoMIPS (lpj=21362745)
[    0.150000] Mount-cache hash table entries: 256
[    0.150000] CPU: L1 I cache: 32K, L1 D cache: 32K
[    0.150000] CPU: L2 cache: 2048K
[    0.150000] CPU: Physical Processor ID: 0
[    0.150000] CPU: Processor Core ID: 0
[    0.150000] CPU0: Thermal monitoring enabled (TM2)
[    0.150000] using mwait in idle threads.
[    0.150000] Freeing SMP alternatives: 21k freed
[    0.150000] ACPI: Core revision 20070126
[    0.270000] Using local APIC timer interrupts.
[    0.290000] APIC timer calibration result 16680437
[    0.290000] Detected 16.680 MHz APIC timer.
[    0.290000] Booting processor 1/2 APIC 0x1
[    0.000000] Initializing CPU#1
[    0.000000] Calibrating delay using timer specific routine.. 4270.25 BogoMIPS (lpj=21351254)
[    0.000000] CPU: L1 I cache: 32K, L1 D cache: 32K
[    0.000000] CPU: L2 cache: 2048K
[    0.000000] CPU: Physical Processor ID: 0
[    0.000000] CPU: Processor Core ID: 1
[    0.000000] CPU1: Thermal monitoring enabled (TM2)
[    0.000000] Intel(R) Core(TM)2 CPU          6400  @ 2.13GHz stepping 02
[    0.450000] checking TSC synchronization [CPU#0 -> CPU#1]: passed.
[    0.460000] Brought up 2 CPUs
[    0.460000] net_namespace: 872 bytes
[    0.460000] NET: Registered protocol family 16
[    0.460000] No dock devices found.
[    0.460000] ACPI: bus type pci registered
[    0.460000] PCI: Using configuration type 1
[    0.460000] ACPI: EC: Look up EC in DSDT
[    0.470000] ACPI: Interpreter enabled
[    0.470000] ACPI: (supports S0 S5)
[    0.470000] ACPI: Using IOAPIC for interrupt routing
[    0.480000] ACPI: PCI Root Bridge [PCI0] (0000:00)
[    0.480000] pci 0000:00:1f.0: quirk: region 0800-087f claimed by ICH6 ACPI/GPIO/TCO
[    0.480000] pci 0000:00:1f.0: quirk: region 0480-04bf claimed by ICH6 GPIO
[    0.480000] PCI: Transparent bridge - 0000:00:1e.0
[    0.480000] ACPI: PCI Interrupt Routing Table [\_SB_.PCI0._PRT]
[    0.480000] ACPI: PCI Interrupt Routing Table [\_SB_.PCI0.P0P2._PRT]
[    0.480000] ACPI: PCI Interrupt Routing Table [\_SB_.PCI0.P0P1._PRT]
[    0.480000] ACPI: PCI Interrupt Routing Table [\_SB_.PCI0.P0P4._PRT]
[    0.480000] ACPI: PCI Interrupt Routing Table [\_SB_.PCI0.P0P7._PRT]
[    0.480000] ACPI: PCI Interrupt Routing Table [\_SB_.PCI0.P0P8._PRT]
[    0.490000] ACPI: PCI Interrupt Link [LNKA] (IRQs 3 4 5 6 7 10 *11 12 14 15)
[    0.490000] ACPI: PCI Interrupt Link [LNKB] (IRQs 3 4 *5 6 7 10 11 12 14 15)
[    0.490000] ACPI: PCI Interrupt Link [LNKC] (IRQs 3 4 5 6 7 10 11 12 14 *15)
[    0.490000] ACPI: PCI Interrupt Link [LNKD] (IRQs 3 4 5 6 7 *10 11 12 14 15)
[    0.490000] ACPI: PCI Interrupt Link [LNKE] (IRQs 3 4 5 6 7 10 11 12 14 15) *0, disabled.
[    0.500000] ACPI: PCI Interrupt Link [LNKF] (IRQs 3 4 5 6 7 10 11 12 14 15) *0, disabled.
[    0.500000] ACPI: PCI Interrupt Link [LNKG] (IRQs *3 4 5 6 7 10 11 12 14 15)
[    0.500000] ACPI: PCI Interrupt Link [LNKH] (IRQs 3 4 5 6 7 10 11 12 *14 15)
[    0.500000] ACPI Warning (tbutils-0217): Incorrect checksum in table [OEMB] -  08, should be 03 [20070126]
[    0.500000] Linux Plug and Play Support v0.97 (c) Adam Belay
[    0.500000] pnp: PnP ACPI init
[    0.500000] ACPI: bus type pnp registered
[    0.500000] pnp: PnP ACPI: found 15 devices
[    0.500000] ACPI: ACPI bus type pnp unregistered
[    0.500000] SCSI subsystem initialized
[    0.500000] libata version 3.00 loaded.
[    0.500000] PCI: Using ACPI for IRQ routing
[    0.500000] PCI: If a device doesn't work, try "pci=routeirq".  If it helps, post a report
[    0.540000] PCI-GART: No AMD northbridge found.
[    0.550000] Switched to high resolution mode on CPU 0
[    0.090000] Switched to high resolution mode on CPU 1
[    0.580000] system 00:01: iomem range 0xfed14000-0xfed19fff has been reserved
[    0.580000] system 00:07: ioport range 0x290-0x297 has been reserved
[    0.580000] system 00:08: ioport range 0x4d0-0x4d1 has been reserved
[    0.580000] system 00:08: ioport range 0x800-0x87f has been reserved
[    0.580000] system 00:08: ioport range 0x480-0x4bf has been reserved
[    0.580000] system 00:08: iomem range 0xffafe000-0xffb0cbff could not be reserved
[    0.580000] system 00:08: iomem range 0xffb00000-0xffbfffff could not be reserved
[    0.580000] system 00:08: iomem range 0xfed1c000-0xfed1ffff has been reserved
[    0.580000] system 00:08: iomem range 0xfed20000-0xfed8ffff has been reserved
[    0.580000] system 00:08: iomem range 0xfff00000-0xfffffffe could not be reserved
[    0.580000] system 00:08: iomem range 0xfebfe000-0xfebfec00 has been reserved
[    0.580000] system 00:0b: iomem range 0xfec00000-0xfec00fff has been reserved
[    0.580000] system 00:0b: iomem range 0xfee00000-0xfee00fff could not be reserved
[    0.580000] system 00:0d: iomem range 0xe0000000-0xefffffff has been reserved
[    0.580000] system 00:0e: iomem range 0x0-0x9ffff could not be reserved
[    0.580000] system 00:0e: iomem range 0xc0000-0xcffff has been reserved
[    0.580000] system 00:0e: iomem range 0xe0000-0xfffff could not be reserved
[    0.580000] system 00:0e: iomem range 0x100000-0x7fffffff could not be reserved
[    0.580000] system 00:0e: iomem range 0x0-0x0 could not be reserved
[    0.580000] PCI: Bridge: 0000:00:01.0
[    0.580000]   IO window: 9000-9fff
[    0.580000]   MEM window: 0xf8700000-0xfe7fffff
[    0.580000]   PREFETCH window: 0x00000000bfe00000-0x00000000dfdfffff
[    0.580000] PCI: Bridge: 0000:00:1c.0
[    0.580000]   IO window: disabled.
[    0.580000]   MEM window: disabled.
[    0.580000]   PREFETCH window: 0x00000000dfe00000-0x00000000dfefffff
[    0.580000] PCI: Bridge: 0000:00:1c.3
[    0.580000]   IO window: disabled.
[    0.580000]   MEM window: 0xfe900000-0xfe9fffff
[    0.580000]   PREFETCH window: disabled.
[    0.580000] PCI: Bridge: 0000:00:1c.4
[    0.580000]   IO window: a000-afff
[    0.580000]   MEM window: 0xfe800000-0xfe8fffff
[    0.580000]   PREFETCH window: disabled.
[    0.580000] PCI: Bridge: 0000:00:1e.0
[    0.580000]   IO window: b000-bfff
[    0.580000]   MEM window: 0xfea00000-0xfeafffff
[    0.580000]   PREFETCH window: 0x0000000088000000-0x00000000880fffff
[    0.580000] ACPI: PCI Interrupt 0000:00:01.0[A] -> GSI 16 (level, low) -> IRQ 16
[    0.580000] PCI: Setting latency timer of device 0000:00:01.0 to 64
[    0.580000] ACPI: PCI Interrupt 0000:00:1c.0[A] -> GSI 16 (level, low) -> IRQ 16
[    0.580000] PCI: Setting latency timer of device 0000:00:1c.0 to 64
[    0.580000] ACPI: PCI Interrupt 0000:00:1c.3[D] -> GSI 19 (level, low) -> IRQ 19
[    0.580000] PCI: Setting latency timer of device 0000:00:1c.3 to 64
[    0.580000] ACPI: PCI Interrupt 0000:00:1c.4[A] -> GSI 16 (level, low) -> IRQ 16
[    0.580000] PCI: Setting latency timer of device 0000:00:1c.4 to 64
[    0.580000] PCI: Setting latency timer of device 0000:00:1e.0 to 64
[    0.580000] NET: Registered protocol family 2
[    0.670000] IP route cache hash table entries: 131072 (order: 8, 1048576 bytes)
[    0.670000] TCP established hash table entries: 262144 (order: 10, 4194304 bytes)
[    0.670000] TCP bind hash table entries: 65536 (order: 8, 1048576 bytes)
[    0.670000] TCP: Hash tables configured (established 262144 bind 65536)
[    0.670000] TCP reno registered
[    0.240000] io scheduler noop registered
[    0.240000] io scheduler cfq registered (default)
[    0.240000] pci 0000:01:00.0: Boot video device
[    0.250000] Real Time Clock Driver v1.12ac
[    0.250000] Linux agpgart interface v0.103
[    0.250000] ACPI: PCI Interrupt 0000:03:00.0[A] -> GSI 19 (level, low) -> IRQ 19
[    0.250000] PCI: Setting latency timer of device 0000:03:00.0 to 64
[    0.250000] atl1 0000:03:00.0: version 2.0.7
[    0.390000] 8139too Fast Ethernet driver 0.9.28
[    0.390000] ACPI: PCI Interrupt 0000:05:02.0[A] -> GSI 23 (level, low) -> IRQ 23
[    0.390000] eth1: RealTek RTL8139 at 0xb800, 00:80:48:2e:06:2e, IRQ 23
[    0.390000] eth1:  Identified 8139 chip type 'RTL-8100B/8139D'
[    0.390000] netconsole: local port 6665
[    0.390000] netconsole: local IP 192.168.0.1
[    0.390000] netconsole: interface eth0
[    0.390000] netconsole: remote port 9353
[    0.390000] netconsole: remote IP 192.168.0.42
[    0.390000] netconsole: remote ethernet address 00:1b:38:af:22:49
[    0.390000] netconsole: device eth0 not up yet, forcing it
[    3.050000] atl1 0000:03:00.0: eth0 link is up 1000 Mbps full duplex
[    2.760000] console [netcon0] enabled
[    2.770000] netconsole: network logging started
[    2.770000] Driver 'sd' needs updating - please use bus_type methods
[    2.770000] ahci 0000:02:00.0: version 3.0
[    2.770000] ACPI: PCI Interrupt 0000:02:00.0[A] -> GSI 16 (level, low) -> IRQ 16
[    2.800000] ahci 0000:02:00.0: AHCI 0001.0000 32 slots 2 ports 3 Gbps 0x3 impl SATA mode
[    2.800000] ahci 0000:02:00.0: flags: 64bit ncq pm led clo pmp pio slum part 
[    2.800000] PCI: Setting latency timer of device 0000:02:00.0 to 64
[    2.800000] scsi0 : ahci
[    2.800000] scsi1 : ahci
[    2.800000] ata1: SATA max UDMA/133 abar m8192@0xfe8fe000 port 0xfe8fe100 irq 16
[    2.800000] ata2: SATA max UDMA/133 abar m8192@0xfe8fe000 port 0xfe8fe180 irq 16
[    2.910000] ata1: SATA link down (SStatus 0 SControl 300)
[    3.010000] ata2: SATA link down (SStatus 0 SControl 300)
[    3.010000] ata_piix 0000:00:1f.2: version 2.12
[    3.010000] ACPI: PCI Interrupt 0000:00:1f.2[B] -> GSI 19 (level, low) -> IRQ 19
[    3.010000] ata_piix 0000:00:1f.2: MAP [ P0 P2 P1 P3 ]
[    3.010000] PCI: Setting latency timer of device 0000:00:1f.2 to 64
[    3.010000] scsi2 : ata_piix
[    3.010000] scsi3 : ata_piix
[    3.020000] ata3: SATA max UDMA/133 cmd 0xec00 ctl 0xe880 bmdma 0xe400 irq 19
[    3.020000] ata4: SATA max UDMA/133 cmd 0xe800 ctl 0xe480 bmdma 0xe408 irq 19
[    3.130000] ata3: SATA link up 1.5 Gbps (SStatus 113 SControl 300)
[    3.270000] ata3.00: ATA-8: ST3750330AS, SD15, max UDMA/133
[    3.270000] ata3.00: 1465149168 sectors, multi 16: LBA48 NCQ (depth 0/32)
[    3.270000] ata3.01: ATA-7: ST3160811AS, 3.AAE, max UDMA/133
[    3.270000] ata3.01: 312581808 sectors, multi 16: LBA48 NCQ (depth 0/32)
[    3.290000] ata3.00: configured for UDMA/133
[    3.310000] ata3.01: configured for UDMA/133
[    3.230000] ata4: SATA link up 1.5 Gbps (SStatus 113 SControl 300)
[    3.330000] ata4.00: ATA-7: ST3250620AS, 3.AAE, max UDMA/133
[    3.330000] ata4.00: 488397168 sectors, multi 16: LBA48 NCQ (depth 0/32)
[    3.360000] ata4.00: configured for UDMA/133
[    3.240000] scsi 2:0:0:0: Direct-Access     ATA      ST3750330AS      SD15 PQ: 0 ANSI: 5
[    3.240000] sd 2:0:0:0: [sda] 1465149168 512-byte hardware sectors (750156 MB)
[    3.240000] sd 2:0:0:0: [sda] Write Protect is off
[    3.250000] sd 2:0:0:0: [sda] Mode Sense: 00 3a 00 00
[    3.250000] sd 2:0:0:0: [sda] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
[    3.250000] sd 2:0:0:0: [sda] 1465149168 512-byte hardware sectors (750156 MB)
[    3.250000] sd 2:0:0:0: [sda] Write Protect is off
[    3.250000] sd 2:0:0:0: [sda] Mode Sense: 00 3a 00 00
[    3.250000] sd 2:0:0:0: [sda] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
[    3.250000]  sda: sda1 sda2
[    3.370000] sd 2:0:0:0: [sda] Attached SCSI disk
[    3.370000] scsi 2:0:1:0: Direct-Access     ATA      ST3160811AS      3.AA PQ: 0 ANSI: 5
[    3.370000] sd 2:0:1:0: [sdb] 312581808 512-byte hardware sectors (160042 MB)
[    3.370000] sd 2:0:1:0: [sdb] Write Protect is off
[    3.370000] sd 2:0:1:0: [sdb] Mode Sense: 00 3a 00 00
[    3.370000] sd 2:0:1:0: [sdb] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
[    3.370000] sd 2:0:1:0: [sdb] 312581808 512-byte hardware sectors (160042 MB)
[    3.370000] sd 2:0:1:0: [sdb] Write Protect is off
[    3.370000] sd 2:0:1:0: [sdb] Mode Sense: 00 3a 00 00
[    3.370000] sd 2:0:1:0: [sdb] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
[    3.370000]  sdb: sdb4
[    3.390000]  sdb4: <openbsd: sdb5 sdb6 >
[    3.390000] sd 2:0:1:0: [sdb] Attached SCSI disk
[    3.390000] scsi 3:0:0:0: Direct-Access     ATA      ST3250620AS      3.AA PQ: 0 ANSI: 5
[    3.390000] sd 3:0:0:0: [sdc] 488397168 512-byte hardware sectors (250059 MB)
[    3.390000] sd 3:0:0:0: [sdc] Write Protect is off
[    3.390000] sd 3:0:0:0: [sdc] Mode Sense: 00 3a 00 00
[    3.390000] sd 3:0:0:0: [sdc] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
[    3.390000] sd 3:0:0:0: [sdc] 488397168 512-byte hardware sectors (250059 MB)
[    3.390000] sd 3:0:0:0: [sdc] Write Protect is off
[    3.390000] sd 3:0:0:0: [sdc] Mode Sense: 00 3a 00 00
[    3.390000] sd 3:0:0:0: [sdc] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
[    3.390000]  sdc: sdc1
[    3.410000] sd 3:0:0:0: [sdc] Attached SCSI disk
[    3.410000] ACPI: PCI Interrupt 0000:00:1f.5[B] -> GSI 19 (level, low) -> IRQ 19
[    3.410000] ata_piix 0000:00:1f.5: MAP [ P0 -- P1 -- ]
[    3.410000] PCI: Setting latency timer of device 0000:00:1f.5 to 64
[    3.410000] scsi4 : ata_piix
[    3.410000] scsi5 : ata_piix
[    3.420000] ata5: SATA max UDMA/133 cmd 0xd400 ctl 0xd080 bmdma 0xc880 irq 19
[    3.420000] ata6: SATA max UDMA/133 cmd 0xd000 ctl 0xcc00 bmdma 0xc888 irq 19
[    3.420000] ata5: SATA link down (SStatus 0 SControl 300)
[    3.520000] ata6: SATA link down (SStatus 0 SControl 300)
[    3.500000] ACPI: PCI Interrupt 0000:02:00.1[B] -> GSI 17 (level, low) -> IRQ 17
[    3.500000] PCI: Setting latency timer of device 0000:02:00.1 to 64
[    3.500000] scsi6 : pata_jmicron
[    3.500000] scsi7 : pata_jmicron
[    3.500000] ata7: PATA max UDMA/100 cmd 0xac00 ctl 0xa880 bmdma 0xa400 irq 17
[    3.500000] ata8: PATA max UDMA/100 cmd 0xa800 ctl 0xa480 bmdma 0xa408 irq 17
[    3.560000] ata7.01: ATAPI: Optiarc DVD RW AD-7173A, 1-01, max UDMA/66
[    3.600000] ata7.01: configured for UDMA/66
[    3.620000] scsi 6:0:1:0: CD-ROM            Optiarc  DVD RW AD-7173A  1-01 PQ: 0 ANSI: 5
[    3.620000] PNP: PS/2 Controller [PNP0303:PS2K] at 0x60,0x64 irq 1
[    3.620000] PNP: PS/2 appears to have AUX port disabled, if this is incorrect please boot with i8042.nopnp
[    3.620000] serio: i8042 KBD port at 0x60,0x64 irq 1
[    3.620000] mice: PS/2 mouse device common for all mice
[    3.620000] Advanced Linux Sound Architecture Driver Version 1.0.16rc2 (Thu Jan 31 16:40:16 2008 UTC).
[    3.620000] ACPI: PCI Interrupt 0000:00:1b.0[A] -> GSI 22 (level, low) -> IRQ 22
[    3.620000] PCI: Setting latency timer of device 0000:00:1b.0 to 64
[    3.630000] input: AT Translated Set 2 keyboard as /class/input/input0
[    3.880000] ALSA device list:
[    3.880000]   #0: HDA Intel at 0xfebf8000 irq 22
[    3.880000] TCP cubic registered
[    3.880000] NET: Registered protocol family 1
[    3.650000] kjournald starting.  Commit interval 5 seconds
[    3.910000] EXT3-fs: mounted filesystem with ordered data mode.
[    3.910000] VFS: Mounted root (ext3 filesystem) readonly.
[    3.910000] Freeing unused kernel memory: 228k freed
[    4.930000] Driver 'sr' needs updating - please use bus_type methods
[    5.480000] usbcore: registered new interface driver usbfs
[    5.480000] usbcore: registered new interface driver hub
[    5.480000] usbcore: registered new device driver usb
[    5.480000] sr0: scsi3-mmc drive: 48x/48x writer dvd-ram cd/rw xa/form2 cdda tray
[    5.480000] Uniform CD-ROM driver Revision: 3.20
[    5.480000] sr 6:0:1:0: Attached scsi CD-ROM sr0
[    5.480000] USB Universal Host Controller Interface driver v3.0
[    5.480000] ACPI: PCI Interrupt 0000:00:1a.0[A] -> GSI 16 (level, low) -> IRQ 16
[    5.480000] PCI: Setting latency timer of device 0000:00:1a.0 to 64
[    5.480000] uhci_hcd 0000:00:1a.0: UHCI Host Controller
[    5.480000] uhci_hcd 0000:00:1a.0: new USB bus registered, assigned bus number 1
[    5.480000] uhci_hcd 0000:00:1a.0: irq 16, io base 0x0000dc00
[    5.480000] usb usb1: configuration #1 chosen from 1 choice
[    5.480000] hub 1-0:1.0: USB hub found
[    5.480000] hub 1-0:1.0: 2 ports detected
[    5.590000] usb usb1: New USB device found, idVendor=1d6b, idProduct=0001
[    5.590000] usb usb1: New USB device strings: Mfr=3, Product=2, SerialNumber=1
[    5.590000] usb usb1: Product: UHCI Host Controller
[    5.590000] usb usb1: Manufacturer: Linux 2.6.25-rc8 uhci_hcd
[    5.590000] usb usb1: SerialNumber: 0000:00:1a.0
[    5.590000] ACPI: PCI Interrupt 0000:00:1a.1[B] -> GSI 17 (level, low) -> IRQ 17
[    5.590000] PCI: Setting latency timer of device 0000:00:1a.1 to 64
[    5.590000] uhci_hcd 0000:00:1a.1: UHCI Host Controller
[    5.590000] uhci_hcd 0000:00:1a.1: new USB bus registered, assigned bus number 2
[    5.590000] uhci_hcd 0000:00:1a.1: irq 17, io base 0x0000e000
[    5.590000] usb usb2: configuration #1 chosen from 1 choice
[    5.590000] hub 2-0:1.0: USB hub found
[    5.590000] hub 2-0:1.0: 2 ports detected
[    5.700000] usb usb2: New USB device found, idVendor=1d6b, idProduct=0001
[    5.700000] usb usb2: New USB device strings: Mfr=3, Product=2, SerialNumber=1
[    5.700000] usb usb2: Product: UHCI Host Controller
[    5.700000] usb usb2: Manufacturer: Linux 2.6.25-rc8 uhci_hcd
[    5.700000] usb usb2: SerialNumber: 0000:00:1a.1
[    5.700000] ACPI: PCI Interrupt 0000:00:1d.0[A] -> GSI 23 (level, low) -> IRQ 23
[    5.700000] PCI: Setting latency timer of device 0000:00:1d.0 to 64
[    5.700000] uhci_hcd 0000:00:1d.0: UHCI Host Controller
[    5.700000] uhci_hcd 0000:00:1d.0: new USB bus registered, assigned bus number 3
[    5.700000] uhci_hcd 0000:00:1d.0: irq 23, io base 0x0000d480
[    5.700000] usb usb3: configuration #1 chosen from 1 choice
[    5.700000] hub 3-0:1.0: USB hub found
[    5.700000] hub 3-0:1.0: 2 ports detected
[    5.770000] usb usb3: New USB device found, idVendor=1d6b, idProduct=0001
[    5.770000] usb usb3: New USB device strings: Mfr=3, Product=2, SerialNumber=1
[    5.770000] usb usb3: Product: UHCI Host Controller
[    5.770000] usb usb3: Manufacturer: Linux 2.6.25-rc8 uhci_hcd
[    5.770000] usb usb3: SerialNumber: 0000:00:1d.0
[    5.770000] ACPI: PCI Interrupt 0000:00:1d.1[B] -> GSI 19 (level, low) -> IRQ 19
[    5.770000] PCI: Setting latency timer of device 0000:00:1d.1 to 64
[    5.770000] uhci_hcd 0000:00:1d.1: UHCI Host Controller
[    5.770000] uhci_hcd 0000:00:1d.1: new USB bus registered, assigned bus number 4
[    5.770000] uhci_hcd 0000:00:1d.1: irq 19, io base 0x0000d800
[    5.770000] usb usb4: configuration #1 chosen from 1 choice
[    5.770000] hub 4-0:1.0: USB hub found
[    5.770000] hub 4-0:1.0: 2 ports detected
[    5.810000] usb usb4: New USB device found, idVendor=1d6b, idProduct=0001
[    5.810000] usb usb4: New USB device strings: Mfr=3, Product=2, SerialNumber=1
[    5.810000] usb usb4: Product: UHCI Host Controller
[    5.810000] usb usb4: Manufacturer: Linux 2.6.25-rc8 uhci_hcd
[    5.810000] usb usb4: SerialNumber: 0000:00:1d.1
[    5.810000] ACPI: PCI Interrupt 0000:00:1d.2[C] -> GSI 18 (level, low) -> IRQ 18
[    5.810000] PCI: Setting latency timer of device 0000:00:1d.2 to 64
[    5.810000] uhci_hcd 0000:00:1d.2: UHCI Host Controller
[    5.810000] uhci_hcd 0000:00:1d.2: new USB bus registered, assigned bus number 5
[    5.810000] uhci_hcd 0000:00:1d.2: irq 18, io base 0x0000d880
[    5.810000] usb usb5: configuration #1 chosen from 1 choice
[    5.810000] hub 5-0:1.0: USB hub found
[    5.810000] hub 5-0:1.0: 2 ports detected
[    5.860000] usb usb5: New USB device found, idVendor=1d6b, idProduct=0001
[    5.860000] usb usb5: New USB device strings: Mfr=3, Product=2, SerialNumber=1
[    5.860000] usb usb5: Product: UHCI Host Controller
[    5.860000] usb usb5: Manufacturer: Linux 2.6.25-rc8 uhci_hcd
[    5.860000] usb usb5: SerialNumber: 0000:00:1d.2
[    5.860000] ACPI: PCI Interrupt 0000:00:1a.7[C] -> GSI 18 (level, low) -> IRQ 18
[    5.860000] PCI: Setting latency timer of device 0000:00:1a.7 to 64
[    5.860000] ehci_hcd 0000:00:1a.7: EHCI Host Controller
[    5.860000] ehci_hcd 0000:00:1a.7: new USB bus registered, assigned bus number 6
[    5.860000] ehci_hcd 0000:00:1a.7: debug port 1
[    5.860000] PCI: cache line size of 32 is not supported by device 0000:00:1a.7
[    5.860000] ehci_hcd 0000:00:1a.7: irq 18, io mem 0xfebffc00
[    5.880000] ehci_hcd 0000:00:1a.7: USB 2.0 started, EHCI 1.00, driver 10 Dec 2004
[    5.880000] usb usb6: configuration #1 chosen from 1 choice
[    5.880000] hub 6-0:1.0: USB hub found
[    5.880000] hub 6-0:1.0: 4 ports detected
[    5.970000] usb usb6: New USB device found, idVendor=1d6b, idProduct=0002
[    5.970000] usb usb6: New USB device strings: Mfr=3, Product=2, SerialNumber=1
[    5.970000] usb usb6: Product: EHCI Host Controller
[    5.970000] usb usb6: Manufacturer: Linux 2.6.25-rc8 ehci_hcd
[    5.970000] usb usb6: SerialNumber: 0000:00:1a.7
[    5.970000] ACPI: PCI Interrupt 0000:00:1d.7[A] -> GSI 23 (level, low) -> IRQ 23
[    5.970000] PCI: Setting latency timer of device 0000:00:1d.7 to 64
[    5.970000] ehci_hcd 0000:00:1d.7: EHCI Host Controller
[    5.970000] ehci_hcd 0000:00:1d.7: new USB bus registered, assigned bus number 7
[    5.970000] ehci_hcd 0000:00:1d.7: debug port 1
[    5.970000] PCI: cache line size of 32 is not supported by device 0000:00:1d.7
[    5.970000] ehci_hcd 0000:00:1d.7: irq 23, io mem 0xfebff800
[    5.990000] ehci_hcd 0000:00:1d.7: USB 2.0 started, EHCI 1.00, driver 10 Dec 2004
[    5.990000] usb usb7: configuration #1 chosen from 1 choice
[    5.990000] hub 7-0:1.0: USB hub found
[    5.990000] hub 7-0:1.0: 6 ports detected
[    6.040000] usb usb7: New USB device found, idVendor=1d6b, idProduct=0002
[    6.040000] usb usb7: New USB device strings: Mfr=3, Product=2, SerialNumber=1
[    6.040000] usb usb7: Product: EHCI Host Controller
[    6.040000] usb usb7: Manufacturer: Linux 2.6.25-rc8 ehci_hcd
[    6.040000] usb usb7: SerialNumber: 0000:00:1d.7
[    6.870000] EXT3 FS on sda2, internal journal
[    7.130000] Adding 9775512k swap on /dev/sda1.  Priority:-1 extents:1 across:9775512k
[   12.610000] ip_tables: (C) 2000-2006 Netfilter Core Team
[   12.620000] nf_conntrack version 0.5.0 (16384 buckets, 65536 max)
[   13.000000] eth1: link up, 100Mbps, full-duplex, lpa 0x45E1


^ permalink raw reply	[flat|nested] 71+ messages in thread

* 2.6.25-rc8-mm2: boot hang after "ACPI: using IOAPIC for interrupt routing"
  2008-04-11  3:33 2.6.25-rc8-mm2 Andrew Morton
  2008-04-11  6:28 ` 2.6.25-rc8-mm2: boot hang after "ACPI: using IOAPIC for interrupt routing" Alexey Dobriyan
@ 2008-04-11  6:29 ` Alexey Dobriyan
  2008-04-11  6:43   ` Pekka Enberg
  2008-04-11 10:35   ` Pekka Enberg
  2008-04-11  9:57 ` 2.6.25-rc8-mm2 KAMEZAWA Hiroyuki
                   ` (9 subsequent siblings)
  11 siblings, 2 replies; 71+ messages in thread
From: Alexey Dobriyan @ 2008-04-11  6:29 UTC (permalink / raw
  To: Andrew Morton; +Cc: linux-kernel, Pekka Enberg

On Thu, Apr 10, 2008 at 08:33:54PM -0700, Andrew Morton wrote:
> - git-slub has been temporarily replaced by git-pekka.  Pekka is standing in
>   while Christoph is away.

I bisected boot hang after "ACPI: using IOAPIC for interrupt routing"
down to git-pekka.

normal dmesg from 2.6.25-rc8-something and .config snippets for -mm

# CONFIG_SLUB_DEBUG is not set		(hey, I thought this was on!)
CONFIG_SLUB=y
# CONFIG_SLUB_STATS is not set

# CONFIG_DEBUG_DRIVER is not set
# CONFIG_DEBUG_DEVRES is not set
# CONFIG_DEBUG_FS is not set
CONFIG_DEBUG_KERNEL=y
CONFIG_DEBUG_SHIRQ=y
CONFIG_DEBUG_RT_MUTEXES=y
CONFIG_DEBUG_PI_LIST=y
CONFIG_DEBUG_SPINLOCK=y
CONFIG_DEBUG_MUTEXES=y
CONFIG_DEBUG_LOCK_ALLOC=y
# CONFIG_DEBUG_LOCKDEP is not set
CONFIG_DEBUG_SPINLOCK_SLEEP=y
# CONFIG_DEBUG_LOCKING_API_SELFTESTS is not set
# CONFIG_DEBUG_KOBJECT is not set
CONFIG_DEBUG_BUGVERBOSE=y
# CONFIG_DEBUG_INFO is not set
CONFIG_DEBUG_VM=y
CONFIG_DEBUG_WRITECOUNT=y
CONFIG_DEBUG_LIST=y
CONFIG_DEBUG_SG=y
# CONFIG_DEBUG_STACKOVERFLOW is not set
# CONFIG_DEBUG_STACK_USAGE is not set
CONFIG_DEBUG_PAGEALLOC=y
CONFIG_DEBUG_PER_CPU_MAPS=y
CONFIG_DEBUG_RODATA=y
# CONFIG_DEBUG_RODATA_TEST is not set
# CONFIG_DEBUG_NX_TEST is not set

P.S.: I now suspect that one bisection point was wrong, however,
      make-module_sect_attrs-private-to-kernel-modulec-checkpatch-fixes
      was definitely good.


[    0.000000] Linux version 2.6.25-rc8 (ad@martell) (gcc version 4.1.2 (Gentoo 4.1.2 p1.0.2)) #2 SMP Fri Apr 11 00:54:03 MSD 2008
[    0.000000] Command line: root=/dev/sda2 netconsole=@192.168.0.1/eth0,9353@192.168.0.42/00:1b:38:af:22:49 ignore_loglevel
[    0.000000] BIOS-provided physical RAM map:
[    0.000000]  BIOS-e820: 0000000000000000 - 000000000009fc00 (usable)
[    0.000000]  BIOS-e820: 000000000009fc00 - 00000000000a0000 (reserved)
[    0.000000]  BIOS-e820: 00000000000e4000 - 0000000000100000 (reserved)
[    0.000000]  BIOS-e820: 0000000000100000 - 000000007ff90000 (usable)
[    0.000000]  BIOS-e820: 000000007ff90000 - 000000007ff9e000 (ACPI data)
[    0.000000]  BIOS-e820: 000000007ff9e000 - 000000007ffe0000 (ACPI NVS)
[    0.000000]  BIOS-e820: 000000007ffe0000 - 0000000080000000 (reserved)
[    0.000000]  BIOS-e820: 00000000fee00000 - 00000000fee01000 (reserved)
[    0.000000]  BIOS-e820: 00000000ffb00000 - 0000000100000000 (reserved)
[    0.000000]  BIOS-e820: 0000000100000000 - 0000000180000000 (usable)
[    0.000000] debug: ignoring loglevel setting.
[    0.000000] Entering add_active_range(0, 0, 159) 0 entries of 256 used
[    0.000000] Entering add_active_range(0, 256, 524176) 1 entries of 256 used
[    0.000000] Entering add_active_range(0, 1048576, 1572864) 2 entries of 256 used
[    0.000000] end_pfn_map = 1572864
[    0.000000] DMI 2.4 present.
[    0.000000] ACPI: RSDP 000FA980, 0024 (r2 ACPIAM)
[    0.000000] ACPI: XSDT 7FF90100, 0054 (r1 KOZIRO FRONTIER  2000707 MSFT       97)
[    0.000000] ACPI: FACP 7FF90290, 00F4 (r3 MSTEST OEMFACP   2000707 MSFT       97)
[    0.000000] ACPI: DSDT 7FF905C0, 8FA9 (r1  A0637 A0637000        0 INTL 20060113)
[    0.000000] ACPI: FACS 7FF9E000, 0040
[    0.000000] ACPI: APIC 7FF90390, 006C (r1 MSTEST OEMAPIC   2000707 MSFT       97)
[    0.000000] ACPI: MCFG 7FF90400, 003C (r1 MSTEST OEMMCFG   2000707 MSFT       97)
[    0.000000] ACPI: SLIC 7FF90440, 0176 (r1 KOZIRO FRONTIER  2000707 MSFT       97)
[    0.000000] ACPI: OEMB 7FF9E040, 007B (r1 MSTEST AMI_OEM   2000707 MSFT       97)
[    0.000000] ACPI: HPET 7FF99570, 0038 (r1 MSTEST OEMHPET   2000707 MSFT       97)
[    0.000000] Entering add_active_range(0, 0, 159) 0 entries of 256 used
[    0.000000] Entering add_active_range(0, 256, 524176) 1 entries of 256 used
[    0.000000] Entering add_active_range(0, 1048576, 1572864) 2 entries of 256 used
[    0.000000] early res: 0 [0-fff] BIOS data page
[    0.000000] early res: 1 [6000-7fff] SMP_TRAMPOLINE
[    0.000000] early res: 2 [200000-5eead3] TEXT DATA BSS
[    0.000000] early res: 3 [9fc00-a0bff] EBDA
[    0.000000] early res: 4 [8000-dfff] PGTABLE
[    0.000000]  [ffffe20000000000-ffffe200001fffff] PMD ->ffff810001200000 on node 0
[    0.000000]  [ffffe20000200000-ffffe200003fffff] PMD ->ffff810001600000 on node 0
[    0.000000]  [ffffe20000400000-ffffe200005fffff] PMD ->ffff810001a00000 on node 0
[    0.000000]  [ffffe20000600000-ffffe200007fffff] PMD ->ffff810001e00000 on node 0
[    0.000000]  [ffffe20000800000-ffffe200009fffff] PMD ->ffff810002200000 on node 0
[    0.000000]  [ffffe20000a00000-ffffe20000bfffff] PMD ->ffff810002600000 on node 0
[    0.000000]  [ffffe20000c00000-ffffe20000dfffff] PMD ->ffff810002a00000 on node 0
[    0.000000]  [ffffe20000e00000-ffffe20000ffffff] PMD ->ffff810002e00000 on node 0
[    0.000000]  [ffffe20001000000-ffffe200011fffff] PMD ->ffff810003200000 on node 0
[    0.000000]  [ffffe20001200000-ffffe200013fffff] PMD ->ffff810003600000 on node 0
[    0.000000]  [ffffe20001400000-ffffe200015fffff] PMD ->ffff810003a00000 on node 0
[    0.000000]  [ffffe20001600000-ffffe200017fffff] PMD ->ffff810003e00000 on node 0
[    0.000000]  [ffffe20001800000-ffffe200019fffff] PMD ->ffff810004200000 on node 0
[    0.000000]  [ffffe20001a00000-ffffe20001bfffff] PMD ->ffff810004600000 on node 0
[    0.000000]  [ffffe20001c00000-ffffe20001dfffff] PMD ->ffff810004a00000 on node 0
[    0.000000]  [ffffe20001e00000-ffffe20001ffffff] PMD ->ffff810004e00000 on node 0
[    0.000000]  [ffffe20002000000-ffffe200021fffff] PMD ->ffff810005200000 on node 0
[    0.000000]  [ffffe20002200000-ffffe200023fffff] PMD ->ffff810005600000 on node 0
[    0.000000]  [ffffe20002400000-ffffe200025fffff] PMD ->ffff810005a00000 on node 0
[    0.000000]  [ffffe20002600000-ffffe200027fffff] PMD ->ffff810005e00000 on node 0
[    0.000000]  [ffffe20002800000-ffffe200029fffff] PMD ->ffff810006200000 on node 0
[    0.000000]  [ffffe20002a00000-ffffe20002bfffff] PMD ->ffff810006600000 on node 0
[    0.000000]  [ffffe20002c00000-ffffe20002dfffff] PMD ->ffff810006a00000 on node 0
[    0.000000]  [ffffe20002e00000-ffffe20002ffffff] PMD ->ffff810006e00000 on node 0
[    0.000000]  [ffffe20003000000-ffffe200031fffff] PMD ->ffff810007200000 on node 0
[    0.000000]  [ffffe20003200000-ffffe200033fffff] PMD ->ffff810007600000 on node 0
[    0.000000]  [ffffe20003400000-ffffe200035fffff] PMD ->ffff810007a00000 on node 0
[    0.000000]  [ffffe20003600000-ffffe200037fffff] PMD ->ffff810007e00000 on node 0
[    0.000000]  [ffffe20003800000-ffffe200039fffff] PMD ->ffff810008200000 on node 0
[    0.000000]  [ffffe20003a00000-ffffe20003bfffff] PMD ->ffff810008600000 on node 0
[    0.000000]  [ffffe20003c00000-ffffe20003dfffff] PMD ->ffff810008a00000 on node 0
[    0.000000]  [ffffe20003e00000-ffffe20003ffffff] PMD ->ffff810008e00000 on node 0
[    0.000000]  [ffffe20004000000-ffffe200041fffff] PMD ->ffff810009200000 on node 0
[    0.000000]  [ffffe20004200000-ffffe200043fffff] PMD ->ffff810009600000 on node 0
[    0.000000]  [ffffe20004400000-ffffe200045fffff] PMD ->ffff810009a00000 on node 0
[    0.000000]  [ffffe20004600000-ffffe200047fffff] PMD ->ffff810009e00000 on node 0
[    0.000000]  [ffffe20004800000-ffffe200049fffff] PMD ->ffff81000a200000 on node 0
[    0.000000]  [ffffe20004a00000-ffffe20004bfffff] PMD ->ffff81000a600000 on node 0
[    0.000000]  [ffffe20004c00000-ffffe20004dfffff] PMD ->ffff81000aa00000 on node 0
[    0.000000]  [ffffe20004e00000-ffffe20004ffffff] PMD ->ffff81000ae00000 on node 0
[    0.000000]  [ffffe20005000000-ffffe200051fffff] PMD ->ffff81000b200000 on node 0
[    0.000000]  [ffffe20005200000-ffffe200053fffff] PMD ->ffff81000b600000 on node 0
[    0.000000] Zone PFN ranges:
[    0.000000]   DMA             0 ->     4096
[    0.000000]   DMA32        4096 ->  1048576
[    0.000000]   Normal    1048576 ->  1572864
[    0.000000] Movable zone start PFN for each node
[    0.000000] early_node_map[3] active PFN ranges
[    0.000000]     0:        0 ->      159
[    0.000000]     0:      256 ->   524176
[    0.000000]     0:  1048576 ->  1572864
[    0.000000] On node 0 totalpages: 1048367
[    0.000000]   DMA zone: 56 pages used for memmap
[    0.000000]   DMA zone: 1018 pages reserved
[    0.000000]   DMA zone: 2925 pages, LIFO batch:0
[    0.000000]   DMA32 zone: 14280 pages used for memmap
[    0.000000]   DMA32 zone: 505800 pages, LIFO batch:31
[    0.000000]   Normal zone: 7168 pages used for memmap
[    0.000000]   Normal zone: 517120 pages, LIFO batch:31
[    0.000000]   Movable zone: 0 pages used for memmap
[    0.000000] ACPI: Local APIC address 0xfee00000
[    0.000000] ACPI: LAPIC (acpi_id[0x01] lapic_id[0x00] enabled)
[    0.000000] Processor #0 (Bootup-CPU)
[    0.000000] ACPI: LAPIC (acpi_id[0x02] lapic_id[0x01] enabled)
[    0.000000] Processor #1
[    0.000000] ACPI: LAPIC (acpi_id[0x03] lapic_id[0x82] disabled)
[    0.000000] ACPI: LAPIC (acpi_id[0x04] lapic_id[0x83] disabled)
[    0.000000] ACPI: IOAPIC (id[0x02] address[0xfec00000] gsi_base[0])
[    0.000000] IOAPIC[0]: apic_id 2, address 0xfec00000, GSI 0-23
[    0.000000] ACPI: INT_SRC_OVR (bus 0 bus_irq 0 global_irq 2 dfl dfl)
[    0.000000] ACPI: INT_SRC_OVR (bus 0 bus_irq 9 global_irq 9 high level)
[    0.000000] ACPI: IRQ0 used by override.
[    0.000000] ACPI: IRQ2 used by override.
[    0.000000] ACPI: IRQ9 used by override.
[    0.000000] Setting APIC routing to flat
[    0.000000] ACPI: HPET id: 0x8086a202 base: 0xfed00000
[    0.000000] Using ACPI (MADT) for SMP configuration information
[    0.000000] Allocating PCI resources starting at 88000000 (gap: 80000000:7ee00000)
[    0.000000] PERCPU: Allocating 32200 bytes of per cpu data
[    0.000000] Built 1 zonelists in Zone order, mobility grouping on.  Total pages: 1025845
[    0.000000] Kernel command line: root=/dev/sda2 netconsole=@192.168.0.1/eth0,9353@192.168.0.42/00:1b:38:af:22:49 ignore_loglevel
[    0.000000] Initializing CPU#0
[    0.000000] PID hash table entries: 4096 (order: 12, 32768 bytes)
[    0.000000] Extended CMOS year: 2000
[    0.000000] TSC calibrated against PIT
[    0.000000] time.c: Detected 2135.096 MHz processor.
[    0.000000] Console: colour VGA+ 80x25
[    0.000000] console [tty0] enabled
[    0.000000] Dentry cache hash table entries: 524288 (order: 10, 4194304 bytes)
[    0.000000] Inode-cache hash table entries: 262144 (order: 9, 2097152 bytes)
[    0.000000] Checking aperture...
[    0.000000] PCI-DMA: Using software bounce buffering for IO (SWIOTLB)
[    0.000000] Placing software IO TLB between 0xbe27000 - 0xfe27000
[    0.000000] Memory: 4030128k/6291456k available (2283k kernel code, 162620k reserved, 1113k data, 228k init)
[    0.000000] CPA: page pool initialized 1 of 1 pages preallocated
[    0.000000] SLUB: Genslabs=12, HWalign=64, Order=0-1, MinObjects=4, CPUs=2, Nodes=1
[    0.000000] hpet clockevent registered
[    0.150000] Calibrating delay using timer specific routine.. 4272.54 BogoMIPS (lpj=21362745)
[    0.150000] Mount-cache hash table entries: 256
[    0.150000] CPU: L1 I cache: 32K, L1 D cache: 32K
[    0.150000] CPU: L2 cache: 2048K
[    0.150000] CPU: Physical Processor ID: 0
[    0.150000] CPU: Processor Core ID: 0
[    0.150000] CPU0: Thermal monitoring enabled (TM2)
[    0.150000] using mwait in idle threads.
[    0.150000] Freeing SMP alternatives: 21k freed
[    0.150000] ACPI: Core revision 20070126
[    0.270000] Using local APIC timer interrupts.
[    0.290000] APIC timer calibration result 16680437
[    0.290000] Detected 16.680 MHz APIC timer.
[    0.290000] Booting processor 1/2 APIC 0x1
[    0.000000] Initializing CPU#1
[    0.000000] Calibrating delay using timer specific routine.. 4270.25 BogoMIPS (lpj=21351254)
[    0.000000] CPU: L1 I cache: 32K, L1 D cache: 32K
[    0.000000] CPU: L2 cache: 2048K
[    0.000000] CPU: Physical Processor ID: 0
[    0.000000] CPU: Processor Core ID: 1
[    0.000000] CPU1: Thermal monitoring enabled (TM2)
[    0.000000] Intel(R) Core(TM)2 CPU          6400  @ 2.13GHz stepping 02
[    0.450000] checking TSC synchronization [CPU#0 -> CPU#1]: passed.
[    0.460000] Brought up 2 CPUs
[    0.460000] net_namespace: 872 bytes
[    0.460000] NET: Registered protocol family 16
[    0.460000] No dock devices found.
[    0.460000] ACPI: bus type pci registered
[    0.460000] PCI: Using configuration type 1
[    0.460000] ACPI: EC: Look up EC in DSDT
[    0.470000] ACPI: Interpreter enabled
[    0.470000] ACPI: (supports S0 S5)
[    0.470000] ACPI: Using IOAPIC for interrupt routing
[    0.480000] ACPI: PCI Root Bridge [PCI0] (0000:00)
[    0.480000] pci 0000:00:1f.0: quirk: region 0800-087f claimed by ICH6 ACPI/GPIO/TCO
[    0.480000] pci 0000:00:1f.0: quirk: region 0480-04bf claimed by ICH6 GPIO
[    0.480000] PCI: Transparent bridge - 0000:00:1e.0
[    0.480000] ACPI: PCI Interrupt Routing Table [\_SB_.PCI0._PRT]
[    0.480000] ACPI: PCI Interrupt Routing Table [\_SB_.PCI0.P0P2._PRT]
[    0.480000] ACPI: PCI Interrupt Routing Table [\_SB_.PCI0.P0P1._PRT]
[    0.480000] ACPI: PCI Interrupt Routing Table [\_SB_.PCI0.P0P4._PRT]
[    0.480000] ACPI: PCI Interrupt Routing Table [\_SB_.PCI0.P0P7._PRT]
[    0.480000] ACPI: PCI Interrupt Routing Table [\_SB_.PCI0.P0P8._PRT]
[    0.490000] ACPI: PCI Interrupt Link [LNKA] (IRQs 3 4 5 6 7 10 *11 12 14 15)
[    0.490000] ACPI: PCI Interrupt Link [LNKB] (IRQs 3 4 *5 6 7 10 11 12 14 15)
[    0.490000] ACPI: PCI Interrupt Link [LNKC] (IRQs 3 4 5 6 7 10 11 12 14 *15)
[    0.490000] ACPI: PCI Interrupt Link [LNKD] (IRQs 3 4 5 6 7 *10 11 12 14 15)
[    0.490000] ACPI: PCI Interrupt Link [LNKE] (IRQs 3 4 5 6 7 10 11 12 14 15) *0, disabled.
[    0.500000] ACPI: PCI Interrupt Link [LNKF] (IRQs 3 4 5 6 7 10 11 12 14 15) *0, disabled.
[    0.500000] ACPI: PCI Interrupt Link [LNKG] (IRQs *3 4 5 6 7 10 11 12 14 15)
[    0.500000] ACPI: PCI Interrupt Link [LNKH] (IRQs 3 4 5 6 7 10 11 12 *14 15)
[    0.500000] ACPI Warning (tbutils-0217): Incorrect checksum in table [OEMB] -  08, should be 03 [20070126]
[    0.500000] Linux Plug and Play Support v0.97 (c) Adam Belay
[    0.500000] pnp: PnP ACPI init
[    0.500000] ACPI: bus type pnp registered
[    0.500000] pnp: PnP ACPI: found 15 devices
[    0.500000] ACPI: ACPI bus type pnp unregistered
[    0.500000] SCSI subsystem initialized
[    0.500000] libata version 3.00 loaded.
[    0.500000] PCI: Using ACPI for IRQ routing
[    0.500000] PCI: If a device doesn't work, try "pci=routeirq".  If it helps, post a report
[    0.540000] PCI-GART: No AMD northbridge found.
[    0.550000] Switched to high resolution mode on CPU 0
[    0.090000] Switched to high resolution mode on CPU 1
[    0.580000] system 00:01: iomem range 0xfed14000-0xfed19fff has been reserved
[    0.580000] system 00:07: ioport range 0x290-0x297 has been reserved
[    0.580000] system 00:08: ioport range 0x4d0-0x4d1 has been reserved
[    0.580000] system 00:08: ioport range 0x800-0x87f has been reserved
[    0.580000] system 00:08: ioport range 0x480-0x4bf has been reserved
[    0.580000] system 00:08: iomem range 0xffafe000-0xffb0cbff could not be reserved
[    0.580000] system 00:08: iomem range 0xffb00000-0xffbfffff could not be reserved
[    0.580000] system 00:08: iomem range 0xfed1c000-0xfed1ffff has been reserved
[    0.580000] system 00:08: iomem range 0xfed20000-0xfed8ffff has been reserved
[    0.580000] system 00:08: iomem range 0xfff00000-0xfffffffe could not be reserved
[    0.580000] system 00:08: iomem range 0xfebfe000-0xfebfec00 has been reserved
[    0.580000] system 00:0b: iomem range 0xfec00000-0xfec00fff has been reserved
[    0.580000] system 00:0b: iomem range 0xfee00000-0xfee00fff could not be reserved
[    0.580000] system 00:0d: iomem range 0xe0000000-0xefffffff has been reserved
[    0.580000] system 00:0e: iomem range 0x0-0x9ffff could not be reserved
[    0.580000] system 00:0e: iomem range 0xc0000-0xcffff has been reserved
[    0.580000] system 00:0e: iomem range 0xe0000-0xfffff could not be reserved
[    0.580000] system 00:0e: iomem range 0x100000-0x7fffffff could not be reserved
[    0.580000] system 00:0e: iomem range 0x0-0x0 could not be reserved
[    0.580000] PCI: Bridge: 0000:00:01.0
[    0.580000]   IO window: 9000-9fff
[    0.580000]   MEM window: 0xf8700000-0xfe7fffff
[    0.580000]   PREFETCH window: 0x00000000bfe00000-0x00000000dfdfffff
[    0.580000] PCI: Bridge: 0000:00:1c.0
[    0.580000]   IO window: disabled.
[    0.580000]   MEM window: disabled.
[    0.580000]   PREFETCH window: 0x00000000dfe00000-0x00000000dfefffff
[    0.580000] PCI: Bridge: 0000:00:1c.3
[    0.580000]   IO window: disabled.
[    0.580000]   MEM window: 0xfe900000-0xfe9fffff
[    0.580000]   PREFETCH window: disabled.
[    0.580000] PCI: Bridge: 0000:00:1c.4
[    0.580000]   IO window: a000-afff
[    0.580000]   MEM window: 0xfe800000-0xfe8fffff
[    0.580000]   PREFETCH window: disabled.
[    0.580000] PCI: Bridge: 0000:00:1e.0
[    0.580000]   IO window: b000-bfff
[    0.580000]   MEM window: 0xfea00000-0xfeafffff
[    0.580000]   PREFETCH window: 0x0000000088000000-0x00000000880fffff
[    0.580000] ACPI: PCI Interrupt 0000:00:01.0[A] -> GSI 16 (level, low) -> IRQ 16
[    0.580000] PCI: Setting latency timer of device 0000:00:01.0 to 64
[    0.580000] ACPI: PCI Interrupt 0000:00:1c.0[A] -> GSI 16 (level, low) -> IRQ 16
[    0.580000] PCI: Setting latency timer of device 0000:00:1c.0 to 64
[    0.580000] ACPI: PCI Interrupt 0000:00:1c.3[D] -> GSI 19 (level, low) -> IRQ 19
[    0.580000] PCI: Setting latency timer of device 0000:00:1c.3 to 64
[    0.580000] ACPI: PCI Interrupt 0000:00:1c.4[A] -> GSI 16 (level, low) -> IRQ 16
[    0.580000] PCI: Setting latency timer of device 0000:00:1c.4 to 64
[    0.580000] PCI: Setting latency timer of device 0000:00:1e.0 to 64
[    0.580000] NET: Registered protocol family 2
[    0.670000] IP route cache hash table entries: 131072 (order: 8, 1048576 bytes)
[    0.670000] TCP established hash table entries: 262144 (order: 10, 4194304 bytes)
[    0.670000] TCP bind hash table entries: 65536 (order: 8, 1048576 bytes)
[    0.670000] TCP: Hash tables configured (established 262144 bind 65536)
[    0.670000] TCP reno registered
[    0.240000] io scheduler noop registered
[    0.240000] io scheduler cfq registered (default)
[    0.240000] pci 0000:01:00.0: Boot video device
[    0.250000] Real Time Clock Driver v1.12ac
[    0.250000] Linux agpgart interface v0.103
[    0.250000] ACPI: PCI Interrupt 0000:03:00.0[A] -> GSI 19 (level, low) -> IRQ 19
[    0.250000] PCI: Setting latency timer of device 0000:03:00.0 to 64
[    0.250000] atl1 0000:03:00.0: version 2.0.7
[    0.390000] 8139too Fast Ethernet driver 0.9.28
[    0.390000] ACPI: PCI Interrupt 0000:05:02.0[A] -> GSI 23 (level, low) -> IRQ 23
[    0.390000] eth1: RealTek RTL8139 at 0xb800, 00:80:48:2e:06:2e, IRQ 23
[    0.390000] eth1:  Identified 8139 chip type 'RTL-8100B/8139D'
[    0.390000] netconsole: local port 6665
[    0.390000] netconsole: local IP 192.168.0.1
[    0.390000] netconsole: interface eth0
[    0.390000] netconsole: remote port 9353
[    0.390000] netconsole: remote IP 192.168.0.42
[    0.390000] netconsole: remote ethernet address 00:1b:38:af:22:49
[    0.390000] netconsole: device eth0 not up yet, forcing it
[    3.050000] atl1 0000:03:00.0: eth0 link is up 1000 Mbps full duplex
[    2.760000] console [netcon0] enabled
[    2.770000] netconsole: network logging started
[    2.770000] Driver 'sd' needs updating - please use bus_type methods
[    2.770000] ahci 0000:02:00.0: version 3.0
[    2.770000] ACPI: PCI Interrupt 0000:02:00.0[A] -> GSI 16 (level, low) -> IRQ 16
[    2.800000] ahci 0000:02:00.0: AHCI 0001.0000 32 slots 2 ports 3 Gbps 0x3 impl SATA mode
[    2.800000] ahci 0000:02:00.0: flags: 64bit ncq pm led clo pmp pio slum part 
[    2.800000] PCI: Setting latency timer of device 0000:02:00.0 to 64
[    2.800000] scsi0 : ahci
[    2.800000] scsi1 : ahci
[    2.800000] ata1: SATA max UDMA/133 abar m8192@0xfe8fe000 port 0xfe8fe100 irq 16
[    2.800000] ata2: SATA max UDMA/133 abar m8192@0xfe8fe000 port 0xfe8fe180 irq 16
[    2.910000] ata1: SATA link down (SStatus 0 SControl 300)
[    3.010000] ata2: SATA link down (SStatus 0 SControl 300)
[    3.010000] ata_piix 0000:00:1f.2: version 2.12
[    3.010000] ACPI: PCI Interrupt 0000:00:1f.2[B] -> GSI 19 (level, low) -> IRQ 19
[    3.010000] ata_piix 0000:00:1f.2: MAP [ P0 P2 P1 P3 ]
[    3.010000] PCI: Setting latency timer of device 0000:00:1f.2 to 64
[    3.010000] scsi2 : ata_piix
[    3.010000] scsi3 : ata_piix
[    3.020000] ata3: SATA max UDMA/133 cmd 0xec00 ctl 0xe880 bmdma 0xe400 irq 19
[    3.020000] ata4: SATA max UDMA/133 cmd 0xe800 ctl 0xe480 bmdma 0xe408 irq 19
[    3.130000] ata3: SATA link up 1.5 Gbps (SStatus 113 SControl 300)
[    3.270000] ata3.00: ATA-8: ST3750330AS, SD15, max UDMA/133
[    3.270000] ata3.00: 1465149168 sectors, multi 16: LBA48 NCQ (depth 0/32)
[    3.270000] ata3.01: ATA-7: ST3160811AS, 3.AAE, max UDMA/133
[    3.270000] ata3.01: 312581808 sectors, multi 16: LBA48 NCQ (depth 0/32)
[    3.290000] ata3.00: configured for UDMA/133
[    3.310000] ata3.01: configured for UDMA/133
[    3.230000] ata4: SATA link up 1.5 Gbps (SStatus 113 SControl 300)
[    3.330000] ata4.00: ATA-7: ST3250620AS, 3.AAE, max UDMA/133
[    3.330000] ata4.00: 488397168 sectors, multi 16: LBA48 NCQ (depth 0/32)
[    3.360000] ata4.00: configured for UDMA/133
[    3.240000] scsi 2:0:0:0: Direct-Access     ATA      ST3750330AS      SD15 PQ: 0 ANSI: 5
[    3.240000] sd 2:0:0:0: [sda] 1465149168 512-byte hardware sectors (750156 MB)
[    3.240000] sd 2:0:0:0: [sda] Write Protect is off
[    3.250000] sd 2:0:0:0: [sda] Mode Sense: 00 3a 00 00
[    3.250000] sd 2:0:0:0: [sda] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
[    3.250000] sd 2:0:0:0: [sda] 1465149168 512-byte hardware sectors (750156 MB)
[    3.250000] sd 2:0:0:0: [sda] Write Protect is off
[    3.250000] sd 2:0:0:0: [sda] Mode Sense: 00 3a 00 00
[    3.250000] sd 2:0:0:0: [sda] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
[    3.250000]  sda: sda1 sda2
[    3.370000] sd 2:0:0:0: [sda] Attached SCSI disk
[    3.370000] scsi 2:0:1:0: Direct-Access     ATA      ST3160811AS      3.AA PQ: 0 ANSI: 5
[    3.370000] sd 2:0:1:0: [sdb] 312581808 512-byte hardware sectors (160042 MB)
[    3.370000] sd 2:0:1:0: [sdb] Write Protect is off
[    3.370000] sd 2:0:1:0: [sdb] Mode Sense: 00 3a 00 00
[    3.370000] sd 2:0:1:0: [sdb] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
[    3.370000] sd 2:0:1:0: [sdb] 312581808 512-byte hardware sectors (160042 MB)
[    3.370000] sd 2:0:1:0: [sdb] Write Protect is off
[    3.370000] sd 2:0:1:0: [sdb] Mode Sense: 00 3a 00 00
[    3.370000] sd 2:0:1:0: [sdb] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
[    3.370000]  sdb: sdb4
[    3.390000]  sdb4: <openbsd: sdb5 sdb6 >
[    3.390000] sd 2:0:1:0: [sdb] Attached SCSI disk
[    3.390000] scsi 3:0:0:0: Direct-Access     ATA      ST3250620AS      3.AA PQ: 0 ANSI: 5
[    3.390000] sd 3:0:0:0: [sdc] 488397168 512-byte hardware sectors (250059 MB)
[    3.390000] sd 3:0:0:0: [sdc] Write Protect is off
[    3.390000] sd 3:0:0:0: [sdc] Mode Sense: 00 3a 00 00
[    3.390000] sd 3:0:0:0: [sdc] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
[    3.390000] sd 3:0:0:0: [sdc] 488397168 512-byte hardware sectors (250059 MB)
[    3.390000] sd 3:0:0:0: [sdc] Write Protect is off
[    3.390000] sd 3:0:0:0: [sdc] Mode Sense: 00 3a 00 00
[    3.390000] sd 3:0:0:0: [sdc] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
[    3.390000]  sdc: sdc1
[    3.410000] sd 3:0:0:0: [sdc] Attached SCSI disk
[    3.410000] ACPI: PCI Interrupt 0000:00:1f.5[B] -> GSI 19 (level, low) -> IRQ 19
[    3.410000] ata_piix 0000:00:1f.5: MAP [ P0 -- P1 -- ]
[    3.410000] PCI: Setting latency timer of device 0000:00:1f.5 to 64
[    3.410000] scsi4 : ata_piix
[    3.410000] scsi5 : ata_piix
[    3.420000] ata5: SATA max UDMA/133 cmd 0xd400 ctl 0xd080 bmdma 0xc880 irq 19
[    3.420000] ata6: SATA max UDMA/133 cmd 0xd000 ctl 0xcc00 bmdma 0xc888 irq 19
[    3.420000] ata5: SATA link down (SStatus 0 SControl 300)
[    3.520000] ata6: SATA link down (SStatus 0 SControl 300)
[    3.500000] ACPI: PCI Interrupt 0000:02:00.1[B] -> GSI 17 (level, low) -> IRQ 17
[    3.500000] PCI: Setting latency timer of device 0000:02:00.1 to 64
[    3.500000] scsi6 : pata_jmicron
[    3.500000] scsi7 : pata_jmicron
[    3.500000] ata7: PATA max UDMA/100 cmd 0xac00 ctl 0xa880 bmdma 0xa400 irq 17
[    3.500000] ata8: PATA max UDMA/100 cmd 0xa800 ctl 0xa480 bmdma 0xa408 irq 17
[    3.560000] ata7.01: ATAPI: Optiarc DVD RW AD-7173A, 1-01, max UDMA/66
[    3.600000] ata7.01: configured for UDMA/66
[    3.620000] scsi 6:0:1:0: CD-ROM            Optiarc  DVD RW AD-7173A  1-01 PQ: 0 ANSI: 5
[    3.620000] PNP: PS/2 Controller [PNP0303:PS2K] at 0x60,0x64 irq 1
[    3.620000] PNP: PS/2 appears to have AUX port disabled, if this is incorrect please boot with i8042.nopnp
[    3.620000] serio: i8042 KBD port at 0x60,0x64 irq 1
[    3.620000] mice: PS/2 mouse device common for all mice
[    3.620000] Advanced Linux Sound Architecture Driver Version 1.0.16rc2 (Thu Jan 31 16:40:16 2008 UTC).
[    3.620000] ACPI: PCI Interrupt 0000:00:1b.0[A] -> GSI 22 (level, low) -> IRQ 22
[    3.620000] PCI: Setting latency timer of device 0000:00:1b.0 to 64
[    3.630000] input: AT Translated Set 2 keyboard as /class/input/input0
[    3.880000] ALSA device list:
[    3.880000]   #0: HDA Intel at 0xfebf8000 irq 22
[    3.880000] TCP cubic registered
[    3.880000] NET: Registered protocol family 1
[    3.650000] kjournald starting.  Commit interval 5 seconds
[    3.910000] EXT3-fs: mounted filesystem with ordered data mode.
[    3.910000] VFS: Mounted root (ext3 filesystem) readonly.
[    3.910000] Freeing unused kernel memory: 228k freed
[    4.930000] Driver 'sr' needs updating - please use bus_type methods
[    5.480000] usbcore: registered new interface driver usbfs
[    5.480000] usbcore: registered new interface driver hub
[    5.480000] usbcore: registered new device driver usb
[    5.480000] sr0: scsi3-mmc drive: 48x/48x writer dvd-ram cd/rw xa/form2 cdda tray
[    5.480000] Uniform CD-ROM driver Revision: 3.20
[    5.480000] sr 6:0:1:0: Attached scsi CD-ROM sr0
[    5.480000] USB Universal Host Controller Interface driver v3.0
[    5.480000] ACPI: PCI Interrupt 0000:00:1a.0[A] -> GSI 16 (level, low) -> IRQ 16
[    5.480000] PCI: Setting latency timer of device 0000:00:1a.0 to 64
[    5.480000] uhci_hcd 0000:00:1a.0: UHCI Host Controller
[    5.480000] uhci_hcd 0000:00:1a.0: new USB bus registered, assigned bus number 1
[    5.480000] uhci_hcd 0000:00:1a.0: irq 16, io base 0x0000dc00
[    5.480000] usb usb1: configuration #1 chosen from 1 choice
[    5.480000] hub 1-0:1.0: USB hub found
[    5.480000] hub 1-0:1.0: 2 ports detected
[    5.590000] usb usb1: New USB device found, idVendor=1d6b, idProduct=0001
[    5.590000] usb usb1: New USB device strings: Mfr=3, Product=2, SerialNumber=1
[    5.590000] usb usb1: Product: UHCI Host Controller
[    5.590000] usb usb1: Manufacturer: Linux 2.6.25-rc8 uhci_hcd
[    5.590000] usb usb1: SerialNumber: 0000:00:1a.0
[    5.590000] ACPI: PCI Interrupt 0000:00:1a.1[B] -> GSI 17 (level, low) -> IRQ 17
[    5.590000] PCI: Setting latency timer of device 0000:00:1a.1 to 64
[    5.590000] uhci_hcd 0000:00:1a.1: UHCI Host Controller
[    5.590000] uhci_hcd 0000:00:1a.1: new USB bus registered, assigned bus number 2
[    5.590000] uhci_hcd 0000:00:1a.1: irq 17, io base 0x0000e000
[    5.590000] usb usb2: configuration #1 chosen from 1 choice
[    5.590000] hub 2-0:1.0: USB hub found
[    5.590000] hub 2-0:1.0: 2 ports detected
[    5.700000] usb usb2: New USB device found, idVendor=1d6b, idProduct=0001
[    5.700000] usb usb2: New USB device strings: Mfr=3, Product=2, SerialNumber=1
[    5.700000] usb usb2: Product: UHCI Host Controller
[    5.700000] usb usb2: Manufacturer: Linux 2.6.25-rc8 uhci_hcd
[    5.700000] usb usb2: SerialNumber: 0000:00:1a.1
[    5.700000] ACPI: PCI Interrupt 0000:00:1d.0[A] -> GSI 23 (level, low) -> IRQ 23
[    5.700000] PCI: Setting latency timer of device 0000:00:1d.0 to 64
[    5.700000] uhci_hcd 0000:00:1d.0: UHCI Host Controller
[    5.700000] uhci_hcd 0000:00:1d.0: new USB bus registered, assigned bus number 3
[    5.700000] uhci_hcd 0000:00:1d.0: irq 23, io base 0x0000d480
[    5.700000] usb usb3: configuration #1 chosen from 1 choice
[    5.700000] hub 3-0:1.0: USB hub found
[    5.700000] hub 3-0:1.0: 2 ports detected
[    5.770000] usb usb3: New USB device found, idVendor=1d6b, idProduct=0001
[    5.770000] usb usb3: New USB device strings: Mfr=3, Product=2, SerialNumber=1
[    5.770000] usb usb3: Product: UHCI Host Controller
[    5.770000] usb usb3: Manufacturer: Linux 2.6.25-rc8 uhci_hcd
[    5.770000] usb usb3: SerialNumber: 0000:00:1d.0
[    5.770000] ACPI: PCI Interrupt 0000:00:1d.1[B] -> GSI 19 (level, low) -> IRQ 19
[    5.770000] PCI: Setting latency timer of device 0000:00:1d.1 to 64
[    5.770000] uhci_hcd 0000:00:1d.1: UHCI Host Controller
[    5.770000] uhci_hcd 0000:00:1d.1: new USB bus registered, assigned bus number 4
[    5.770000] uhci_hcd 0000:00:1d.1: irq 19, io base 0x0000d800
[    5.770000] usb usb4: configuration #1 chosen from 1 choice
[    5.770000] hub 4-0:1.0: USB hub found
[    5.770000] hub 4-0:1.0: 2 ports detected
[    5.810000] usb usb4: New USB device found, idVendor=1d6b, idProduct=0001
[    5.810000] usb usb4: New USB device strings: Mfr=3, Product=2, SerialNumber=1
[    5.810000] usb usb4: Product: UHCI Host Controller
[    5.810000] usb usb4: Manufacturer: Linux 2.6.25-rc8 uhci_hcd
[    5.810000] usb usb4: SerialNumber: 0000:00:1d.1
[    5.810000] ACPI: PCI Interrupt 0000:00:1d.2[C] -> GSI 18 (level, low) -> IRQ 18
[    5.810000] PCI: Setting latency timer of device 0000:00:1d.2 to 64
[    5.810000] uhci_hcd 0000:00:1d.2: UHCI Host Controller
[    5.810000] uhci_hcd 0000:00:1d.2: new USB bus registered, assigned bus number 5
[    5.810000] uhci_hcd 0000:00:1d.2: irq 18, io base 0x0000d880
[    5.810000] usb usb5: configuration #1 chosen from 1 choice
[    5.810000] hub 5-0:1.0: USB hub found
[    5.810000] hub 5-0:1.0: 2 ports detected
[    5.860000] usb usb5: New USB device found, idVendor=1d6b, idProduct=0001
[    5.860000] usb usb5: New USB device strings: Mfr=3, Product=2, SerialNumber=1
[    5.860000] usb usb5: Product: UHCI Host Controller
[    5.860000] usb usb5: Manufacturer: Linux 2.6.25-rc8 uhci_hcd
[    5.860000] usb usb5: SerialNumber: 0000:00:1d.2
[    5.860000] ACPI: PCI Interrupt 0000:00:1a.7[C] -> GSI 18 (level, low) -> IRQ 18
[    5.860000] PCI: Setting latency timer of device 0000:00:1a.7 to 64
[    5.860000] ehci_hcd 0000:00:1a.7: EHCI Host Controller
[    5.860000] ehci_hcd 0000:00:1a.7: new USB bus registered, assigned bus number 6
[    5.860000] ehci_hcd 0000:00:1a.7: debug port 1
[    5.860000] PCI: cache line size of 32 is not supported by device 0000:00:1a.7
[    5.860000] ehci_hcd 0000:00:1a.7: irq 18, io mem 0xfebffc00
[    5.880000] ehci_hcd 0000:00:1a.7: USB 2.0 started, EHCI 1.00, driver 10 Dec 2004
[    5.880000] usb usb6: configuration #1 chosen from 1 choice
[    5.880000] hub 6-0:1.0: USB hub found
[    5.880000] hub 6-0:1.0: 4 ports detected
[    5.970000] usb usb6: New USB device found, idVendor=1d6b, idProduct=0002
[    5.970000] usb usb6: New USB device strings: Mfr=3, Product=2, SerialNumber=1
[    5.970000] usb usb6: Product: EHCI Host Controller
[    5.970000] usb usb6: Manufacturer: Linux 2.6.25-rc8 ehci_hcd
[    5.970000] usb usb6: SerialNumber: 0000:00:1a.7
[    5.970000] ACPI: PCI Interrupt 0000:00:1d.7[A] -> GSI 23 (level, low) -> IRQ 23
[    5.970000] PCI: Setting latency timer of device 0000:00:1d.7 to 64
[    5.970000] ehci_hcd 0000:00:1d.7: EHCI Host Controller
[    5.970000] ehci_hcd 0000:00:1d.7: new USB bus registered, assigned bus number 7
[    5.970000] ehci_hcd 0000:00:1d.7: debug port 1
[    5.970000] PCI: cache line size of 32 is not supported by device 0000:00:1d.7
[    5.970000] ehci_hcd 0000:00:1d.7: irq 23, io mem 0xfebff800
[    5.990000] ehci_hcd 0000:00:1d.7: USB 2.0 started, EHCI 1.00, driver 10 Dec 2004
[    5.990000] usb usb7: configuration #1 chosen from 1 choice
[    5.990000] hub 7-0:1.0: USB hub found
[    5.990000] hub 7-0:1.0: 6 ports detected
[    6.040000] usb usb7: New USB device found, idVendor=1d6b, idProduct=0002
[    6.040000] usb usb7: New USB device strings: Mfr=3, Product=2, SerialNumber=1
[    6.040000] usb usb7: Product: EHCI Host Controller
[    6.040000] usb usb7: Manufacturer: Linux 2.6.25-rc8 ehci_hcd
[    6.040000] usb usb7: SerialNumber: 0000:00:1d.7
[    6.870000] EXT3 FS on sda2, internal journal
[    7.130000] Adding 9775512k swap on /dev/sda1.  Priority:-1 extents:1 across:9775512k
[   12.610000] ip_tables: (C) 2000-2006 Netfilter Core Team
[   12.620000] nf_conntrack version 0.5.0 (16384 buckets, 65536 max)
[   13.000000] eth1: link up, 100Mbps, full-duplex, lpa 0x45E1


^ permalink raw reply	[flat|nested] 71+ messages in thread

* Re: 2.6.25-rc8-mm2: boot hang after "ACPI: using IOAPIC for interrupt routing"
  2008-04-11  6:29 ` Alexey Dobriyan
@ 2008-04-11  6:43   ` Pekka Enberg
  2008-04-11 10:35   ` Pekka Enberg
  1 sibling, 0 replies; 71+ messages in thread
From: Pekka Enberg @ 2008-04-11  6:43 UTC (permalink / raw
  To: Alexey Dobriyan; +Cc: Andrew Morton, linux-kernel, Christoph Lameter

Hi Alexey,

On Thu, Apr 10, 2008 at 08:33:54PM -0700, Andrew Morton wrote:
>> - git-slub has been temporarily replaced by git-pekka.  Pekka is standing in
>>   while Christoph is away.

Alexey Dobriyan wrote:
> I bisected boot hang after "ACPI: using IOAPIC for interrupt routing"
> down to git-pekka.
> 
> normal dmesg from 2.6.25-rc8-something and .config snippets for -mm

That's odd. I don't immediately see anything there that can cause a 
this... Can you see the hang with the 'for-mm' branch of my tree:

   git://git.kernel.org/pub/scm/linux/kernel/git/penberg/slab-2.6.git

If so, can you do a git bisect? Does sysrq-t tell us anything?

			Pekka


^ permalink raw reply	[flat|nested] 71+ messages in thread

* Re: 2.6.25-rc8-mm2
  2008-04-11  3:33 2.6.25-rc8-mm2 Andrew Morton
  2008-04-11  6:28 ` 2.6.25-rc8-mm2: boot hang after "ACPI: using IOAPIC for interrupt routing" Alexey Dobriyan
  2008-04-11  6:29 ` Alexey Dobriyan
@ 2008-04-11  9:57 ` KAMEZAWA Hiroyuki
  2008-04-11 10:23   ` 2.6.25-rc8-mm2 KAMEZAWA Hiroyuki
  2008-04-11 13:40 ` 2.6.25-rc8-mm2 Adrian Bunk
                   ` (8 subsequent siblings)
  11 siblings, 1 reply; 71+ messages in thread
From: KAMEZAWA Hiroyuki @ 2008-04-11  9:57 UTC (permalink / raw
  To: Andrew Morton; +Cc: linux-kernel, penberg

On Thu, 10 Apr 2008 20:33:54 -0700
Andrew Morton <akpm@linux-foundation.org> wrote:

> 
> ftp://ftp.kernel.org/pub/linux/kernel/people/akpm/patches/2.6/2.6.25-rc8/2.6.25-rc8-mm2/
> 
> - Compilation is busted on mips due to the page-flags patches
> 
On ia64/NUMA box (which has empty nodes.)

CONFIG_SLAB ..... booted well
CONFIG_SLUB ..... can't boot
CONFIG_SLUB + CONFIG_SLUB_DEBUG_ON .... booted. Hmm?

I'll dig more if I can.

2.6.25-rc8-mm1 had no troubles.

Thanks,
-Kame


^ permalink raw reply	[flat|nested] 71+ messages in thread

* Re: 2.6.25-rc8-mm2
  2008-04-11  9:57 ` 2.6.25-rc8-mm2 KAMEZAWA Hiroyuki
@ 2008-04-11 10:23   ` KAMEZAWA Hiroyuki
  2008-04-11 10:34     ` 2.6.25-rc8-mm2 Pekka Enberg
  0 siblings, 1 reply; 71+ messages in thread
From: KAMEZAWA Hiroyuki @ 2008-04-11 10:23 UTC (permalink / raw
  To: KAMEZAWA Hiroyuki; +Cc: Andrew Morton, linux-kernel, penberg

On Fri, 11 Apr 2008 18:57:03 +0900
KAMEZAWA Hiroyuki <kamezawa.hiroyu@jp.fujitsu.com> wrote:

> On Thu, 10 Apr 2008 20:33:54 -0700
> Andrew Morton <akpm@linux-foundation.org> wrote:
> 
> > 
> > ftp://ftp.kernel.org/pub/linux/kernel/people/akpm/patches/2.6/2.6.25-rc8/2.6.25-rc8-mm2/
> > 
> > - Compilation is busted on mips due to the page-flags patches
> > 
> On ia64/NUMA box (which has empty nodes.)
> 
> CONFIG_SLAB ..... booted well
> CONFIG_SLUB ..... can't boot
> CONFIG_SLUB + CONFIG_SLUB_DEBUG_ON .... booted. Hmm?
> 
> I'll dig more if I can.
> 
with slub_nomerge , booted.

Thanks,
-Kame


^ permalink raw reply	[flat|nested] 71+ messages in thread

* Re: 2.6.25-rc8-mm2
  2008-04-11 10:23   ` 2.6.25-rc8-mm2 KAMEZAWA Hiroyuki
@ 2008-04-11 10:34     ` Pekka Enberg
  2008-04-11 10:57       ` 2.6.25-rc8-mm2 KAMEZAWA Hiroyuki
  0 siblings, 1 reply; 71+ messages in thread
From: Pekka Enberg @ 2008-04-11 10:34 UTC (permalink / raw
  To: KAMEZAWA Hiroyuki; +Cc: Andrew Morton, linux-kernel, Christoph Lameter

On Fri, 2008-04-11 at 19:23 +0900, KAMEZAWA Hiroyuki wrote:
> On Fri, 11 Apr 2008 18:57:03 +0900
> KAMEZAWA Hiroyuki <kamezawa.hiroyu@jp.fujitsu.com> wrote:
> 
> > On Thu, 10 Apr 2008 20:33:54 -0700
> > Andrew Morton <akpm@linux-foundation.org> wrote:
> > 
> > > 
> > >
> ftp://ftp.kernel.org/pub/linux/kernel/people/akpm/patches/2.6/2.6.25-rc8/2.6.25-rc8-mm2/
> > > 
> > > - Compilation is busted on mips due to the page-flags patches
> > > 
> > On ia64/NUMA box (which has empty nodes.)
> > 
> > CONFIG_SLAB ..... booted well
> > CONFIG_SLUB ..... can't boot
> > CONFIG_SLUB + CONFIG_SLUB_DEBUG_ON .... booted. Hmm?
> > 
> > I'll dig more if I can.
> > 
> with slub_nomerge , booted.

What happens when it doesn't boot? Does it hang or do you get an oops?
Can you reproduce it with the 'for-mm' branch of:

 git://git.kernel.org/pub/scm/linux/kernel/git/penberg/slab-2.6.git

			Pekka


^ permalink raw reply	[flat|nested] 71+ messages in thread

* Re: 2.6.25-rc8-mm2: boot hang after "ACPI: using IOAPIC for interrupt routing"
  2008-04-11  6:29 ` Alexey Dobriyan
  2008-04-11  6:43   ` Pekka Enberg
@ 2008-04-11 10:35   ` Pekka Enberg
  2008-04-11 21:07     ` Alexey Dobriyan
  1 sibling, 1 reply; 71+ messages in thread
From: Pekka Enberg @ 2008-04-11 10:35 UTC (permalink / raw
  To: Alexey Dobriyan
  Cc: Andrew Morton, linux-kernel, Christoph Lameter, KAMEZAWA Hiroyuki

On Fri, 2008-04-11 at 10:29 +0400, Alexey Dobriyan wrote:
> On Thu, Apr 10, 2008 at 08:33:54PM -0700, Andrew Morton wrote:
> > - git-slub has been temporarily replaced by git-pekka.  Pekka is standing in
> >   while Christoph is away.
> 
> I bisected boot hang after "ACPI: using IOAPIC for interrupt routing"
> down to git-pekka.
> 
> normal dmesg from 2.6.25-rc8-something and .config snippets for -mm

Alexey, can you try passing the 'slub_nomerge' option to the kernel to
see if the hang goes away with that?

			Pekka


^ permalink raw reply	[flat|nested] 71+ messages in thread

* Re: 2.6.25-rc8-mm2
  2008-04-11 10:34     ` 2.6.25-rc8-mm2 Pekka Enberg
@ 2008-04-11 10:57       ` KAMEZAWA Hiroyuki
  2008-04-11 11:17         ` 2.6.25-rc8-mm2 KAMEZAWA Hiroyuki
  0 siblings, 1 reply; 71+ messages in thread
From: KAMEZAWA Hiroyuki @ 2008-04-11 10:57 UTC (permalink / raw
  To: Pekka Enberg; +Cc: Andrew Morton, linux-kernel, Christoph Lameter

On Fri, 11 Apr 2008 13:34:18 +0300
Pekka Enberg <penberg@cs.helsinki.fi> wrote:
> > ftp://ftp.kernel.org/pub/linux/kernel/people/akpm/patches/2.6/2.6.25-rc8/2.6.25-rc8-mm2/
> > > > 
> > > > - Compilation is busted on mips due to the page-flags patches
> > > > 
> > > On ia64/NUMA box (which has empty nodes.)
> > > 
> > > CONFIG_SLAB ..... booted well
> > > CONFIG_SLUB ..... can't boot
> > > CONFIG_SLUB + CONFIG_SLUB_DEBUG_ON .... booted. Hmm?
> > > 
> > > I'll dig more if I can.
> > > 
> > with slub_nomerge , booted.
> 
> What happens when it doesn't boot? Does it hang or do you get an oops?
just hangs. before showing anything.


> Can you reproduce it with the 'for-mm' branch of:
> 
>  git://git.kernel.org/pub/scm/linux/kernel/git/penberg/slab-2.6.git
> 

will try. 

Thanks,
-Kame


^ permalink raw reply	[flat|nested] 71+ messages in thread

* Re: 2.6.25-rc8-mm2
  2008-04-11 10:57       ` 2.6.25-rc8-mm2 KAMEZAWA Hiroyuki
@ 2008-04-11 11:17         ` KAMEZAWA Hiroyuki
  2008-04-11 13:17           ` 2.6.25-rc8-mm2 KAMEZAWA Hiroyuki
  0 siblings, 1 reply; 71+ messages in thread
From: KAMEZAWA Hiroyuki @ 2008-04-11 11:17 UTC (permalink / raw
  To: KAMEZAWA Hiroyuki
  Cc: Pekka Enberg, Andrew Morton, linux-kernel, Christoph Lameter

On Fri, 11 Apr 2008 19:57:38 +0900
KAMEZAWA Hiroyuki <kamezawa.hiroyu@jp.fujitsu.com> wrote:

> > What happens when it doesn't boot? Does it hang or do you get an oops?
> just hangs. before showing anything.
> 
> 
> > Can you reproduce it with the 'for-mm' branch of:
> > 
> >  git://git.kernel.org/pub/scm/linux/kernel/git/penberg/slab-2.6.git
> > 
> 
> will try. 
> 
slab-2.6.git booted well :(
Hmm, It seems I have to dig somewhere different...

Thanks,
-Kame 


^ permalink raw reply	[flat|nested] 71+ messages in thread

* Re: 2.6.25-rc8-mm2
  2008-04-11 11:17         ` 2.6.25-rc8-mm2 KAMEZAWA Hiroyuki
@ 2008-04-11 13:17           ` KAMEZAWA Hiroyuki
  2008-04-11 14:24             ` 2.6.25-rc8-mm2 Pekka Enberg
  0 siblings, 1 reply; 71+ messages in thread
From: KAMEZAWA Hiroyuki @ 2008-04-11 13:17 UTC (permalink / raw
  To: KAMEZAWA Hiroyuki
  Cc: Pekka Enberg, Andrew Morton, linux-kernel, Christoph Lameter

On Fri, 11 Apr 2008 20:17:24 +0900
KAMEZAWA Hiroyuki <kamezawa.hiroyu@jp.fujitsu.com> wrote:
> slab-2.6.git booted well :(
> Hmm, It seems I have to dig somewhere different...
> 
Sorry, I tested *master* branch ;), under *testing* branch it reproduced.

bisected. (see below)

I'm sorry I can't use my box for next 2 days. I can test possible fix
on Monday (in Japan). 

==bisect result==
831d78b552aade2c383cf8d75b180dd35f81a4e3 is first bad commit
commit 831d78b552aade2c383cf8d75b180dd35f81a4e3
Author: Christoph Lameter <clameter@sgi.com>
Date:   Tue Apr 8 22:26:30 2008 +0300

    SLUB: Add KICKABLE to avoid repeated kick() attempts

    Add a flag KICKABLE to be set on slabs with a defragmentation method

    Clear the flag if a kick action is not successful in reducing the
    number of objects in a slab. This will avoid future attempts to
    kick objects out.

    The KICKABLE flag is set again when all objects of the slab have been
    allocated (Occurs during removal of a slab from the partial lists).

    Reviewed-by: Rik van Riel <riel@redhat.com>
    Signed-off-by: Christoph Lameter <clameter@sgi.com>
    Signed-off-by: Pekka Enberg <penberg@cs.helsinki.fi>

:040000 040000 b75802b6e7f1f27589f8242a824a4a24773a56d4 c935aae1a02277f0e605069d397d315f6db430b6 M      mm

== bisect log ==

git-bisect start
# good: [0e81a8ae37687845f7cdfa2adce14ea6a5f1dd34] Linux 2.6.25-rc8
git-bisect good 0e81a8ae37687845f7cdfa2adce14ea6a5f1dd34
# bad: [28e4b71a66881df1ac343f13d06395fa01021e8e] slub: use typedefs for ->get and ->kick functions
git-bisect bad 28e4b71a66881df1ac343f13d06395fa01021e8e
# good: [9597362d354f8655ece324b01d0c640a0e99c077] Merge git://git.kernel.org/pub/scm/linux/kernel/git/gregkh/usb-2.6
git-bisect good 9597362d354f8655ece324b01d0c640a0e99c077
# good: [28b8383d5d4d9b636c3734c993563bafdc2ab3c3] Merge git://git.kernel.org/pub/scm/linux/kernel/git/mchehab/v4l-dvb
git-bisect good 28b8383d5d4d9b636c3734c993563bafdc2ab3c3
# good: [869ab5147e1eead890245cfd4f652ba282b6ac26] SELinux: more GFP_NOFS fixups to prevent selinux from re-entering the fs code
git-bisect good 869ab5147e1eead890245cfd4f652ba282b6ac26
# good: [acd49c885e03f087c31f49e7c42ccb8befbf4009] slub: Make the order configurable for each slab cache
git-bisect good acd49c885e03f087c31f49e7c42ccb8befbf4009
# skip: [24337bca6e77ab48f459e35690b32ef20a34bda5] SLUB: Replace ctor field with ops field in /sys/slab/*
git-bisect skip 24337bca6e77ab48f459e35690b32ef20a34bda5
# good: [66892337435a0d88996057af221e8c18ff91bc14] SLUB: Add get() and kick() methods
git-bisect good 66892337435a0d88996057af221e8c18ff91bc14
# bad: [171250363fe803b4dc61301276c2693cce3e5684] SLUB: Extend slabinfo to support -D and -F options
git-bisect bad 171250363fe803b4dc61301276c2693cce3e5684
# good: [c963d891d875a9bd39ae44da623c421bc0140937] SLUB: Slab defrag core
git-bisect good c963d891d875a9bd39ae44da623c421bc0140937
# bad: [831d78b552aade2c383cf8d75b180dd35f81a4e3] SLUB: Add KICKABLE to avoid repeated kick() attempts
git-bisect bad 831d78b552aade2c383cf8d75b180dd35f81a4e3
==

Thanks,
-Kame


^ permalink raw reply	[flat|nested] 71+ messages in thread

* Re: 2.6.25-rc8-mm2
  2008-04-11  3:33 2.6.25-rc8-mm2 Andrew Morton
                   ` (2 preceding siblings ...)
  2008-04-11  9:57 ` 2.6.25-rc8-mm2 KAMEZAWA Hiroyuki
@ 2008-04-11 13:40 ` Adrian Bunk
  2008-04-12  4:22   ` 2.6.25-rc8-mm2 Andrew Morton
  2008-04-11 23:43 ` 2.6.25-rc8-mm2: panic involving mount_block_root and down the road Alexey Dobriyan
                   ` (7 subsequent siblings)
  11 siblings, 1 reply; 71+ messages in thread
From: Adrian Bunk @ 2008-04-11 13:40 UTC (permalink / raw
  To: Andrew Morton; +Cc: linux-kernel

On Thu, Apr 10, 2008 at 08:33:54PM -0700, Andrew Morton wrote:
>...
> Changes since 2.6.25-rc8-mm1:
>...
> +ext4-is-busted-on-m68k.patch
> 
>  ext4 doesn't build on m68k
>...

Is this due to the generic_find_next_le_bit compile error I reported as 
2.6.25 regression (there seems to be some discussion recently how to 
fix it - hopefully for 2.6.25) or is there even more breakage in -mm?

cu
Adrian

-- 

       "Is there not promise of rain?" Ling Tan asked suddenly out
        of the darkness. There had been need of rain for many days.
       "Only a promise," Lao Er said.
                                       Pearl S. Buck - Dragon Seed


^ permalink raw reply	[flat|nested] 71+ messages in thread

* Re: 2.6.25-rc8-mm2
  2008-04-11 13:17           ` 2.6.25-rc8-mm2 KAMEZAWA Hiroyuki
@ 2008-04-11 14:24             ` Pekka Enberg
  2008-04-14  3:13               ` 2.6.25-rc8-mm2 KAMEZAWA Hiroyuki
  0 siblings, 1 reply; 71+ messages in thread
From: Pekka Enberg @ 2008-04-11 14:24 UTC (permalink / raw
  To: KAMEZAWA Hiroyuki; +Cc: Andrew Morton, linux-kernel, Christoph Lameter

On Fri, 2008-04-11 at 22:17 +0900, KAMEZAWA Hiroyuki wrote:
> On Fri, 11 Apr 2008 20:17:24 +0900
> KAMEZAWA Hiroyuki <kamezawa.hiroyu@jp.fujitsu.com> wrote:
> > slab-2.6.git booted well :(
> > Hmm, It seems I have to dig somewhere different...
> > 
> Sorry, I tested *master* branch ;), under *testing* branch it reproduced.
> 
> bisected. (see below)
> 
> I'm sorry I can't use my box for next 2 days. I can test possible fix
> on Monday (in Japan). 
> 
> ==bisect result==
> 831d78b552aade2c383cf8d75b180dd35f81a4e3 is first bad commit
> commit 831d78b552aade2c383cf8d75b180dd35f81a4e3
> Author: Christoph Lameter <clameter@sgi.com>
> Date:   Tue Apr 8 22:26:30 2008 +0300
> 
>     SLUB: Add KICKABLE to avoid repeated kick() attempts
> 
>     Add a flag KICKABLE to be set on slabs with a defragmentation method
> 
>     Clear the flag if a kick action is not successful in reducing the
>     number of objects in a slab. This will avoid future attempts to
>     kick objects out.
> 
>     The KICKABLE flag is set again when all objects of the slab have been
>     allocated (Occurs during removal of a slab from the partial lists).
> 
>     Reviewed-by: Rik van Riel <riel@redhat.com>
>     Signed-off-by: Christoph Lameter <clameter@sgi.com>
>     Signed-off-by: Pekka Enberg <penberg@cs.helsinki.fi>

My bad, sorry. Fixed and pushed out.

			Pekka

>From 7c7e7e5e7ec07c0a47705b2d21c779c39ba02252 Mon Sep 17 00:00:00 2001
From: Pekka Enberg <penberg@cs.helsinki.fi>
Date: Fri, 11 Apr 2008 17:17:43 +0300
Subject: [PATCH] slub: add missing slab_unlock() to __kmem_cache_shrink()

If page is not kickable, remember to slab_unlock() before continuing the loop.

Signed-off-by: Pekka Enberg <penberg@cs.helsinki.fi>
---
 mm/slub.c |    4 +++-
 1 files changed, 3 insertions(+), 1 deletions(-)

diff --git a/mm/slub.c b/mm/slub.c
index 4b694a7..f09f1fb 100644
--- a/mm/slub.c
+++ b/mm/slub.c
@@ -2926,8 +2926,10 @@ static unsigned long __kmem_cache_shrink(struct kmem_cache *s, int node,
 			continue;
 
 		if (page->inuse) {
-			if (!SlabKickable(page))
+			if (!SlabKickable(page)) {
+				slab_unlock(page);
 				continue;
+			}
 
 			if (page->inuse * 100 >=
 					s->defrag_ratio * page->objects) {
-- 
1.5.2.5




^ permalink raw reply related	[flat|nested] 71+ messages in thread

* Re: 2.6.25-rc8-mm2: boot hang after "ACPI: using IOAPIC for interrupt routing"
  2008-04-11 10:35   ` Pekka Enberg
@ 2008-04-11 21:07     ` Alexey Dobriyan
  2008-04-11 21:25       ` Pekka J Enberg
  0 siblings, 1 reply; 71+ messages in thread
From: Alexey Dobriyan @ 2008-04-11 21:07 UTC (permalink / raw
  To: Pekka Enberg
  Cc: Andrew Morton, linux-kernel, Christoph Lameter, KAMEZAWA Hiroyuki

On Fri, Apr 11, 2008 at 01:35:28PM +0300, Pekka Enberg wrote:
> On Fri, 2008-04-11 at 10:29 +0400, Alexey Dobriyan wrote:
> > On Thu, Apr 10, 2008 at 08:33:54PM -0700, Andrew Morton wrote:
> > > - git-slub has been temporarily replaced by git-pekka.  Pekka is standing in
> > >   while Christoph is away.
> > 
> > I bisected boot hang after "ACPI: using IOAPIC for interrupt routing"
> > down to git-pekka.
> > 
> > normal dmesg from 2.6.25-rc8-something and .config snippets for -mm
> 
> Alexey, can you try passing the 'slub_nomerge' option to the kernel to
> see if the hang goes away with that?

nomerge doesn't help as well as turning on combinations of SLUB debug options.


^ permalink raw reply	[flat|nested] 71+ messages in thread

* Re: 2.6.25-rc8-mm2: boot hang after "ACPI: using IOAPIC for interrupt routing"
  2008-04-11 21:07     ` Alexey Dobriyan
@ 2008-04-11 21:25       ` Pekka J Enberg
  2008-04-11 23:09         ` Alexey Dobriyan
  0 siblings, 1 reply; 71+ messages in thread
From: Pekka J Enberg @ 2008-04-11 21:25 UTC (permalink / raw
  To: Alexey Dobriyan
  Cc: Andrew Morton, linux-kernel, Christoph Lameter, KAMEZAWA Hiroyuki

On Sat, 12 Apr 2008, Alexey Dobriyan wrote:
> > Alexey, can you try passing the 'slub_nomerge' option to the kernel to
> > see if the hang goes away with that?
> 
> nomerge doesn't help as well as turning on combinations of SLUB debug options.

Does the following patch fix it?

			Pekka

>From 7c7e7e5e7ec07c0a47705b2d21c779c39ba02252 Mon Sep 17 00:00:00 2001
From: Pekka Enberg <penberg@cs.helsinki.fi>
Date: Fri, 11 Apr 2008 17:17:43 +0300
Subject: [PATCH] slub: add missing slab_unlock() to __kmem_cache_shrink()

If page is not kickable, remember to slab_unlock() before continuing the loop.

Signed-off-by: Pekka Enberg <penberg@cs.helsinki.fi>
---
 mm/slub.c |    4 +++-
 1 files changed, 3 insertions(+), 1 deletions(-)

diff --git a/mm/slub.c b/mm/slub.c
index 4b694a7..f09f1fb 100644
--- a/mm/slub.c
+++ b/mm/slub.c
@@ -2926,8 +2926,10 @@ static unsigned long __kmem_cache_shrink(struct kmem_cache *s, int node,
 			continue;
 
 		if (page->inuse) {
-			if (!SlabKickable(page))
+			if (!SlabKickable(page)) {
+				slab_unlock(page);
 				continue;
+			}
 
 			if (page->inuse * 100 >=
 					s->defrag_ratio * page->objects) {
-- 
1.5.2.5




^ permalink raw reply related	[flat|nested] 71+ messages in thread

* Re: 2.6.25-rc8-mm2: boot hang after "ACPI: using IOAPIC for interrupt routing"
  2008-04-11 21:25       ` Pekka J Enberg
@ 2008-04-11 23:09         ` Alexey Dobriyan
  0 siblings, 0 replies; 71+ messages in thread
From: Alexey Dobriyan @ 2008-04-11 23:09 UTC (permalink / raw
  To: Pekka J Enberg
  Cc: Andrew Morton, linux-kernel, Christoph Lameter, KAMEZAWA Hiroyuki

On Sat, Apr 12, 2008 at 12:25:21AM +0300, Pekka J Enberg wrote:
> On Sat, 12 Apr 2008, Alexey Dobriyan wrote:
> > > Alexey, can you try passing the 'slub_nomerge' option to the kernel to
> > > see if the hang goes away with that?
> > 
> > nomerge doesn't help as well as turning on combinations of SLUB debug options.
> 
> Does the following patch fix it?
> 
> 			Pekka
> 
> From 7c7e7e5e7ec07c0a47705b2d21c779c39ba02252 Mon Sep 17 00:00:00 2001
> From: Pekka Enberg <penberg@cs.helsinki.fi>
> Date: Fri, 11 Apr 2008 17:17:43 +0300
> Subject: [PATCH] slub: add missing slab_unlock() to __kmem_cache_shrink()
> 
> If page is not kickable, remember to slab_unlock() before continuing the loop.
> 
> Signed-off-by: Pekka Enberg <penberg@cs.helsinki.fi>
> ---
>  mm/slub.c |    4 +++-
>  1 files changed, 3 insertions(+), 1 deletions(-)
> 
> diff --git a/mm/slub.c b/mm/slub.c
> index 4b694a7..f09f1fb 100644
> --- a/mm/slub.c
> +++ b/mm/slub.c
> @@ -2926,8 +2926,10 @@ static unsigned long __kmem_cache_shrink(struct kmem_cache *s, int node,
>  			continue;
>  
>  		if (page->inuse) {
> -			if (!SlabKickable(page))
> +			if (!SlabKickable(page)) {
> +				slab_unlock(page);
>  				continue;
> +			}

Yes, it helps. Now I have some more bugs to report. :-(


^ permalink raw reply	[flat|nested] 71+ messages in thread

* 2.6.25-rc8-mm2: panic involving mount_block_root and down the road
  2008-04-11  3:33 2.6.25-rc8-mm2 Andrew Morton
                   ` (3 preceding siblings ...)
  2008-04-11 13:40 ` 2.6.25-rc8-mm2 Adrian Bunk
@ 2008-04-11 23:43 ` Alexey Dobriyan
  2008-04-12  0:59   ` 2.6.25-rc8-mm2: CONFIG_ATA_SFF: " Alexey Dobriyan
  2008-04-12  5:53 ` 2.6.25-rc8-mm2 Valdis.Kletnieks
                   ` (6 subsequent siblings)
  11 siblings, 1 reply; 71+ messages in thread
From: Alexey Dobriyan @ 2008-04-11 23:43 UTC (permalink / raw
  To: Andrew Morton; +Cc: linux-kernel

Pekka fixed SLUB for me, and now core2 box survives up and including to
not finding / :

Setup is SATA disk with plain old partitions, nothing lvmancy:

	/dev/sda2 on / type ext3 (rw,noatime)

CONFIG_ATA=y
CONFIG_ATA_ACPI=y
CONFIG_SATA_AHCI=y
CONFIG_ATA_PIIX=y
CONFIG_PATA_JMICRON=y

sda1 is for swap.


[    3.920000] NET: Registered protocol family 1
[    3.920000] VFS: Cannot open root device "sda2" or unknown-block(0,0)
[    3.920000] Please append a correct "root=" boot option; here are the available partitions:
[    3.920000] Kernel panic - not syncing: VFS: Unable to mount root fs on unknown-block(0,0)
[    3.920000] Pid: 1, comm: swapper Not tainted 2.6.25-rc8-mm2 #19
[    3.920000] 
[    3.920000] Call Trace:
[    3.920000]  [<ffffffff8022ff80>] panic+0xa0/0x180
[    3.920000]  [<ffffffff8044d8c9>] ? mutex_unlock+0x9/0x10
[    3.920000]  [<ffffffff805d8e03>] ? printk_all_partitions+0x23/0x180
[    3.920000]  [<ffffffff8044d8c9>] ? mutex_unlock+0x9/0x10
[    3.920000]  [<ffffffff805d8efc>] ? printk_all_partitions+0x11c/0x180
[    3.920000]  [<ffffffff8044ee46>] ? _read_unlock+0x26/0x30
[    3.920000]  [<ffffffff805c8e82>] mount_block_root+0x102/0x2a0
[    3.920000]  [<ffffffff805c9076>] mount_root+0x56/0x60
[    3.920000]  [<ffffffff805c90cc>] prepare_namespace+0x4c/0x160
[    3.920000]  [<ffffffff805c8bce>] kernel_init+0x23e/0x2f0
[    3.920000]  [<ffffffff80228b17>] ? finish_task_switch+0x67/0xe0
[    3.920000]  [<ffffffff8020c358>] child_rip+0xa/0x12
[    3.920000]  [<ffffffff803184d2>] ? acpi_os_acquire_lock+0x9/0xb
[    3.920000]  [<ffffffff805c8990>] ? kernel_init+0x0/0x2f0
[    3.920000]  [<ffffffff8020c34e>] ? child_rip+0x0/0x12


The very same sequence with lockdep on (I originally thought it's the
culprit):


[   12.770000] eth1: link up, 100Mbps, full-duplex, lpa 0x45E1
[   17.820000] atl1 0000:03:00.0: eth0 link is up 1000 Mbps full duplex
[    3.500000] ahci 0000:02:00.0: AHCI 0001.0000 32 slots 2 ports 3 Gbps 0x3 impl SATA mode
[    3.500000] ahci 0000:02:00.0: flags: 64bit ncq pm led clo pmp pio slum part 
[    3.500000] PCI: Setting latency timer of device 0000:02:00.0 to 64
[    3.500000] scsi0 : ahci
[    3.500000] scsi1 : ahci
[    3.500000] ata1: SATA max UDMA/133 abar m8192@0xfe8fe000 port 0xfe8fe100 irq 16
[    3.500000] ata2: SATA max UDMA/133 abar m8192@0xfe8fe000 port 0xfe8fe180 irq 16
[    3.500000] INFO: trying to register non-static key.
[    3.500000] the code is fine but needs lockdep annotation.
[    3.500000] turning off the locking correctness validator.
[    3.500000] Pid: 358, comm: scsi_eh_0 Not tainted 2.6.25-rc8-mm2 #17
[    3.500000] 
[    3.500000] Call Trace:
[    3.500000]  [<ffffffff80250631>] ? static_obj+0xd1/0xe0
[    3.500000]  [<ffffffff80252bba>] register_lock_class+0x44a/0x4f0
[    3.500000]  [<ffffffff80252cd0>] __lock_acquire+0x70/0xc50
[    3.500000]  [<ffffffff80253907>] lock_acquire+0x57/0x80
[    3.500000]  [<ffffffff80241ea5>] ? __cancel_work_timer+0x55/0x230
[    3.500000]  [<ffffffff80241ed5>] __cancel_work_timer+0x85/0x230
[    3.500000]  [<ffffffff8025262d>] ? trace_hardirqs_on+0xd/0x10
[    3.500000]  [<ffffffff80252599>] ? trace_hardirqs_on_caller+0xc9/0x150
[    3.500000]  [<ffffffff8025262d>] ? trace_hardirqs_on+0xd/0x10
[    3.500000]  [<ffffffff8045425b>] ? _spin_unlock_irq+0x2b/0x40
[    3.500000]  [<ffffffff80228e60>] ? finish_task_switch+0x0/0xe0
[    3.500000]  [<ffffffff80228ec7>] ? finish_task_switch+0x67/0xe0
[    3.500000]  [<ffffffff8024208d>] cancel_delayed_work_sync+0xd/0x10
[    3.500000]  [<ffffffff8037d0a8>] ata_port_flush_task+0x18/0x50
[    3.500000]  [<ffffffff80389ac4>] ata_scsi_error+0x24/0x680
[    3.500000]  [<ffffffff80371b70>] ? scsi_error_handler+0x0/0x330
[    3.500000]  [<ffffffff80371c4c>] scsi_error_handler+0xdc/0x330
[    3.500000]  [<ffffffff80451d4a>] ? thread_return+0x4e/0x4e4
[    3.500000]  [<ffffffff80252599>] ? trace_hardirqs_on_caller+0xc9/0x150
[    3.500000]  [<ffffffff80371b70>] ? scsi_error_handler+0x0/0x330
[    3.500000]  [<ffffffff802456fd>] kthread+0x4d/0x80
[    3.500000]  [<ffffffff8020c408>] child_rip+0xa/0x12
[    3.500000]  [<ffffffff8020bb1f>] ? restore_args+0x0/0x30
[    3.500000]  [<ffffffff802456b0>] ? kthread+0x0/0x80
[    3.500000]  [<ffffffff8020c3fe>] ? child_rip+0x0/0x12
[    3.500000] 
[    3.600000] ata1: SATA link down (SStatus 0 SControl 300)
[    3.690000] ata2: SATA link down (SStatus 0 SControl 300)
[    3.690000] PNP: PS/2 Controller [PNP0303:PS2K] at 0x60,0x64 irq 1
[    3.690000] PNP: PS/2 appears to have AUX port disabled, if this is incorrect please boot with i8042.nopnp
[    3.690000] serio: i8042 KBD port at 0x60,0x64 irq 1
[    3.690000] mice: PS/2 mouse device common for all mice
[    3.690000] Advanced Linux Sound Architecture Driver Version 1.0.16.
[    3.700000] ACPI: PCI Interrupt 0000:00:1b.0[A] -> GSI 22 (level, low) -> IRQ 22
[    3.700000] PCI: Setting latency timer of device 0000:00:1b.0 to 64
[    3.720000] input: AT Translated Set 2 keyboard as /class/input/input0
[    4.000000] ALSA device list:
[    4.000000]   #0: HDA Intel at 0xfebf8000 irq 22
[    4.000000] TCP cubic registered
[    4.000000] NET: Registered protocol family 1
[    4.000000] VFS: Cannot open root device "sda2" or unknown-block(0,0)
[    4.000000] Please append a correct "root=" boot option; here are the available partitions:
[    4.000000] Kernel panic - not syncing: VFS: Unable to mount root fs on unknown-block(0,0)
[    4.000000] Pid: 1, comm: swapper Not tainted 2.6.25-rc8-mm2 #17
[    4.000000] 
[    4.000000] Call Trace:
[    4.000000]  [<ffffffff80230720>] panic+0xa0/0x180
[    4.000000]  [<ffffffff80452b79>] ? mutex_unlock+0x9/0x10
[    4.000000]  [<ffffffff80452b79>] ? mutex_unlock+0x9/0x10
[    4.000000]  [<ffffffff805df01c>] ? printk_all_partitions+0x11c/0x180
[    4.000000]  [<ffffffff804541e6>] ? _read_unlock+0x26/0x30
[    4.000000]  [<ffffffff805ceea2>] mount_block_root+0x102/0x2a0
[    4.000000]  [<ffffffff805cf096>] mount_root+0x56/0x60
[    4.000000]  [<ffffffff805cf0ec>] prepare_namespace+0x4c/0x160
[    4.000000]  [<ffffffff805cebeb>] kernel_init+0x23b/0x2f0
[    4.000000]  [<ffffffff80453f37>] ? trace_hardirqs_on_thunk+0x3a/0x3f
[    4.000000]  [<ffffffff80252599>] ? trace_hardirqs_on_caller+0xc9/0x150
[    4.000000]  [<ffffffff8020c408>] child_rip+0xa/0x12
[    4.000000]  [<ffffffff8020bb1f>] ? restore_args+0x0/0x30
[    4.000000]  [<ffffffff805ce9b0>] ? kernel_init+0x0/0x2f0
[    4.000000]  [<ffffffff8020c3fe>] ? child_rip+0x0/0x12


^ permalink raw reply	[flat|nested] 71+ messages in thread

* Re: 2.6.25-rc8-mm2: CONFIG_ATA_SFF: panic involving mount_block_root and down the road
  2008-04-11 23:43 ` 2.6.25-rc8-mm2: panic involving mount_block_root and down the road Alexey Dobriyan
@ 2008-04-12  0:59   ` Alexey Dobriyan
  2008-04-12  6:52     ` Adrian Bunk
  0 siblings, 1 reply; 71+ messages in thread
From: Alexey Dobriyan @ 2008-04-12  0:59 UTC (permalink / raw
  To: Andrew Morton; +Cc: linux-kernel, linux-ide

On Sat, Apr 12, 2008 at 03:43:45AM +0400, Alexey Dobriyan wrote:
> Pekka fixed SLUB for me, and now core2 box survives up and including to
> not finding / :
> 
> Setup is SATA disk with plain old partitions, nothing lvmancy:
> 
> 	/dev/sda2 on / type ext3 (rw,noatime)
> 
> CONFIG_ATA=y
> CONFIG_ATA_ACPI=y
> CONFIG_SATA_AHCI=y
> CONFIG_ATA_PIIX=y
> CONFIG_PATA_JMICRON=y
> 
> sda1 is for swap.
> 
> 
> [    3.920000] NET: Registered protocol family 1
> [    3.920000] VFS: Cannot open root device "sda2" or unknown-block(0,0)

The winner is partly me, partly git-libata-all.

The latter introduced CONFIG_ATA_SFF option and put more or less every
SATA and PATA driver under it. The former honestly answered N to when
ATA_SFF popped up and failed to check existence of ATA_PIIX and
PATA_JMICRON in failing .config .

Now raise hands those who knew that your ATA controller is SFF
compliant.


^ permalink raw reply	[flat|nested] 71+ messages in thread

* Re: 2.6.25-rc8-mm2
  2008-04-11 13:40 ` 2.6.25-rc8-mm2 Adrian Bunk
@ 2008-04-12  4:22   ` Andrew Morton
  2008-04-12 18:43     ` 2.6.25-rc8-mm2 Aneesh Kumar K.V
  0 siblings, 1 reply; 71+ messages in thread
From: Andrew Morton @ 2008-04-12  4:22 UTC (permalink / raw
  To: Adrian Bunk; +Cc: linux-kernel, linux-ext4

On Fri, 11 Apr 2008 16:40:36 +0300 Adrian Bunk <bunk@kernel.org> wrote:

> On Thu, Apr 10, 2008 at 08:33:54PM -0700, Andrew Morton wrote:
> >...
> > Changes since 2.6.25-rc8-mm1:
> >...
> > +ext4-is-busted-on-m68k.patch
> > 
> >  ext4 doesn't build on m68k
> >...
> 
> Is this due to the generic_find_next_le_bit compile error I reported as 
> 2.6.25 regression (there seems to be some discussion recently how to 
> fix it - hopefully for 2.6.25) or is there even more breakage in -mm?

Yes, it's the bitops screwup.

I saw some related linux-ext4 email float past yesterday, so something
might be happening.


^ permalink raw reply	[flat|nested] 71+ messages in thread

* Re: 2.6.25-rc8-mm2
  2008-04-11  3:33 2.6.25-rc8-mm2 Andrew Morton
                   ` (4 preceding siblings ...)
  2008-04-11 23:43 ` 2.6.25-rc8-mm2: panic involving mount_block_root and down the road Alexey Dobriyan
@ 2008-04-12  5:53 ` Valdis.Kletnieks
  2008-04-12  6:07   ` 2.6.25-rc8-mm2 Andrew Morton
  2008-04-12  8:52 ` 2.6.25-rc8-mm2 - CONFIG_PROFILE_LIKELY broke again Valdis.Kletnieks
                   ` (5 subsequent siblings)
  11 siblings, 1 reply; 71+ messages in thread
From: Valdis.Kletnieks @ 2008-04-12  5:53 UTC (permalink / raw
  To: Andrew Morton; +Cc: linux-kernel

[-- Attachment #1: Type: text/plain, Size: 2844 bytes --]

On Thu, 10 Apr 2008 20:33:54 PDT, Andrew Morton said:
> 
> ftp://ftp.kernel.org/pub/linux/kernel/people/akpm/patches/2.6/2.6.25-rc8/2.6.25-rc8-mm2/

Was in the middle of a bisection chasing a different problem, and hit this
pair of warnings.  'quilt top' at the time was remove-div_long_long_rem.patch
(in other words, very nearly all of rc8-mm2 was applied). Feel free to delete
if it's known-fixed in one of the last 100 or so patches. ;)

[    0.060389] ACPI: Core revision 20070126
[    0.069072] ------------[ cut here ]------------
[    0.069077] WARNING: at arch/x86/kernel/genapic_64.c:86 read_apic_id+0x41/0x7c()
[    0.069084] Modules linked in:
[    0.069090] Pid: 1, comm: swapper Not tainted 2.6.25-rc8 #5
[    0.069094]
[    0.069095] Call Trace:
[    0.069104]  [<ffffffff8023d4fe>] warn_on_slowpath+0x67/0xb7
[    0.069113]  [<ffffffff8021f12d>] ? __cpus_weight+0x4b/0x68
[    0.069118]  [<ffffffff80225591>] read_apic_id+0x41/0x7c
[    0.069126]  [<ffffffff80729d84>] verify_local_APIC+0xb4/0x177
[    0.069132]  [<ffffffff80728a81>] native_smp_prepare_cpus+0x247/0x383
[    0.069140]  [<ffffffff8071d712>] kernel_init+0x69/0x29c
[    0.069148]  [<ffffffff8057908a>] ? _spin_unlock_irq+0x43/0x62
[    0.069156]  [<ffffffff80237516>] ? finish_task_switch+0x3e/0xb4
[    0.069163]  [<ffffffff8020d5f8>] child_rip+0xa/0x12
[    0.069994]  [<ffffffff803b8237>] ? acpi_ds_init_one_object+0x0/0xb1
[    0.070000]  [<ffffffff8071d6a9>] ? kernel_init+0x0/0x29c
[    0.070006]  [<ffffffff8020d5ee>] ? child_rip+0x0/0x12
[    0.070010]
[    0.070019] ---[ end trace a7919e7f17c0a725 ]---
[    0.070024] ------------[ cut here ]------------
[    0.070028] WARNING: at arch/x86/kernel/genapic_64.c:86 read_apic_id+0x41/0x7c()
[    0.070034] Modules linked in:
[    0.070039] Pid: 1, comm: swapper Tainted: G        W 2.6.25-rc8 #5
[    0.070043]
[    0.070044] Call Trace:
[    0.070049]  [<ffffffff8023d4fe>] warn_on_slowpath+0x67/0xb7
[    0.070056]  [<ffffffff8021f12d>] ? __cpus_weight+0x4b/0x68
[    0.070061]  [<ffffffff80225591>] read_apic_id+0x41/0x7c
[    0.070067]  [<ffffffff80729db6>] verify_local_APIC+0xe6/0x177
[    0.070072]  [<ffffffff80728a81>] native_smp_prepare_cpus+0x247/0x383
[    0.070078]  [<ffffffff8071d712>] kernel_init+0x69/0x29c
[    0.070084]  [<ffffffff8057908a>] ? _spin_unlock_irq+0x43/0x62
[    0.070090]  [<ffffffff80237516>] ? finish_task_switch+0x3e/0xb4
[    0.070096]  [<ffffffff8020d5f8>] child_rip+0xa/0x12
[    0.070102]  [<ffffffff803b8237>] ? acpi_ds_init_one_object+0x0/0xb1
[    0.070108]  [<ffffffff8071d6a9>] ? kernel_init+0x0/0x29c
[    0.070113]  [<ffffffff8020d5ee>] ? child_rip+0x0/0x12
[    0.070117]
[    0.070120] ---[ end trace a7919e7f17c0a725 ]---
[    0.080292] CPU0: Intel(R) Core(TM)2 CPU         T7200  @ 2.00GHz stepping 06
[    0.080300] Using local APIC timer interrupts.


[-- Attachment #2: Type: application/pgp-signature, Size: 226 bytes --]

^ permalink raw reply	[flat|nested] 71+ messages in thread

* Re: 2.6.25-rc8-mm2
  2008-04-12  5:53 ` 2.6.25-rc8-mm2 Valdis.Kletnieks
@ 2008-04-12  6:07   ` Andrew Morton
  0 siblings, 0 replies; 71+ messages in thread
From: Andrew Morton @ 2008-04-12  6:07 UTC (permalink / raw
  To: Valdis.Kletnieks; +Cc: linux-kernel, Ingo Molnar, Thomas Gleixner

On Sat, 12 Apr 2008 01:53:59 -0400 Valdis.Kletnieks@vt.edu wrote:

> On Thu, 10 Apr 2008 20:33:54 PDT, Andrew Morton said:
> > 
> > ftp://ftp.kernel.org/pub/linux/kernel/people/akpm/patches/2.6/2.6.25-rc8/2.6.25-rc8-mm2/
> 
> Was in the middle of a bisection chasing a different problem, and hit this
> pair of warnings.  'quilt top' at the time was remove-div_long_long_rem.patch

yup, remove-div_long_long_rem.patch is a "good" bisection point (one with
no known fixes against it).

> (in other words, very nearly all of rc8-mm2 was applied). Feel free to delete
> if it's known-fixed in one of the last 100 or so patches. ;)
> 
> [    0.060389] ACPI: Core revision 20070126
> [    0.069072] ------------[ cut here ]------------
> [    0.069077] WARNING: at arch/x86/kernel/genapic_64.c:86 read_apic_id+0x41/0x7c()
> [    0.069084] Modules linked in:
> [    0.069090] Pid: 1, comm: swapper Not tainted 2.6.25-rc8 #5
> [    0.069094]
> [    0.069095] Call Trace:
> [    0.069104]  [<ffffffff8023d4fe>] warn_on_slowpath+0x67/0xb7
> [    0.069113]  [<ffffffff8021f12d>] ? __cpus_weight+0x4b/0x68
> [    0.069118]  [<ffffffff80225591>] read_apic_id+0x41/0x7c
> [    0.069126]  [<ffffffff80729d84>] verify_local_APIC+0xb4/0x177
> [    0.069132]  [<ffffffff80728a81>] native_smp_prepare_cpus+0x247/0x383
> [    0.069140]  [<ffffffff8071d712>] kernel_init+0x69/0x29c
> [    0.069148]  [<ffffffff8057908a>] ? _spin_unlock_irq+0x43/0x62
> [    0.069156]  [<ffffffff80237516>] ? finish_task_switch+0x3e/0xb4
> [    0.069163]  [<ffffffff8020d5f8>] child_rip+0xa/0x12
> [    0.069994]  [<ffffffff803b8237>] ? acpi_ds_init_one_object+0x0/0xb1
> [    0.070000]  [<ffffffff8071d6a9>] ? kernel_init+0x0/0x29c
> [    0.070006]  [<ffffffff8020d5ee>] ? child_rip+0x0/0x12
> [    0.070010]
> [    0.070019] ---[ end trace a7919e7f17c0a725 ]---
> [    0.070024] ------------[ cut here ]------------
> [    0.070028] WARNING: at arch/x86/kernel/genapic_64.c:86 read_apic_id+0x41/0x7c()
> [    0.070034] Modules linked in:
> [    0.070039] Pid: 1, comm: swapper Tainted: G        W 2.6.25-rc8 #5
> [    0.070043]
> [    0.070044] Call Trace:
> [    0.070049]  [<ffffffff8023d4fe>] warn_on_slowpath+0x67/0xb7
> [    0.070056]  [<ffffffff8021f12d>] ? __cpus_weight+0x4b/0x68
> [    0.070061]  [<ffffffff80225591>] read_apic_id+0x41/0x7c
> [    0.070067]  [<ffffffff80729db6>] verify_local_APIC+0xe6/0x177
> [    0.070072]  [<ffffffff80728a81>] native_smp_prepare_cpus+0x247/0x383
> [    0.070078]  [<ffffffff8071d712>] kernel_init+0x69/0x29c
> [    0.070084]  [<ffffffff8057908a>] ? _spin_unlock_irq+0x43/0x62
> [    0.070090]  [<ffffffff80237516>] ? finish_task_switch+0x3e/0xb4
> [    0.070096]  [<ffffffff8020d5f8>] child_rip+0xa/0x12
> [    0.070102]  [<ffffffff803b8237>] ? acpi_ds_init_one_object+0x0/0xb1
> [    0.070108]  [<ffffffff8071d6a9>] ? kernel_init+0x0/0x29c
> [    0.070113]  [<ffffffff8020d5ee>] ? child_rip+0x0/0x12
> [    0.070117]
> [    0.070120] ---[ end trace a7919e7f17c0a725 ]---
> [    0.080292] CPU0: Intel(R) Core(TM)2 CPU         T7200  @ 2.00GHz stepping 06
> [    0.080300] Using local APIC timer interrupts.

So that may well be a destined-for-2.6.25 problem.

^ permalink raw reply	[flat|nested] 71+ messages in thread

* Re: 2.6.25-rc8-mm2: CONFIG_ATA_SFF: panic involving mount_block_root and down the road
  2008-04-12  0:59   ` 2.6.25-rc8-mm2: CONFIG_ATA_SFF: " Alexey Dobriyan
@ 2008-04-12  6:52     ` Adrian Bunk
  2008-04-12 10:02       ` Jeff Garzik
  0 siblings, 1 reply; 71+ messages in thread
From: Adrian Bunk @ 2008-04-12  6:52 UTC (permalink / raw
  To: Alexey Dobriyan, Tejun Heo, Jeff Garzik
  Cc: Andrew Morton, linux-kernel, linux-ide

On Sat, Apr 12, 2008 at 04:59:17AM +0400, Alexey Dobriyan wrote:
> On Sat, Apr 12, 2008 at 03:43:45AM +0400, Alexey Dobriyan wrote:
> > Pekka fixed SLUB for me, and now core2 box survives up and including to
> > not finding / :
> > 
> > Setup is SATA disk with plain old partitions, nothing lvmancy:
> > 
> > 	/dev/sda2 on / type ext3 (rw,noatime)
> > 
> > CONFIG_ATA=y
> > CONFIG_ATA_ACPI=y
> > CONFIG_SATA_AHCI=y
> > CONFIG_ATA_PIIX=y
> > CONFIG_PATA_JMICRON=y
> > 
> > sda1 is for swap.
> > 
> > 
> > [    3.920000] NET: Registered protocol family 1
> > [    3.920000] VFS: Cannot open root device "sda2" or unknown-block(0,0)
> 
> The winner is partly me, partly git-libata-all.
> 
> The latter introduced CONFIG_ATA_SFF option and put more or less every
> SATA and PATA driver under it. The former honestly answered N to when
> ATA_SFF popped up and failed to check existence of ATA_PIIX and
> PATA_JMICRON in failing .config .
> 
> Now raise hands those who knew that your ATA controller is SFF
> compliant.

Is there any technical reason why we have to bother users with the 
ATA_SFF option at all?

It sounds like a perfect canndidate for being select'ed.

cu
Adrian

-- 

       "Is there not promise of rain?" Ling Tan asked suddenly out
        of the darkness. There had been need of rain for many days.
       "Only a promise," Lao Er said.
                                       Pearl S. Buck - Dragon Seed


^ permalink raw reply	[flat|nested] 71+ messages in thread

* 2.6.25-rc8-mm2 - CONFIG_PROFILE_LIKELY broke again...
  2008-04-11  3:33 2.6.25-rc8-mm2 Andrew Morton
                   ` (5 preceding siblings ...)
  2008-04-12  5:53 ` 2.6.25-rc8-mm2 Valdis.Kletnieks
@ 2008-04-12  8:52 ` Valdis.Kletnieks
  2008-04-12  9:41 ` 2.6.25-rc8-mm2 Valdis.Kletnieks
                   ` (4 subsequent siblings)
  11 siblings, 0 replies; 71+ messages in thread
From: Valdis.Kletnieks @ 2008-04-12  8:52 UTC (permalink / raw
  To: Andrew Morton; +Cc: linux-kernel

[-- Attachment #1: Type: text/plain, Size: 964 bytes --]

On Thu, 10 Apr 2008 20:33:54 PDT, Andrew Morton said:
> ftp://ftp.kernel.org/pub/linux/kernel/people/akpm/patches/2.6/2.6.25-rc8/2.6.25-rc8-mm2/

OK, I found the issue I was chasing when I noticed the WARN from genapic_64.c.

After seeing that I'd gotten all the way to remove-div_long_long_rem.patch I
took a close look at the remaining several dozen patches, and took a wild
stab in the dark.. .and ta-da...

PROFILE_LIKELY broke between -rc8-mm1 and -rc8-mm2.  Setting it to 'y'
gets me an instant reboot similar to what I was seeing against -rc2-mm.

And changed between -mm1 and -mm2 (datestamp of 04/08):

patches/profile-likely-unlikely-macros.patch
patches/profile-likely-unlikely-macros-fix.patch

I found the profile-likely-unlikely-macros-fix-2.patch from last time around,
but the code has changed a bunch since then so it isn't a clean apply (and in
fact, it's so different it's not even easily hand-patchable - stuff like
!!(foo) has appeared).  




[-- Attachment #2: Type: application/pgp-signature, Size: 226 bytes --]

^ permalink raw reply	[flat|nested] 71+ messages in thread

* 2.6.25-rc8-mm2
  2008-04-11  3:33 2.6.25-rc8-mm2 Andrew Morton
                   ` (6 preceding siblings ...)
  2008-04-12  8:52 ` 2.6.25-rc8-mm2 - CONFIG_PROFILE_LIKELY broke again Valdis.Kletnieks
@ 2008-04-12  9:41 ` Valdis.Kletnieks
  2008-04-12 10:47 ` 2.6.25-rc8-mm2 - ftraced chews 100% of a CPU Valdis.Kletnieks
                   ` (3 subsequent siblings)
  11 siblings, 0 replies; 71+ messages in thread
From: Valdis.Kletnieks @ 2008-04-12  9:41 UTC (permalink / raw
  To: Andrew Morton, Ingo Molnar, Steven Rostedt; +Cc: linux-kernel

[-- Attachment #1: Type: text/plain, Size: 976 bytes --]

On Thu, 10 Apr 2008 20:33:54 PDT, Andrew Morton said:
> ftp://ftp.kernel.org/pub/linux/kernel/people/akpm/patches/2.6/2.6.25-rc8/2.6.25-rc8-mm2/

I'm pretty sure this one is for Ingo and Steven to sort out, their
names are all over git-sched.patch for this code.... :)

The following config will actually build on x86_64:

CONFIG_HAVE_FTRACE=y
# CONFIG_FTRACE is not set
CONFIG_FTRACE_STARTUP_TEST=y

However, at boot time, it dies a quick and horrid ker-splat and hangs without
doing anything visible at all, without even the decency of rebooting. I'm going
to guess that the startup test fandangos on memory that wasn't set up by the
non-present CONFIG_FTRACE.

I got into this state by saying 'y' to 'startup test' in make oldconfig,
then deciding I didn't want ftrace so I turned that *one* entry off in
make menuconfig - which left the startup test dangling.  Easy local workaround
was to just turn the test off too, so you guys can hash this one out at
your leisure...


[-- Attachment #2: Type: application/pgp-signature, Size: 226 bytes --]

^ permalink raw reply	[flat|nested] 71+ messages in thread

* Re: 2.6.25-rc8-mm2: CONFIG_ATA_SFF: panic involving mount_block_root and down the road
  2008-04-12  6:52     ` Adrian Bunk
@ 2008-04-12 10:02       ` Jeff Garzik
  2008-04-13  0:45         ` Tejun Heo
  2008-04-13  0:51         ` [mm patch] select ATA_SFF Adrian Bunk
  0 siblings, 2 replies; 71+ messages in thread
From: Jeff Garzik @ 2008-04-12 10:02 UTC (permalink / raw
  To: Adrian Bunk
  Cc: Alexey Dobriyan, Tejun Heo, Jeff Garzik, Andrew Morton,
	linux-kernel, linux-ide

Adrian Bunk wrote:
> On Sat, Apr 12, 2008 at 04:59:17AM +0400, Alexey Dobriyan wrote:
>> On Sat, Apr 12, 2008 at 03:43:45AM +0400, Alexey Dobriyan wrote:
>>> Pekka fixed SLUB for me, and now core2 box survives up and including to
>>> not finding / :
>>>
>>> Setup is SATA disk with plain old partitions, nothing lvmancy:
>>>
>>> 	/dev/sda2 on / type ext3 (rw,noatime)
>>>
>>> CONFIG_ATA=y
>>> CONFIG_ATA_ACPI=y
>>> CONFIG_SATA_AHCI=y
>>> CONFIG_ATA_PIIX=y
>>> CONFIG_PATA_JMICRON=y
>>>
>>> sda1 is for swap.
>>>
>>>
>>> [    3.920000] NET: Registered protocol family 1
>>> [    3.920000] VFS: Cannot open root device "sda2" or unknown-block(0,0)
>> The winner is partly me, partly git-libata-all.
>>
>> The latter introduced CONFIG_ATA_SFF option and put more or less every
>> SATA and PATA driver under it. The former honestly answered N to when
>> ATA_SFF popped up and failed to check existence of ATA_PIIX and
>> PATA_JMICRON in failing .config .
>>
>> Now raise hands those who knew that your ATA controller is SFF
>> compliant.
> 
> Is there any technical reason why we have to bother users with the 
> ATA_SFF option at all?
> 
> It sounds like a perfect canndidate for being select'ed.

'default y' is appropriate, but option that is used to disable a major 
swath of legacy code unneeded on modern FIS-based SATA platforms like AHCI.

	Jeff




^ permalink raw reply	[flat|nested] 71+ messages in thread

* 2.6.25-rc8-mm2 - ftraced chews 100% of a CPU
  2008-04-11  3:33 2.6.25-rc8-mm2 Andrew Morton
                   ` (7 preceding siblings ...)
  2008-04-12  9:41 ` 2.6.25-rc8-mm2 Valdis.Kletnieks
@ 2008-04-12 10:47 ` Valdis.Kletnieks
  2008-04-13 20:44 ` 2.6.25-rc8-mm2: IP: [<ffffffff802868f9>] __kmalloc+0x69/0x110 Alexey Dobriyan
                   ` (2 subsequent siblings)
  11 siblings, 0 replies; 71+ messages in thread
From: Valdis.Kletnieks @ 2008-04-12 10:47 UTC (permalink / raw
  To: Andrew Morton, Ingo Molnar; +Cc: linux-kernel

[-- Attachment #1: Type: text/plain, Size: 3174 bytes --]

On Thu, 10 Apr 2008 20:33:54 PDT, Andrew Morton said:
> ftp://ftp.kernel.org/pub/linux/kernel/people/akpm/patches/2.6/2.6.25-rc8/2.6.25-rc8-mm2/

(Man, everything I look at tonight falls over.. I'm jinxed :)

So I built a kernel with:

% zgrep FTRACE /proc/config.gz 
CONFIG_HAVE_FTRACE=y
CONFIG_FTRACE=y
CONFIG_DYNAMIC_FTRACE=y
CONFIG_FTRACE_SELFTEST=y
CONFIG_FTRACE_STARTUP_TEST=y

It ran pretty much OK for about 12 minutes, and then gkrellm reported tons of
system time, and 'top' confirms it:

top - 06:27:28 up 25 min,  3 users,  load average: 1.21, 1.20, 1.02
Tasks: 132 total,   3 running, 128 sleeping,   0 stopped,   1 zombie
Cpu(s):  0.7%us, 50.7%sy,  0.0%ni, 48.7%id,  0.0%wa,  0.0%hi,  0.0%si,  0.0%st
Mem:   2054764k total,   773648k used,  1281116k free,    32008k buffers
Swap:  2031608k total,        0k used,  2031608k free,   425696k cached

  PID USER      PR  NI  VIRT  RES  SHR S %CPU %MEM    TIME+  COMMAND                                                      
   17 root      15  -5     0    0    0 R 99.8  0.0  13:04.53 ftraced                                                       
 2717 root      20   0  169m  60m  11m S  2.0  3.0   1:14.71 X           

And the CPUs are sitting at around 58C, which is their usual temp when
running flat-out, so there's actual looping happening.

I see this in the dmesg from right around when it went belly-up:

[  725.345544] hm, dftrace overflow: 127 changes (0 total) in 273 usecs
[  725.345562] ------------[ cut here ]------------
[  725.345568] WARNING: at kernel/trace/ftrace.c:658 ftraced+0x138/0x1f0()

(no I have no idea what the system was doing at that instant)

and 'echo t > /proc/sysrq-trigger' tells us:

[ 2368.256762] ftraced       R  running task     5952    17      2
[ 2368.256769]  ffff81000101b940 000000010101f800 ffff81007f957d10 ffffffff802302c9
[ 2368.256777]  ffff81007f954480 0000000000000000 00000000ab636be3 ffffffff805e35ec
[ 2368.256784]  ffff81007f957d50 ffffffff80213b72 0000000000000246 ffffffff8022ea19
[ 2368.256791] Call Trace:
[ 2368.256796]  [<ffffffff805e0632>] ? schedule+0x3e/0x6a9
[ 2368.256801]  [<ffffffff80230be3>] ? pick_next_task_fair+0xa0/0xc2
[ 2368.256806]  [<ffffffff8028d690>] ? trace_preempt_on+0x1c/0x32
[ 2368.256811]  [<ffffffff805e6469>] ? sub_preempt_count+0x49/0x73
[ 2368.256817]  [<ffffffff805e35ec>] ? _spin_lock_irqsave+0x35/0x69
[ 2368.256826]  [<ffffffff8024afed>] ? __mod_timer+0xce/0xf4
[ 2368.256830]  [<ffffffff8024ad0a>] ? del_timer_sync+0x28/0x4d
[ 2368.256836]  [<ffffffff805e1152>] ? schedule_timeout+0xac/0xdc
[ 2368.256840]  [<ffffffff8024a74b>] ? process_timeout+0x0/0x37
[ 2368.256844]  [<ffffffff8024af4f>] ? __mod_timer+0x30/0xf4
[ 2368.256851]  [<ffffffff80286ef0>] ? ftraced+0x52/0x1f0
[ 2368.256855]  [<ffffffff80257bbf>] ? kthread+0x0/0xa4
[ 2368.256859]  [<ffffffff80286e9e>] ? ftraced+0x0/0x1f0
[ 2368.256863]  [<ffffffff80257c20>] ? kthread+0x61/0xa4
[ 2368.256868]  [<ffffffff8020d7f8>] ? child_rip+0xa/0x12
[ 2368.256873]  [<ffffffff8020cee0>] ? restore_args+0x0/0x30
[ 2368.256878]  [<ffffffff80257bbf>] ? kthread+0x0/0xa4
[ 2368.256882]  [<ffffffff8020d7ee>] ? child_rip+0x0/0x12

Hopefully this tells you something?

[-- Attachment #2: Type: application/pgp-signature, Size: 226 bytes --]

^ permalink raw reply	[flat|nested] 71+ messages in thread

* Re: 2.6.25-rc8-mm2
  2008-04-12  4:22   ` 2.6.25-rc8-mm2 Andrew Morton
@ 2008-04-12 18:43     ` Aneesh Kumar K.V
  2008-04-13  7:45       ` 2.6.25-rc8-mm2 Michael Schmitz
  0 siblings, 1 reply; 71+ messages in thread
From: Aneesh Kumar K.V @ 2008-04-12 18:43 UTC (permalink / raw
  To: Andrew Morton; +Cc: Adrian Bunk, linux-kernel, linux-ext4, linux-m68k

On Fri, Apr 11, 2008 at 09:22:22PM -0700, Andrew Morton wrote:
> On Fri, 11 Apr 2008 16:40:36 +0300 Adrian Bunk <bunk@kernel.org> wrote:
> 
> > On Thu, Apr 10, 2008 at 08:33:54PM -0700, Andrew Morton wrote:
> > >...
> > > Changes since 2.6.25-rc8-mm1:
> > >...
> > > +ext4-is-busted-on-m68k.patch
> > > 
> > >  ext4 doesn't build on m68k
> > >...
> > 
> > Is this due to the generic_find_next_le_bit compile error I reported as 
> > 2.6.25 regression (there seems to be some discussion recently how to 
> > fix it - hopefully for 2.6.25) or is there even more breakage in -mm?
> 
> Yes, it's the bitops screwup.
> 
> I saw some related linux-ext4 email float past yesterday, so something
> might be happening.
> 

It would really help if we can get some m68k people to look at patch
i posted.

http://article.gmane.org/gmane.comp.file-systems.ext4/5944


-aneesh

^ permalink raw reply	[flat|nested] 71+ messages in thread

* Re: 2.6.25-rc8-mm2: CONFIG_ATA_SFF: panic involving mount_block_root and down the road
  2008-04-12 10:02       ` Jeff Garzik
@ 2008-04-13  0:45         ` Tejun Heo
  2008-04-13  0:51         ` [mm patch] select ATA_SFF Adrian Bunk
  1 sibling, 0 replies; 71+ messages in thread
From: Tejun Heo @ 2008-04-13  0:45 UTC (permalink / raw
  To: Jeff Garzik
  Cc: Adrian Bunk, Alexey Dobriyan, Jeff Garzik, Andrew Morton,
	linux-kernel, linux-ide

Jeff Garzik wrote:
>>> Now raise hands those who knew that your ATA controller is SFF
>>> compliant.

/me raises a hand.

>> Is there any technical reason why we have to bother users with the 
>> ATA_SFF option at all?
>>
>> It sounds like a perfect canndidate for being select'ed.
> 
> 'default y' is appropriate, but option that is used to disable a major 
> swath of legacy code unneeded on modern FIS-based SATA platforms like AHCI.

Heh.. yeah, but I have to admit SFF support is cryptic.  We can 
definitely use some friendly explanation there.

-- 
tejun

^ permalink raw reply	[flat|nested] 71+ messages in thread

* [mm patch] select ATA_SFF
  2008-04-12 10:02       ` Jeff Garzik
  2008-04-13  0:45         ` Tejun Heo
@ 2008-04-13  0:51         ` Adrian Bunk
  1 sibling, 0 replies; 71+ messages in thread
From: Adrian Bunk @ 2008-04-13  0:51 UTC (permalink / raw
  To: Jeff Garzik
  Cc: Alexey Dobriyan, Tejun Heo, Jeff Garzik, Andrew Morton,
	linux-kernel, linux-ide

On Sat, Apr 12, 2008 at 06:02:21AM -0400, Jeff Garzik wrote:
> Adrian Bunk wrote:
>> On Sat, Apr 12, 2008 at 04:59:17AM +0400, Alexey Dobriyan wrote:
>>> On Sat, Apr 12, 2008 at 03:43:45AM +0400, Alexey Dobriyan wrote:
>>>> Pekka fixed SLUB for me, and now core2 box survives up and including to
>>>> not finding / :
>>>>
>>>> Setup is SATA disk with plain old partitions, nothing lvmancy:
>>>>
>>>> 	/dev/sda2 on / type ext3 (rw,noatime)
>>>>
>>>> CONFIG_ATA=y
>>>> CONFIG_ATA_ACPI=y
>>>> CONFIG_SATA_AHCI=y
>>>> CONFIG_ATA_PIIX=y
>>>> CONFIG_PATA_JMICRON=y
>>>>
>>>> sda1 is for swap.
>>>>
>>>>
>>>> [    3.920000] NET: Registered protocol family 1
>>>> [    3.920000] VFS: Cannot open root device "sda2" or unknown-block(0,0)
>>> The winner is partly me, partly git-libata-all.
>>>
>>> The latter introduced CONFIG_ATA_SFF option and put more or less every
>>> SATA and PATA driver under it. The former honestly answered N to when
>>> ATA_SFF popped up and failed to check existence of ATA_PIIX and
>>> PATA_JMICRON in failing .config .
>>>
>>> Now raise hands those who knew that your ATA controller is SFF
>>> compliant.
>>
>> Is there any technical reason why we have to bother users with the  
>> ATA_SFF option at all?
>>
>> It sounds like a perfect canndidate for being select'ed.
>
> 'default y' is appropriate, but option that is used to disable a major  
> swath of legacy code unneeded on modern FIS-based SATA platforms like 
> AHCI.

I think you didn't understand my suggestion.

I didn't want to get the code enabled unconditionally, I want kconfig 
users to not needlessly being bothered with an option where we could 
determine automatically the correct setting.

The patch below implements what I have in mind.

> 	Jeff

cu
Adrian


<--  snip  -->


Making ATA_SFF a user-visible option with the drivers needing it 
depending on it caused the following problems:
- people lose their driver when accidentally disabling it
- people not requiring it needlessly enable it

Fortunately, we don't have to bother the user with this option at all 
since we can simply select it when it's required.

Signed-off-by: Adrian Bunk <bunk@kernel.org>

---

 drivers/ata/Kconfig |   79 +++++++++++++++++++++++++++++++++++++++-----
 1 file changed, 71 insertions(+), 8 deletions(-)

--- linux-2.6.25-rc8-mm2/drivers/ata/Kconfig.old	2008-04-13 01:59:27.000000000 +0300
+++ linux-2.6.25-rc8-mm2/drivers/ata/Kconfig	2008-04-13 02:05:42.000000000 +0300
@@ -73,17 +73,12 @@
 	  If unsure, say N.
 
 config ATA_SFF
-	bool "ATA SFF support"
-	default y
-	help
-	  This option adds support for ATA controllers with SFF
-	  compliant or similar programming interface.
-
-if ATA_SFF
+	bool
 
 config SATA_SVW
 	tristate "ServerWorks Frodo / Apple K2 SATA support"
 	depends on PCI
+	select ATA_SFF
 	help
 	  This option enables support for Broadcom/Serverworks/Apple K2
 	  SATA support.
@@ -93,6 +88,7 @@
 config ATA_PIIX
 	tristate "Intel ESB, ICH, PIIX3, PIIX4 PATA/SATA support"
 	depends on PCI
+	select ATA_SFF
 	help
 	  This option enables support for ICH5/6/7/8 Serial ATA
 	  and support for PATA on the Intel ESB/ICH/PIIX3/PIIX4 series
@@ -103,6 +99,7 @@
 config SATA_MV
 	tristate "Marvell SATA support (HIGHLY EXPERIMENTAL)"
 	depends on EXPERIMENTAL
+	select ATA_SFF
 	help
 	  This option enables support for the Marvell Serial ATA family.
 	  Currently supports 88SX[56]0[48][01] chips.
@@ -112,6 +109,7 @@
 config SATA_NV
 	tristate "NVIDIA SATA support"
 	depends on PCI
+	select ATA_SFF
 	help
 	  This option enables support for NVIDIA Serial ATA.
 
@@ -120,6 +118,7 @@
 config PDC_ADMA
 	tristate "Pacific Digital ADMA support"
 	depends on PCI
+	select ATA_SFF
 	help
 	  This option enables support for Pacific Digital ADMA controllers
 
@@ -128,6 +127,7 @@
 config SATA_QSTOR
 	tristate "Pacific Digital SATA QStor support"
 	depends on PCI
+	select ATA_SFF
 	help
 	  This option enables support for Pacific Digital Serial ATA QStor.
 
@@ -136,6 +136,7 @@
 config SATA_PROMISE
 	tristate "Promise SATA TX2/TX4 support"
 	depends on PCI
+	select ATA_SFF
 	help
 	  This option enables support for Promise Serial ATA TX2/TX4.
 
@@ -144,6 +145,7 @@
 config SATA_SX4
 	tristate "Promise SATA SX4 support"
 	depends on PCI && EXPERIMENTAL
+	select ATA_SFF
 	help
 	  This option enables support for Promise Serial ATA SX4.
 
@@ -152,6 +154,7 @@
 config SATA_SIL
 	tristate "Silicon Image SATA support"
 	depends on PCI
+	select ATA_SFF
 	help
 	  This option enables support for Silicon Image Serial ATA.
 
@@ -161,6 +164,7 @@
 	tristate "SiS 964/965/966/180 SATA support"
 	depends on PCI
 	select PATA_SIS
+	select ATA_SFF
 	help
 	  This option enables support for SiS Serial ATA on
 	  SiS 964/965/966/180 and Parallel ATA on SiS 180.
@@ -171,6 +175,7 @@
 config SATA_ULI
 	tristate "ULi Electronics SATA support"
 	depends on PCI
+	select ATA_SFF
 	help
 	  This option enables support for ULi Electronics SATA.
 
@@ -179,6 +184,7 @@
 config SATA_VIA
 	tristate "VIA SATA support"
 	depends on PCI
+	select ATA_SFF
 	help
 	  This option enables support for VIA Serial ATA.
 
@@ -187,6 +193,7 @@
 config SATA_VITESSE
 	tristate "VITESSE VSC-7174 / INTEL 31244 SATA support"
 	depends on PCI
+	select ATA_SFF
 	help
 	  This option enables support for Vitesse VSC7174 and Intel 31244 Serial ATA.
 
@@ -195,12 +202,14 @@
 config SATA_INIC162X
 	tristate "Initio 162x SATA support (HIGHLY EXPERIMENTAL)"
 	depends on PCI && EXPERIMENTAL
+	select ATA_SFF
 	help
 	  This option enables support for Initio 162x Serial ATA.
 
 config PATA_ACPI
 	tristate "ACPI firmware driver for PATA"
 	depends on ATA_ACPI
+	select ATA_SFF
 	help
 	  This option enables an ACPI method driver which drives
 	  motherboard PATA controller interfaces through the ACPI
@@ -210,6 +219,7 @@
 config PATA_ALI
 	tristate "ALi PATA support (Experimental)"
 	depends on PCI && EXPERIMENTAL
+	select ATA_SFF
 	help
 	  This option enables support for the ALi ATA interfaces
 	  found on the many ALi chipsets.
@@ -219,6 +229,7 @@
 config PATA_AMD
 	tristate "AMD/NVidia PATA support"
 	depends on PCI
+	select ATA_SFF
 	help
 	  This option enables support for the AMD and NVidia PATA
 	  interfaces found on the chipsets for Athlon/Athlon64.
@@ -228,6 +239,7 @@
 config PATA_ARTOP
 	tristate "ARTOP 6210/6260 PATA support"
 	depends on PCI
+	select ATA_SFF
 	help
 	  This option enables support for ARTOP PATA controllers.
 
@@ -236,6 +248,7 @@
 config PATA_AT32
 	tristate "Atmel AVR32 PATA support (Experimental)"
 	depends on AVR32 && PLATFORM_AT32AP && EXPERIMENTAL
+	select ATA_SFF
 	help
 	  This option enables support for the IDE devices on the
 	  Atmel AT32AP platform.
@@ -245,6 +258,7 @@
 config PATA_ATIIXP
 	tristate "ATI PATA support"
 	depends on PCI
+	select ATA_SFF
 	help
 	  This option enables support for the ATI ATA interfaces
 	  found on the many ATI chipsets.
@@ -254,6 +268,7 @@
 config PATA_CMD640_PCI
 	tristate "CMD640 PCI PATA support (Very Experimental)"
 	depends on PCI && EXPERIMENTAL
+	select ATA_SFF
 	help
 	  This option enables support for the CMD640 PCI IDE
 	  interface chip. Only the primary channel is currently
@@ -264,6 +279,7 @@
 config PATA_CMD64X
 	tristate "CMD64x PATA support"
 	depends on PCI
+	select ATA_SFF
 	help
 	  This option enables support for the CMD64x series chips
 	  except for the CMD640.
@@ -273,6 +289,7 @@
 config PATA_CS5520
 	tristate "CS5510/5520 PATA support"
 	depends on PCI
+	select ATA_SFF
 	help
 	  This option enables support for the Cyrix 5510/5520
 	  companion chip used with the MediaGX/Geode processor family.
@@ -282,6 +299,7 @@
 config PATA_CS5530
 	tristate "CS5530 PATA support (Experimental)"
 	depends on PCI && EXPERIMENTAL
+	select ATA_SFF
 	help
 	  This option enables support for the Cyrix/NatSemi/AMD CS5530
 	  companion chip used with the MediaGX/Geode processor family.
@@ -291,6 +309,7 @@
 config PATA_CS5535
 	tristate "CS5535 PATA support (Experimental)"
 	depends on PCI && X86 && !X86_64 && EXPERIMENTAL
+	select ATA_SFF
 	help
 	  This option enables support for the NatSemi/AMD CS5535
 	  companion chip used with the Geode processor family.
@@ -300,6 +319,7 @@
 config PATA_CS5536
 	tristate "CS5536 PATA support (Experimental)"
 	depends on PCI && X86 && !X86_64 && EXPERIMENTAL
+	select ATA_SFF
 	help
 	  This option enables support for the AMD CS5536
 	  companion chip used with the Geode LX processor family.
@@ -309,6 +329,7 @@
 config PATA_CYPRESS
 	tristate "Cypress CY82C693 PATA support (Very Experimental)"
 	depends on PCI && EXPERIMENTAL
+	select ATA_SFF
 	help
 	  This option enables support for the Cypress/Contaq CY82C693
 	  chipset found in some Alpha systems
@@ -318,6 +339,7 @@
 config PATA_EFAR
 	tristate "EFAR SLC90E66 support"
 	depends on PCI
+	select ATA_SFF
 	help
 	  This option enables support for the EFAR SLC90E66
 	  IDE controller found on some older machines.
@@ -327,6 +349,7 @@
 config ATA_GENERIC
 	tristate "Generic ATA support"
 	depends on PCI
+	select ATA_SFF
 	help
 	  This option enables support for generic BIOS configured
 	  ATA controllers via the new ATA layer
@@ -336,6 +359,7 @@
 config PATA_HPT366
 	tristate "HPT 366/368 PATA support"
 	depends on PCI
+	select ATA_SFF
 	help
 	  This option enables support for the HPT 366 and 368
 	  PATA controllers via the new ATA layer.
@@ -345,6 +369,7 @@
 config PATA_HPT37X
 	tristate "HPT 370/370A/371/372/374/302 PATA support (Experimental)"
 	depends on PCI && EXPERIMENTAL
+	select ATA_SFF
 	help
 	  This option enables support for the majority of the later HPT
 	  PATA controllers via the new ATA layer.
@@ -354,6 +379,7 @@
 config PATA_HPT3X2N
 	tristate "HPT 372N/302N PATA support (Very Experimental)"
 	depends on PCI && EXPERIMENTAL
+	select ATA_SFF
 	help
 	  This option enables support for the N variant HPT PATA
 	  controllers via the new ATA layer
@@ -363,6 +389,7 @@
 config PATA_HPT3X3
 	tristate "HPT 343/363 PATA support"
 	depends on PCI
+	select ATA_SFF
 	help
 	  This option enables support for the HPT 343/363
 	  PATA controllers via the new ATA layer
@@ -372,6 +399,7 @@
 config PATA_HPT3X3_DMA
 	bool "HPT 343/363 DMA support (Experimental)"
 	depends on PATA_HPT3X3
+	select ATA_SFF
 	help
 	  This option enables DMA support for the HPT343/363
 	  controllers. Enable with care as there are still some
@@ -380,6 +408,7 @@
 config PATA_ISAPNP
 	tristate "ISA Plug and Play PATA support (Experimental)"
 	depends on EXPERIMENTAL && ISAPNP
+	select ATA_SFF
 	help
 	  This option enables support for ISA plug & play ATA
 	  controllers such as those found on old soundcards.
@@ -389,6 +418,7 @@
 config PATA_IT821X
 	tristate "IT8211/2 PATA support"
 	depends on PCI
+	select ATA_SFF
 	help
 	  This option enables support for the ITE 8211 and 8212
 	  PATA controllers via the new ATA layer, including RAID
@@ -399,6 +429,7 @@
 config PATA_IT8213
 	tristate "IT8213 PATA support (Experimental)"
 	depends on PCI && EXPERIMENTAL
+	select ATA_SFF
 	help
 	  This option enables support for the ITE 821 PATA
           controllers via the new ATA layer.
@@ -408,6 +439,7 @@
 config PATA_JMICRON
 	tristate "JMicron PATA support"
 	depends on PCI
+	select ATA_SFF
 	help
 	  Enable support for the JMicron IDE controller, via the new
 	  ATA layer.
@@ -417,6 +449,7 @@
 config PATA_LEGACY
 	tristate "Legacy ISA PATA support (Experimental)"
 	depends on ISA && EXPERIMENTAL
+	select ATA_SFF
 	help
 	  This option enables support for ISA/VLB bus legacy PATA
 	  ports and allows them to be accessed via the new ATA layer.
@@ -426,6 +459,7 @@
 config PATA_TRIFLEX
 	tristate "Compaq Triflex PATA support"
 	depends on PCI
+	select ATA_SFF
 	help
 	  Enable support for the Compaq 'Triflex' IDE controller as found
 	  on many Compaq Pentium-Pro systems, via the new ATA layer.
@@ -435,6 +469,7 @@
 config PATA_MARVELL
 	tristate "Marvell PATA support via legacy mode"
 	depends on PCI
+	select ATA_SFF
 	help
 	  This option enables limited support for the Marvell 88SE6145 ATA
 	  controller.
@@ -444,6 +479,7 @@
 config PATA_MPC52xx
 	tristate "Freescale MPC52xx SoC internal IDE"
 	depends on PPC_MPC52xx
+	select ATA_SFF
 	help
 	  This option enables support for integrated IDE controller
 	  of the Freescale MPC52xx SoC.
@@ -453,6 +489,7 @@
 config PATA_MPIIX
 	tristate "Intel PATA MPIIX support"
 	depends on PCI
+	select ATA_SFF
 	help
 	  This option enables support for MPIIX PATA support.
 
@@ -461,6 +498,7 @@
 config PATA_OLDPIIX
 	tristate "Intel PATA old PIIX support"
 	depends on PCI
+	select ATA_SFF
 	help
 	  This option enables support for early PIIX PATA support.
 
@@ -469,6 +507,7 @@
 config PATA_NETCELL
 	tristate "NETCELL Revolution RAID support"
 	depends on PCI
+	select ATA_SFF
 	help
 	  This option enables support for the Netcell Revolution RAID
 	  PATA controller.
@@ -478,6 +517,7 @@
 config PATA_NINJA32
 	tristate "Ninja32/Delkin Cardbus ATA support (Experimental)"
 	depends on PCI && EXPERIMENTAL
+	select ATA_SFF
 	help
 	  This option enables support for the Ninja32, Delkin and
 	  possibly other brands of Cardbus ATA adapter
@@ -487,6 +527,7 @@
 config PATA_NS87410
 	tristate "Nat Semi NS87410 PATA support (Experimental)"
 	depends on PCI && EXPERIMENTAL
+	select ATA_SFF
 	help
 	  This option enables support for the National Semiconductor
 	  NS87410 PCI-IDE controller.
@@ -496,6 +537,7 @@
 config PATA_NS87415
 	tristate "Nat Semi NS87415 PATA support (Experimental)"
 	depends on PCI && EXPERIMENTAL
+	select ATA_SFF
 	help
 	  This option enables support for the National Semiconductor
 	  NS87415 PCI-IDE controller.
@@ -505,6 +547,7 @@
 config PATA_OPTI
 	tristate "OPTI621/6215 PATA support (Very Experimental)"
 	depends on PCI && EXPERIMENTAL
+	select ATA_SFF
 	help
 	  This option enables full PIO support for the early Opti ATA
 	  controllers found on some old motherboards.
@@ -514,6 +557,7 @@
 config PATA_OPTIDMA
 	tristate "OPTI FireStar PATA support (Very Experimental)"
 	depends on PCI && EXPERIMENTAL
+	select ATA_SFF
 	help
 	  This option enables DMA/PIO support for the later OPTi
 	  controllers found on some old motherboards and in some
@@ -524,6 +568,7 @@
 config PATA_PCMCIA
 	tristate "PCMCIA PATA support"
 	depends on PCMCIA
+	select ATA_SFF
 	help
 	  This option enables support for PCMCIA ATA interfaces, including
 	  compact flash card adapters via the new ATA layer.
@@ -533,6 +578,7 @@
 config PATA_PDC_OLD
 	tristate "Older Promise PATA controller support (Experimental)"
 	depends on PCI && EXPERIMENTAL
+	select ATA_SFF
 	help
 	  This option enables support for the Promise 20246, 20262, 20263,
 	  20265 and 20267 adapters.
@@ -542,12 +588,14 @@
 config PATA_QDI
 	tristate "QDI VLB PATA support"
 	depends on ISA
+	select ATA_SFF
 	help
 	  Support for QDI 6500 and 6580 PATA controllers on VESA local bus.
 
 config PATA_RADISYS
 	tristate "RADISYS 82600 PATA support (Very Experimental)"
 	depends on PCI && EXPERIMENTAL
+	select ATA_SFF
 	help
 	  This option enables support for the RADISYS 82600
 	  PATA controllers via the new ATA layer
@@ -557,6 +605,7 @@
 config PATA_RB500
 	tristate "RouterBoard 500 PATA CompactFlash support"
 	depends on MIKROTIK_RB500
+	select ATA_SFF
 	help
 	  This option enables support for the RouterBoard 500
 	  PATA CompactFlash controller.
@@ -566,6 +615,7 @@
 config PATA_RZ1000
 	tristate "PC Tech RZ1000 PATA support"
 	depends on PCI
+	select ATA_SFF
 	help
 	  This option enables basic support for the PC Tech RZ1000/1
 	  PATA controllers via the new ATA layer
@@ -575,6 +625,7 @@
 config PATA_SC1200
 	tristate "SC1200 PATA support (Very Experimental)"
 	depends on PCI && EXPERIMENTAL
+	select ATA_SFF
 	help
 	  This option enables support for the NatSemi/AMD SC1200 SoC
 	  companion chip used with the Geode processor family.
@@ -584,6 +635,7 @@
 config PATA_SERVERWORKS
 	tristate "SERVERWORKS OSB4/CSB5/CSB6/HT1000 PATA support"
 	depends on PCI
+	select ATA_SFF
 	help
 	  This option enables support for the Serverworks OSB4/CSB5/CSB6 and
 	  HT1000 PATA controllers, via the new ATA layer.
@@ -593,6 +645,7 @@
 config PATA_PDC2027X
 	tristate "Promise PATA 2027x support"
 	depends on PCI
+	select ATA_SFF
 	help
 	  This option enables support for Promise PATA pdc20268 to pdc20277 host adapters.
 
@@ -601,6 +654,7 @@
 config PATA_SIL680
 	tristate "CMD / Silicon Image 680 PATA support"
 	depends on PCI
+	select ATA_SFF
 	help
 	  This option enables support for CMD / Silicon Image 680 PATA.
 
@@ -609,6 +663,7 @@
 config PATA_SIS
 	tristate "SiS PATA support (Experimental)"
 	depends on PCI && EXPERIMENTAL
+	select ATA_SFF
 	help
 	  This option enables support for SiS PATA controllers
 
@@ -617,6 +672,7 @@
 config PATA_VIA
 	tristate "VIA PATA support"
 	depends on PCI
+	select ATA_SFF
 	help
 	  This option enables support for the VIA PATA interfaces
 	  found on the many VIA chipsets.
@@ -626,6 +682,7 @@
 config PATA_WINBOND
 	tristate "Winbond SL82C105 PATA support"
 	depends on PCI
+	select ATA_SFF
 	help
 	  This option enables support for SL82C105 PATA devices found in the
 	  Netwinder and some other systems
@@ -635,6 +692,7 @@
 config PATA_WINBOND_VLB
 	tristate "Winbond W83759A VLB PATA support (Experimental)"
 	depends on ISA && EXPERIMENTAL
+	select ATA_SFF
 	help
 	  Support for the Winbond W83759A controller on Vesa Local Bus
 	  systems.
@@ -642,6 +700,7 @@
 config PATA_PLATFORM
 	tristate "Generic platform device PATA support"
 	depends on EMBEDDED || ARCH_RPC || PPC
+	select ATA_SFF
 	help
 	  This option enables support for generic directly connected ATA
 	  devices commonly found on embedded systems.
@@ -651,6 +710,7 @@
 config PATA_OF_PLATFORM
 	tristate "OpenFirmware platform device PATA support"
 	depends on PATA_PLATFORM && PPC_OF
+	select ATA_SFF
 	help
 	  This option enables support for generic directly connected ATA
 	  devices commonly found on embedded systems with OpenFirmware
@@ -661,6 +721,7 @@
 config PATA_ICSIDE
 	tristate "Acorn ICS PATA support"
 	depends on ARM && ARCH_ACORN
+	select ATA_SFF
 	help
 	  On Acorn systems, say Y here if you wish to use the ICS PATA
 	  interface card.  This is not required for ICS partition support.
@@ -669,6 +730,7 @@
 config PATA_IXP4XX_CF
 	tristate "IXP4XX Compact Flash support"
 	depends on ARCH_IXP4XX
+	select ATA_SFF
 	help
 	  This option enables support for a Compact Flash connected on
 	  the ixp4xx expansion bus. This driver had been written for
@@ -679,6 +741,7 @@
 config PATA_SCC
 	tristate "Toshiba's Cell Reference Set IDE support"
 	depends on PCI && PPC_CELLEB
+	select ATA_SFF
 	help
 	  This option enables support for the built-in IDE controller on
 	  Toshiba Cell Reference Board.
@@ -688,11 +751,11 @@
 config PATA_BF54X
 	tristate "Blackfin 54x ATAPI support"
 	depends on BF542 || BF548 || BF549
+	select ATA_SFF
 	help
 	  This option enables support for the built-in ATAPI controller on
 	  Blackfin 54x family chips.
 
 	  If unsure, say N.
 
-endif # ATA_SFF
 endif # ATA


^ permalink raw reply	[flat|nested] 71+ messages in thread

* Re: 2.6.25-rc8-mm2
  2008-04-12 18:43     ` 2.6.25-rc8-mm2 Aneesh Kumar K.V
@ 2008-04-13  7:45       ` Michael Schmitz
  2008-04-13  8:11         ` 2.6.25-rc8-mm2 Geert Uytterhoeven
  0 siblings, 1 reply; 71+ messages in thread
From: Michael Schmitz @ 2008-04-13  7:45 UTC (permalink / raw
  To: Aneesh Kumar K.V
  Cc: Andrew Morton, Adrian Bunk, linux-kernel, linux-ext4, linux-m68k

>> I saw some related linux-ext4 email float past yesterday, so something
>> might be happening.
>>
>
> It would really help if we can get some m68k people to look at patch
> i posted.
>
> http://article.gmane.org/gmane.comp.file-systems.ext4/5944

I'd love to - only I can't seem to compile 2.6.25 at all. What's the 
minimum gcc version I should use?

 	Michael


^ permalink raw reply	[flat|nested] 71+ messages in thread

* Re: 2.6.25-rc8-mm2
  2008-04-13  7:45       ` 2.6.25-rc8-mm2 Michael Schmitz
@ 2008-04-13  8:11         ` Geert Uytterhoeven
  0 siblings, 0 replies; 71+ messages in thread
From: Geert Uytterhoeven @ 2008-04-13  8:11 UTC (permalink / raw
  To: Michael Schmitz
  Cc: Aneesh Kumar K.V, Andrew Morton, Adrian Bunk, linux-kernel,
	linux-ext4, linux-m68k

On Sun, 13 Apr 2008, Michael Schmitz wrote:
> > > I saw some related linux-ext4 email float past yesterday, so something
> > > might be happening.
> > 
> > It would really help if we can get some m68k people to look at patch
> > i posted.
> > 
> > http://article.gmane.org/gmane.comp.file-systems.ext4/5944
> 
> I'd love to - only I can't seem to compile 2.6.25 at all. What's the minimum
> gcc version I should use?

My cross-compiler is gcc version 4.1.2 20061115 (prerelease) (Debian 4.1.1-21).

Gr{oetje,eeting}s,

						Geert

P.S. I'll look into it when I find some time, but you may beat me to it...
--
Geert Uytterhoeven -- There's lots of Linux beyond ia32 -- geert@linux-m68k.org

In personal conversations with technical people, I call myself a hacker. But
when I'm talking to journalists I just say "programmer" or something like that.
							    -- Linus Torvalds

^ permalink raw reply	[flat|nested] 71+ messages in thread

* 2.6.25-rc8-mm2: IP: [<ffffffff802868f9>] __kmalloc+0x69/0x110
  2008-04-11  3:33 2.6.25-rc8-mm2 Andrew Morton
                   ` (8 preceding siblings ...)
  2008-04-12 10:47 ` 2.6.25-rc8-mm2 - ftraced chews 100% of a CPU Valdis.Kletnieks
@ 2008-04-13 20:44 ` Alexey Dobriyan
  2008-04-13 20:53   ` Andrew Morton
                     ` (2 more replies)
  2008-04-14  8:07 ` BUG at __dentry_open [Was: 2.6.25-rc8-mm2] Jiri Slaby
  2008-04-15 17:00 ` 2.6.25-rc8-mm2: Observed problems: Not a detailed bug report Zan Lynx
  11 siblings, 3 replies; 71+ messages in thread
From: Alexey Dobriyan @ 2008-04-13 20:44 UTC (permalink / raw
  To: Andrew Morton; +Cc: linux-kernel, clameter, penberg

Grrr, I was hunting for oopses in dup_fd and near that were plaguing one
box here for far too long, and hit below.

What happened if freshly booted box (probably not all init scripts finished),
X already started. ssh from another box and reboot from session.


(gdb) p __kmalloc
$1 = {void *(size_t, gfp_t)} 0xffffffff80286890 <__kmalloc>
(gdb) l *(0xffffffff80286890 + 0x69)
0xffffffff802868f9 is in __kmalloc (mm/slub.c:1663).
1658
1659                    object = __slab_alloc(s, gfpflags, node, addr, c);
1660
1661            else {
1662                    object = c->freelist;
1663      ===>          c->freelist = object[c->offset];   <===
1664                    stat(c, ALLOC_FASTPATH);
1665            }
1666            local_irq_restore(flags);



BUG: unable to handle kernel paging request at 0000000500000500
IP: [<ffffffff802868f9>] __kmalloc+0x69/0x110
PGD 17e04a067 PUD 0 
Oops: 0000 [1] SMP DEBUG_PAGEALLOC
last sysfs file: /sys/devices/pci0000:00/0000:00:1e.0/0000:05:02.0/resource
CPU 1 
Modules linked in: nf_conntrack_irc ipt_MASQUERADE iptable_nat nf_nat nf_conntrack_ipv4 xt_state nf_conntrack iptable_filter ip_tables x_tables usblp ehci_hcd uhci_hcd usbcore sr_mod cdrom
Pid: 4966, comm: depscan.sh Not tainted 2.6.25-rc8-mm2 #20
RIP: 0010:[<ffffffff802868f9>]  [<ffffffff802868f9>] __kmalloc+0x69/0x110
RSP: 0018:ffff81017cba9c68  EFLAGS: 00010006
RAX: 0000000000000000 RBX: ffffffff805c3950 RCX: ffff81017e7bb278
RDX: ffff81017c868000 RSI: 0000000000000001 RDI: ffffffff802868db
RBP: ffff81017cba9c98 R08: 0000000000000000 R09: 0000000000000001
R10: 0000000005050561 R11: 00000000036c00b1 R12: 0000000500000500
R13: 0000000000000282 R14: 00000000000080d0 R15: ffff810001070360
FS:  00007fc9d17276f0(0000) GS:ffff81017fc44600(0000) knlGS:0000000000000000
CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
CR2: 0000000500000500 CR3: 000000017c9c2000 CR4: 00000000000006e0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
Process depscan.sh (pid: 4966, threadinfo ffff81017cba8000, task ffff81017c868000)
Stack:  ffffffff802d4a42 ffff81017e7bb278 ffff81017e7bb278 00000000fe5c5c7c
 000000000cb4c2b8 ffff81017efdc8c0 ffff81017cba9cd8 ffffffff802d4a42
 ffff81017cba9cd8 ffff81017e7bb278 ffff81017f82e2a0 ffff81017cba9da8
Call Trace:
 [<ffffffff802d4a42>] ? ext3_htree_store_dirent+0x32/0x120
 [<ffffffff802d4a42>] ext3_htree_store_dirent+0x32/0x120
 [<ffffffff802dba25>] htree_dirblock_to_tree+0x105/0x170
 [<ffffffff802de30d>] ext3_htree_fill_tree+0x7d/0x220
 [<ffffffff80252d59>] ? trace_hardirqs_on_caller+0xc9/0x150
 [<ffffffff802d50f4>] ? ext3_readdir+0x5c4/0x630
 [<ffffffff802d4c74>] ext3_readdir+0x144/0x630
 [<ffffffff802975f0>] ? filldir+0x0/0xe0
 [<ffffffff8045475a>] ? __mutex_lock_common+0x22a/0x330
 [<ffffffff80297741>] ? vfs_readdir+0x71/0xc0
 [<ffffffff802975f0>] ? filldir+0x0/0xe0
 [<ffffffff802975f0>] ? filldir+0x0/0xe0
 [<ffffffff80297773>] vfs_readdir+0xa3/0xc0
 [<ffffffff80297822>] sys_getdents+0x92/0xd0
 [<ffffffff8020b4cb>] system_call_after_swapgs+0x7b/0x80


Code: 48 89 45 d0 9c 41 5d fa e8 f5 a5 fc ff 65 8b 04 25 24 00 00 00 48 98 4c 8b bc c3 c8 00 00 00 4d 8b 27 4d 85 e4 74 7a 41 8b 47 14 <49> 8b 04 c4 49 89 07 41 f7 c5 00 02 00 00 75 37 41 55 9d e8 bf 
RIP  [<ffffffff802868f9>] __kmalloc+0x69/0x110
 RSP <ffff81017cba9c68>
CR2: 0000000500000500
---[ end trace f513ce88520d2ac0 ]---
BUG: sleeping function called from invalid context at kernel/rwsem.c:21
in_atomic():0, irqs_disabled():1
INFO: lockdep is turned off.
irq event stamp: 19250
hardirqs last  enabled at (19249): [<ffffffff80252ded>] trace_hardirqs_on+0xd/0x10
hardirqs last disabled at (19250): [<ffffffff80250edd>] trace_hardirqs_off+0xd/0x10
softirqs last  enabled at (14334): [<ffffffff80236aae>] __do_softirq+0xee/0x110
softirqs last disabled at (14329): [<ffffffff8020c77c>] call_softirq+0x1c/0x30
Pid: 4966, comm: depscan.sh Tainted: G      D   2.6.25-rc8-mm2 #20

Call Trace:
 [<ffffffff802523f0>] ? print_irqtrace_events+0x110/0x120
 [<ffffffff802280e7>] __might_sleep+0xc7/0xe0
 [<ffffffff80454aed>] down_read+0x1d/0x50
 [<ffffffff80232e6e>] exit_mm+0x2e/0xf0
 [<ffffffff802339a9>] do_exit+0x189/0x760
 [<ffffffff80227f6e>] ? __wake_up+0x4e/0x70
 [<ffffffff8020c8a5>] oops_end+0x85/0x90
 [<ffffffff802209ec>] do_page_fault+0x3fc/0x890
 [<ffffffff80253a65>] ? __lock_acquire+0x645/0xc50
 [<ffffffff8045639d>] error_exit+0x0/0xa9
 [<ffffffff802868db>] ? __kmalloc+0x4b/0x110
 [<ffffffff802868f9>] ? __kmalloc+0x69/0x110
 [<ffffffff802868db>] ? __kmalloc+0x4b/0x110
 [<ffffffff802d4a42>] ? ext3_htree_store_dirent+0x32/0x120
 [<ffffffff802d4a42>] ? ext3_htree_store_dirent+0x32/0x120
 [<ffffffff802dba25>] ? htree_dirblock_to_tree+0x105/0x170
 [<ffffffff802de30d>] ? ext3_htree_fill_tree+0x7d/0x220
 [<ffffffff80252d59>] ? trace_hardirqs_on_caller+0xc9/0x150
 [<ffffffff802d50f4>] ? ext3_readdir+0x5c4/0x630
 [<ffffffff802d4c74>] ? ext3_readdir+0x144/0x630
 [<ffffffff802975f0>] ? filldir+0x0/0xe0
 [<ffffffff8045475a>] ? __mutex_lock_common+0x22a/0x330
 [<ffffffff80297741>] ? vfs_readdir+0x71/0xc0
 [<ffffffff802975f0>] ? filldir+0x0/0xe0
 [<ffffffff802975f0>] ? filldir+0x0/0xe0
 [<ffffffff80297773>] ? vfs_readdir+0xa3/0xc0
 [<ffffffff80297822>] ? sys_getdents+0x92/0xd0
 [<ffffffff8020b4cb>] ? system_call_after_swapgs+0x7b/0x80

BUG: unable to handle kernel paging request at 0000000500000500
IP: [<ffffffff80286672>] kmem_cache_alloc+0x52/0xd0
PGD 17e277067 PUD 0 
Oops: 0000 [2] SMP DEBUG_PAGEALLOC
last sysfs file: /sys/devices/pci0000:00/0000:00:1e.0/0000:05:02.0/resource
CPU 1 
Modules linked in: nf_conntrack_irc ipt_MASQUERADE iptable_nat nf_nat nf_conntrack_ipv4 xt_state nf_conntrack iptable_filter ip_tables x_tables usblp ehci_hcd uhci_hcd usbcore sr_mod cdrom
Pid: 4951, comm: bash Tainted: G      D   2.6.25-rc8-mm2 #20
RIP: 0010:[<ffffffff80286672>]  [<ffffffff80286672>] kmem_cache_alloc+0x52/0xd0
RSP: 0018:ffff81017d76dca8  EFLAGS: 00010006
RAX: 0000000000000000 RBX: 0000000500000500 RCX: 0000000000000001
RDX: ffff81017ed45eb0 RSI: 00000000000000d0 RDI: ffffffff80286653
RBP: ffff81017d76dcd8 R08: 0000000000000001 R09: 0000000000000000
R10: ffffffff8022e3b7 R11: 0000000000000000 R12: 0000000000000282
R13: ffff810001070360 R14: 00000000000000d0 R15: ffffffff805c3950
FS:  00007fd0a07386f0(0000) GS:ffff81017fc44600(0000) knlGS:0000000000000000
CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
CR2: 0000000500000500 CR3: 000000017c959000 CR4: 00000000000006e0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
Process bash (pid: 4951, threadinfo ffff81017d76c000, task ffff81017ed45eb0)
Stack:  ffffffff802a0bea 00000000000000ff ffff81017ef30f00 0000000000000100
 ffff81017ef30f10 0000000000000100 ffff81017d76dd28 ffffffff802a0bea
 ffff81017ef30f80 ffff81017ef30f80 ffff81017ee36940 00000000000000c0
Call Trace:
 [<ffffffff802a0bea>] ? expand_files+0xaa/0x300
 [<ffffffff802a0bea>] expand_files+0xaa/0x300
 [<ffffffff8022e3c2>] dup_fd+0x292/0x2d0
 [<ffffffff8022e454>] copy_files+0x54/0x80
 [<ffffffff8022cec7>] ? sched_fork+0x37/0x70
 [<ffffffff8022f346>] copy_process+0x5d6/0x1590
 [<ffffffff80455c46>] ? _spin_unlock+0x26/0x30
 [<ffffffff80230382>] do_fork+0x82/0x280
 [<ffffffff80252ded>] ? trace_hardirqs_on+0xd/0x10
 [<ffffffff80455c0b>] ? _spin_unlock_irq+0x2b/0x40
 [<ffffffff8023b856>] ? sigprocmask+0x86/0xf0
 [<ffffffff8020b4cb>] ? system_call_after_swapgs+0x7b/0x80
 [<ffffffff80209393>] sys_clone+0x23/0x30
 [<ffffffff8020b867>] ptregscall_common+0x67/0xb0


Code: 89 45 d0 9c 41 5c fa e8 7d a8 fc ff 65 8b 04 25 24 00 00 00 48 98 4d 8b ac c7 c8 00 00 00 49 8b 5d 00 48 85 db 74 51 41 8b 45 14 <48> 8b 04 c3 49 89 45 00 41 f7 c4 00 02 00 00 75 32 41 54 9d e8 
RIP  [<ffffffff80286672>] kmem_cache_alloc+0x52/0xd0
 RSP <ffff81017d76dca8>
CR2: 0000000500000500
---[ end trace f513ce88520d2ac0 ]---
BUG: unable to handle kernel paging request at 0000000500000500
IP: [<ffffffff80286672>] kmem_cache_alloc+0x52/0xd0
PGD 17fe9f067 PUD 0 
Oops: 0000 [3] SMP DEBUG_PAGEALLOC
last sysfs file: /sys/devices/pci0000:00/0000:00:1e.0/0000:05:02.0/resource
CPU 1 
Modules linked in: nf_conntrack_irc ipt_MASQUERADE iptable_nat nf_nat nf_conntrack_ipv4 xt_state nf_conntrack iptable_filter ip_tables x_tables usblp ehci_hcd uhci_hcd usbcore sr_mod cdrom
Pid: 1, comm: init Tainted: G      D   2.6.25-rc8-mm2 #20
RIP: 0010:[<ffffffff80286672>]  [<ffffffff80286672>] kmem_cache_alloc+0x52/0xd0
RSP: 0018:ffff81017fc9bc78  EFLAGS: 00010006
RAX: 0000000000000000 RBX: 0000000500000500 RCX: 00000000fffffffb
RDX: ffff81017fca0000 RSI: 00000000000000d0 RDI: ffffffff80286653
RBP: ffff81017fc9bca8 R08: 0000000000000001 R09: 0000000000000000
R10: ffff81017fc9bbc8 R11: 0000000000000000 R12: 0000000000000286
R13: ffff810001070360 R14: 00000000000000d0 R15: ffffffff805c3950
FS:  00007f47ac8376f0(0000) GS:ffff81017fc44600(0000) knlGS:0000000000000000
CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
CR2: 0000000500000500 CR3: 000000017ec34000 CR4: 00000000000006e0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
Process init (pid: 1, threadinfo ffff81017fc9a000, task ffff81017fca0000)
Stack:  ffffffff80344a3d ffff81017fdf99f8 ffff81017e1f4000 ffff81017fe52400
 0000000000000000 ffff81017fdf9bf0 ffff81017fc9bd18 ffffffff80344a3d
 ffff81017fc9bcd8 ffff81017fc9bd30 ffff81017fe52400 0000000000000000
Call Trace:
 [<ffffffff80344a3d>] ? init_dev+0x2ad/0x600
 [<ffffffff80344a3d>] init_dev+0x2ad/0x600
 [<ffffffff8034504f>] tty_open+0xff/0x330
 [<ffffffff8024aaf3>] ? down+0x33/0x50
 [<ffffffff8028cdeb>] chrdev_open+0xbb/0x1c0
 [<ffffffff8028cd30>] ? chrdev_open+0x0/0x1c0
 [<ffffffff80288254>] __dentry_open+0xd4/0x330
 [<ffffffff802884f4>] nameidata_to_filp+0x44/0x60
 [<ffffffff8029595f>] do_filp_open+0x25f/0x8d0
 [<ffffffff80288043>] ? get_unused_fd_flags+0x103/0x130
 [<ffffffff802880c8>] do_sys_open+0x58/0xb0
 [<ffffffff8028814b>] sys_open+0x1b/0x20
 [<ffffffff8020b4cb>] system_call_after_swapgs+0x7b/0x80


Code: 89 45 d0 9c 41 5c fa e8 7d a8 fc ff 65 8b 04 25 24 00 00 00 48 98 4d 8b ac c7 c8 00 00 00 49 8b 5d 00 48 85 db 74 51 41 8b 45 14 <48> 8b 04 c3 49 89 45 00 41 f7 c4 00 02 00 00 75 32 41 54 9d e8 
RIP  [<ffffffff80286672>] kmem_cache_alloc+0x52/0xd0
 RSP <ffff81017fc9bc78>
CR2: 0000000500000500
---[ end trace f513ce88520d2ac0 ]---
Kernel panic - not syncing: Attempted to kill init!
Pid: 1, comm: init Tainted: G      D   2.6.25-rc8-mm2 #20

Call Trace:
 [<ffffffff80230ee0>] panic+0xa0/0x180
 [<ffffffff80231df7>] ? printk+0x67/0x70
 [<ffffffff80250edd>] ? trace_hardirqs_off+0xd/0x10
 [<ffffffff80233f71>] do_exit+0x751/0x760
 [<ffffffff80227f6e>] ? __wake_up+0x4e/0x70
 [<ffffffff8020c8a5>] oops_end+0x85/0x90
 [<ffffffff802209ec>] do_page_fault+0x3fc/0x890
 [<ffffffff8045639d>] error_exit+0x0/0xa9
 [<ffffffff80286653>] ? kmem_cache_alloc+0x33/0xd0
 [<ffffffff80286672>] ? kmem_cache_alloc+0x52/0xd0
 [<ffffffff80286653>] ? kmem_cache_alloc+0x33/0xd0
 [<ffffffff80344a3d>] ? init_dev+0x2ad/0x600
 [<ffffffff80344a3d>] ? init_dev+0x2ad/0x600
 [<ffffffff8034504f>] ? tty_open+0xff/0x330
 [<ffffffff8024aaf3>] ? down+0x33/0x50
 [<ffffffff8028cdeb>] ? chrdev_open+0xbb/0x1c0
 [<ffffffff8028cd30>] ? chrdev_open+0x0/0x1c0
 [<ffffffff80288254>] ? __dentry_open+0xd4/0x330
 [<ffffffff802884f4>] ? nameidata_to_filp+0x44/0x60
 [<ffffffff8029595f>] ? do_filp_open+0x25f/0x8d0
 [<ffffffff80288043>] ? get_unused_fd_flags+0x103/0x130
 [<ffffffff802880c8>] ? do_sys_open+0x58/0xb0
 [<ffffffff8028814b>] ? sys_open+0x1b/0x20
 [<ffffffff8020b4cb>] ? system_call_after_swapgs+0x7b/0x80





# CONFIG_DEBUG_DRIVER is not set
# CONFIG_DEBUG_DEVRES is not set
# CONFIG_DEBUG_FS is not set
CONFIG_DEBUG_KERNEL=y
# CONFIG_DEBUG_SHIRQ is not set
CONFIG_DEBUG_OBJECTS=y
# CONFIG_DEBUG_OBJECTS_SELFTEST is not set
CONFIG_DEBUG_OBJECTS_FREE=y
CONFIG_DEBUG_OBJECTS_TIMERS=y
CONFIG_DEBUG_RT_MUTEXES=y
CONFIG_DEBUG_PI_LIST=y
CONFIG_DEBUG_SPINLOCK=y
CONFIG_DEBUG_MUTEXES=y
CONFIG_DEBUG_LOCK_ALLOC=y
# CONFIG_DEBUG_LOCKDEP is not set
CONFIG_DEBUG_SPINLOCK_SLEEP=y
# CONFIG_DEBUG_LOCKING_API_SELFTESTS is not set
# CONFIG_DEBUG_KOBJECT is not set
CONFIG_DEBUG_BUGVERBOSE=y
CONFIG_DEBUG_INFO=y
CONFIG_DEBUG_VM=y
CONFIG_DEBUG_WRITECOUNT=y
CONFIG_DEBUG_LIST=y
CONFIG_DEBUG_SG=y
# CONFIG_DEBUG_SYNCHRO_TEST is not set
# CONFIG_DEBUG_STACKOVERFLOW is not set
# CONFIG_DEBUG_STACK_USAGE is not set
CONFIG_DEBUG_PAGEALLOC=y
CONFIG_DEBUG_PER_CPU_MAPS=y
CONFIG_DEBUG_RODATA=y
CONFIG_DEBUG_RODATA_TEST=y
# CONFIG_DEBUG_NX_TEST is not set


^ permalink raw reply	[flat|nested] 71+ messages in thread

* Re: 2.6.25-rc8-mm2: IP: [<ffffffff802868f9>] __kmalloc+0x69/0x110
  2008-04-13 20:44 ` 2.6.25-rc8-mm2: IP: [<ffffffff802868f9>] __kmalloc+0x69/0x110 Alexey Dobriyan
@ 2008-04-13 20:53   ` Andrew Morton
  2008-04-14 16:18     ` Pekka Enberg
  2008-04-14 18:00     ` Christoph Lameter
  2008-04-14  9:51   ` Pekka J Enberg
  2008-04-14 16:01   ` Pekka Enberg
  2 siblings, 2 replies; 71+ messages in thread
From: Andrew Morton @ 2008-04-13 20:53 UTC (permalink / raw
  To: Alexey Dobriyan; +Cc: linux-kernel, clameter, penberg

On Mon, 14 Apr 2008 00:44:22 +0400 Alexey Dobriyan <adobriyan@gmail.com> wrote:

> Grrr, I was hunting for oopses in dup_fd and near that were plaguing one
> box here for far too long, and hit below.
> 
> What happened if freshly booted box (probably not all init scripts finished),
> X already started. ssh from another box and reboot from session.
> 
> 
> (gdb) p __kmalloc
> $1 = {void *(size_t, gfp_t)} 0xffffffff80286890 <__kmalloc>
> (gdb) l *(0xffffffff80286890 + 0x69)
> 0xffffffff802868f9 is in __kmalloc (mm/slub.c:1663).
> 1658
> 1659                    object = __slab_alloc(s, gfpflags, node, addr, c);
> 1660
> 1661            else {
> 1662                    object = c->freelist;
> 1663      ===>          c->freelist = object[c->offset];   <===
> 1664                    stat(c, ALLOC_FASTPATH);
> 1665            }
> 1666            local_irq_restore(flags);

Yes, I don't think the sub changes are ready for prime-time.

There is a fix in
ftp://ftp.kernel.org/pub/linux/kernel/people/akpm/patches/2.6/2.6.25-rc8/2.6.25-rc8-mm2/hot-fixes
but it won't help this crash.

^ permalink raw reply	[flat|nested] 71+ messages in thread

* Re: 2.6.25-rc8-mm2
  2008-04-11 14:24             ` 2.6.25-rc8-mm2 Pekka Enberg
@ 2008-04-14  3:13               ` KAMEZAWA Hiroyuki
  0 siblings, 0 replies; 71+ messages in thread
From: KAMEZAWA Hiroyuki @ 2008-04-14  3:13 UTC (permalink / raw
  To: Pekka Enberg; +Cc: Andrew Morton, linux-kernel, Christoph Lameter

On Fri, 11 Apr 2008 17:24:11 +0300
Pekka Enberg <penberg@cs.helsinki.fi> wrote:
 
> >     The KICKABLE flag is set again when all objects of the slab have been
> >     allocated (Occurs during removal of a slab from the partial lists).
> > 
> >     Reviewed-by: Rik van Riel <riel@redhat.com>
> >     Signed-off-by: Christoph Lameter <clameter@sgi.com>
> >     Signed-off-by: Pekka Enberg <penberg@cs.helsinki.fi>
> 
> My bad, sorry. Fixed and pushed out.
> 
Works well. Thank you.
Tested-by: KAMEZAWA Hiroyuki <kamezawa.hiroyu@jp.fujitsu.com>

-Kame


^ permalink raw reply	[flat|nested] 71+ messages in thread

* BUG at __dentry_open [Was: 2.6.25-rc8-mm2]
  2008-04-11  3:33 2.6.25-rc8-mm2 Andrew Morton
                   ` (9 preceding siblings ...)
  2008-04-13 20:44 ` 2.6.25-rc8-mm2: IP: [<ffffffff802868f9>] __kmalloc+0x69/0x110 Alexey Dobriyan
@ 2008-04-14  8:07 ` Jiri Slaby
  2008-04-15 17:00 ` 2.6.25-rc8-mm2: Observed problems: Not a detailed bug report Zan Lynx
  11 siblings, 0 replies; 71+ messages in thread
From: Jiri Slaby @ 2008-04-14  8:07 UTC (permalink / raw
  To: Andrew Morton
  Cc: linux-kernel, sct, adilger, linux-ext4, Al Viro, linux-fsdevel

On 04/11/2008 05:33 AM, Andrew Morton wrote:
> ftp://ftp.kernel.org/pub/linux/kernel/people/akpm/patches/2.6/2.6.25-rc8/2.6.25-rc8-mm2/

$ cat /var/lib/rpm/Conflictname
Killed

BUG: unable to handle kernel paging request at fffff0002004c1b0
IP: [<ffffffff80296df7>] __dentry_open+0xe7/0x2d0
PGD 0
Oops: 0000 [6] SMP
last sysfs file: /sys/devices/virtual/net/tun0/statistics/collisions
CPU 1
Modules linked in: ipv6 tun bitrev test arc4 ecb crypto_blkcipher cryptomgr 
crypto_algapi ath5k mac80211 crc32 rtc_cmos usbhid sr_mod ohci1394 hid rtc_core 
cfg80211 rtc_lib ehci_hcd cdrom ieee1394 ff_memless floppy
Pid: 4388, comm: cat Tainted: G      D   2.6.25-rc8-mm2_64 #399
RIP: 0010:[<ffffffff80296df7>]  [<ffffffff80296df7>] __dentry_open+0xe7/0x2d0
RSP: 0018:ffff810028ebbd98  EFLAGS: 00010206
RAX: fffff0002004c1b0 RBX: ffff81001a62d6c0 RCX: 0000000000000000
RDX: ffff81001a62d6c0 RSI: ffff81001a62d6c0 RDI: ffff81001a62d728
RBP: ffff810028ebbdc8 R08: 0000000000000000 R09: 0000000000000000
R10: 00000000000000e6 R11: 0000000000000246 R12: ffff81002004c0a0
R13: 0000000000000000 R14: ffffffff80296770 R15: ffff81001c6583e8
FS:  00007fb9b575b6f0(0000) GS:ffff81007d006580(0000) knlGS:0000000000000000
CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
CR2: fffff0002004c1b0 CR3: 00000000268ea000 CR4: 00000000000006a0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000ffff4ff0 DR7: 0000000000000400
Process cat (pid: 4388, threadinfo ffff810028eba000, task ffff810024500000)
Stack:  ffff81007c5d4500 ffff81001a62d6c0 0000000000000000 0000000000000004
  ffff810028ebbe48 0000000000008000 ffff810028ebbde8 ffffffff802970c4
  0000000000000004 0000000000000000 ffff810028ebbf28 ffffffff802a56cb
Call Trace:
  [<ffffffff802970c4>] nameidata_to_filp+0x44/0x60
  [<ffffffff802a56cb>] do_filp_open+0x1eb/0x990
  [<ffffffff80296aec>] ? get_unused_fd_flags+0x8c/0x140
  [<ffffffff80296c16>] do_sys_open+0x76/0x110
  [<ffffffff80296cdb>] sys_open+0x1b/0x20
  [<ffffffff8020b88b>] system_call_after_swapgs+0x7b/0x80


Code: 4d 85 f6 0f 84 9b 01 00 00 48 89 de 4c 89 e7 41 ff d6 41 89 c5 85 c0 75 63 
81 63 2c 3f fc ff ff 48 8b 83 b0 00 00 00 48 8d 7b 68 <48> 8b 00 48 8b b0 08 01 
00 00 e8 ea de fd ff f6 43 2d 40 74 1f
RIP  [<ffffffff80296df7>] __dentry_open+0xe7/0x2d0
  RSP <ffff810028ebbd98>
CR2: fffff0002004c1b0
---[ end trace ae5dfe91803cf591 ]---



as the first (not tainted):
00]
BUG: unable to handle kernel paging request at fffff0002004c1b0
IP: [<ffffffff80296df7>] __dentry_open+0xe7/0x2d0
PGD 0
Oops: 0000 [1] SMP
last sysfs file: /sys/devices/platform/coretemp.1/temp1_input
CPU 0
Modules linked in: ipv6 tun bitrev test arc4 ecb crypto_blkcipher cryptomgr 
crypto_algapi ath5k mac80211 crc32 rtc_cmos usbhid sr_mod ohci1394 hid rtc_core 
cfg80211 rtc_lib ehci_hcd cdrom ieee1394 ff_memless floppy
Pid: 4348, comm: rpm Not tainted 2.6.25-rc8-mm2_64 #399
RIP: 0010:[<ffffffff80296df7>]  [<ffffffff80296df7>] __dentry_open+0xe7/0x2d0
RSP: 0018:ffff81003e95fd98  EFLAGS: 00010206
RAX: fffff0002004c1b0 RBX: ffff81003ea68cc0 RCX: 0000000000000000
RDX: ffff81003ea68cc0 RSI: ffff81003ea68cc0 RDI: ffff81003ea68d28
RBP: ffff81003e95fdc8 R08: 0000000000000000 R09: 0000000000000000
R10: 00000000000000ee R11: 0000000000000246 R12: ffff81002004c0a0
R13: 0000000000000000 R14: ffffffff80296770 R15: ffff81001c6583e8
FS:  00007f32306556f0(0000) GS:ffffffff80657000(0000) knlGS:0000000000000000
CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
CR2: fffff0002004c1b0 CR3: 00000000269ab000 CR4: 00000000000006e0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000ffff4ff0 DR7: 0000000000000400
Process rpm (pid: 4348, threadinfo ffff81003e95e000, task ffff8100245069e0)
Stack:  ffff81007c5d4500 ffff81003ea68cc0 0000000000000000 0000000000000004
  ffff81003e95fe48 0000000000008000 ffff81003e95fde8 ffffffff802970c4
  0000000000000004 0000000000000000 ffff81003e95ff28 ffffffff802a56cb
Call Trace:
  [<ffffffff802970c4>] nameidata_to_filp+0x44/0x60
  [<ffffffff802a56cb>] do_filp_open+0x1eb/0x990
  [<ffffffff802a246c>] ? path_put+0x2c/0x40
  [<ffffffff80296aec>] ? get_unused_fd_flags+0x8c/0x140
  [<ffffffff80296c16>] do_sys_open+0x76/0x110
  [<ffffffff80296cdb>] sys_open+0x1b/0x20
  [<ffffffff8020b88b>] system_call_after_swapgs+0x7b/0x80


Code: 4d 85 f6 0f 84 9b 01 00 00 48 89 de 4c 89 e7 41 ff d6 41 89 c5 85 c0 75 63 
81 63 2c 3f fc ff ff 48 8b 83 b0 00 00 00 48 8d 7b 68 <48> 8b 00 48 8b b0 08 01 
00 00 e8 ea de fd ff f6 43 2d 40 74 1f
RIP  [<ffffffff80296df7>] __dentry_open+0xe7/0x2d0
  RSP <ffff81003e95fd98>
CR2: fffff0002004c1b0





(gdb) l *0xffffffff80296df7
0xffffffff80296df7 is in __dentry_open (/home/l/latest/xxx/fs/open.c:834).
829                             goto cleanup_all;
830             }
831
832             f->f_flags &= ~(O_CREAT | O_EXCL | O_NOCTTY | O_TRUNC);
833
834             file_ra_state_init(&f->f_ra, f->f_mapping->host->i_mapping);
835
836             /* NB: we're sure to have correct a_ops only after f_op->open */
837             if (f->f_flags & O_DIRECT) {
838                     if (!f->f_mapping->a_ops ||


         .loc 1 834 0
         movq    176(%rbx), %rax # <variable>.f_mapping, <variable>.f_mapping
         leaq    104(%rbx), %rdi #, tmp92
HERE    movq    (%rax), %rax    # <variable>.host, <variable>.host
         movq    264(%rax), %rsi # <variable>.i_mapping, <variable>.i_mapping
         call    file_ra_state_init      #

So it seems like broken (freed) f_mapping. Before that, dmesg is full of
ext3_orphan_cleanup: deleting unreferenced inode 228686
ext3_orphan_cleanup: deleting unreferenced inode 245058
ext3_orphan_cleanup: deleting unreferenced inode 245070
ext3_orphan_cleanup: deleting unreferenced inode 245069
ext3_orphan_cleanup: deleting unreferenced inode 245059
ext3_orphan_cleanup: deleting unreferenced inode 228499
ext3_orphan_cleanup: deleting unreferenced inode 244841
ext3_orphan_cleanup: deleting unreferenced inode 245057
ext3_orphan_cleanup: deleting unreferenced inode 229196
ext3_orphan_cleanup: deleting unreferenced inode 228773
ext3_orphan_cleanup: deleting unreferenced inode 587535
ext3_orphan_cleanup: deleting unreferenced inode 554911
EXT3-fs: md1: 376 orphan inodes deleted


Now I got:
EXT3 Inode ffff81002009cb00: orphan list check failed!
ffff81002009cb00: 000e66cf 000e66d0 00000000 00000000
ffff81002009cb10: 00000000 00000000 00000000 00000000
ffff81002009cb20: 00000000 00000000 00000000 00000000
ffff81002009cb30: 00000000 00000000 00000000 00000000
ffff81002009cb40: 00000000 00000000 0000ffff 00000000
ffff81002009cb50: 0000001c 00000000 00000000 00000000
ffff81002009cb60: 00000000 00000006 f009cb68 ffff8100
ffff81002009cb70: 2009cb68 ffff8100 00002000 00000000
ffff81002009cb80: 148b0000 0000003c 00000001 00000000
ffff81002009cb90: 2009cb90 ffff8100 2009cb90 ffff8100
ffff81002009cba0: 00000000 00000000 00000000 00000000
ffff81002009cbb0: 00100100 00000000 00200200 00000000
ffff81002009cbc0: 2009cbc0 ffff8100 2009cbc0 ffff8100
ffff81002009cbd0: 2009cbd0 ffff8100 2009cbd0 ffff8100
ffff81002009cbe0: 0006ea1b 00000000 00000000 00000001
ffff81002009cbf0: 000001f4 000001f4 00000000 00000000
ffff81002009cc00: 00000001 00000000 00002000 00000000
ffff81002009cc10: 477fcac7 00000000 00000000 00000000
ffff81002009cc20: 477f4c94 00000000 00000000 00000000
ffff81002009cc30: 477f4c94 00000000 00000000 00000000
ffff81002009cc40: 0000000c 00000000 00000010 00000000
ffff81002009cc50: 81b40000 00000000 00000001 00000000
ffff81002009cc60: 2009cc60 ffff8100 2009cc60 ffff8100
ffff81002009cc70: 00000000 00000000 2009cc78 ffff8100
ffff81002009cc80: 2009cc78 ffff8100 8051d920 ffffffff
ffff81002009cc90: 8051d840 ffffffff 7a552400 ffff8100
ffff81002009cca0: 00000000 00000000 2009ccb0 ffff8100
ffff81002009ccb0: 2009cba0 ffff8100 00000000 00000020
ffff81002009ccc0: 00000000 00000000 01000000 00000000
ffff81002009ccd0: 00000000 00000000 00010001 00000000
ffff81002009cce0: 2009cce0 ffff8100 2009cce0 ffff8100
ffff81002009ccf0: 00000000 00000000 00000000 00000000
ffff81002009cd00: 00000000 00000000 8051db40 ffffffff
ffff81002009cd10: 001200d2 00000000 7c504bd8 ffff8100
ffff81002009cd20: 00000000 00000000 2009cd28 ffff8100
ffff81002009cd30: 2009cd28 ffff8100 00000000 00000000
ffff81002009cd40: 2009cd40 ffff8100 2009cd40 ffff8100
ffff81002009cd50: 00000000 00000000 00000000 a68b3ece
ffff81002009cd60: 00000000 00000000 00000000 00000000
ffff81002009cd70: 2009cd70 ffff8100 2009cd70 ffff8100
ffff81002009cd80: 00000001 00000000 2009cd88 ffff8100
ffff81002009cd90: 2009cd88 ffff8100 00000040 00000000
ffff81002009cda0: 00000000 00000000 00000000 00000000
ffff81002009cdb0: 00000000 00000000
Pid: 5579, comm: rrdtool Tainted: G      D   2.6.25-rc8-mm2_64 #399

Call Trace:
  [<ffffffff802fb03c>] ext3_destroy_inode+0x7c/0x80
  [<ffffffff802af11e>] destroy_inode+0x2e/0x60
  [<ffffffff802af7e3>] dispose_list+0xa3/0x120
  [<ffffffff802afaad>] shrink_icache_memory+0x24d/0x2a0
  [<ffffffff80277415>] shrink_slab+0x145/0x1e0
  [<ffffffff80278ed8>] try_to_free_pages+0x248/0x3a0
  [<ffffffff804f60ed>] ? schedule_timeout+0x5d/0xd0
  [<ffffffff80277820>] ? isolate_pages_global+0x0/0x40
  [<ffffffff80272229>] __alloc_pages_internal+0x1e9/0x470
  [<ffffffff802724cb>] __alloc_pages+0xb/0x10
  [<ffffffff802724e8>] get_zeroed_page+0x18/0x60
  [<ffffffff8027c33c>] __pte_alloc+0x2c/0xf0
  [<ffffffff8027fc9d>] handle_mm_fault+0x61d/0x6c0
  [<ffffffff804fa024>] do_page_fault+0x364/0xa30
  [<ffffffff80328fa8>] ? __up_write+0x68/0x140
  [<ffffffff804f7c29>] error_exit+0x0/0x51


Going to fsck.

Few days ago I got this (tainted) version:

BUG: unable to handle kernel paging request at ffff81f02003f16c
IP: [<ffffffff802ad7d5>] __d_lookup+0x155/0x160
PGD 0
Oops: 0000 [1] SMP
last sysfs file: /sys/devices/platform/coretemp.1/temp1_input
CPU 1
Modules linked in: ppdev parport tun bitrev ipv6 test arc4 ecb crypto_blkcipher 
cryptomgr crypto_algapi ath5k mac80211 crc32 rtc_cmos sr_mod ohci1394 rtc_core 
usbhid rtc_lib ieee1394 cdrom cfg80211 hid usblp ehci_hcd ff_memless floppy 
[last unloaded: vmnet]
Pid: 3710, comm: sensors-applet Tainted: P          2.6.25-rc8-mm2_64 #399
RIP: 0010:[<ffffffff802ad7d5>]  [<ffffffff802ad7d5>] __d_lookup+0x155/0x160
RSP: 0018:ffff810057973b98  EFLAGS: 00010246
RAX: 0000000000000017 RBX: ffff81002003f0e0 RCX: 0000000000000017
RDX: 0000000000000017 RSI: ffff81f02003f16c RDI: ffff8100036f7022
RBP: ffff810057973bf8 R08: ffff810057973ca8 R09: 0000000000000000
R10: 00000000000000d8 R11: 0000000000000246 R12: ffff81002003f0c8
R13: 00000000910b9880 R14: ffff810035a5ded8 R15: ffff810057973bc8
FS:  00007f6e2b7266f0(0000) GS:ffff81007d006580(0000) knlGS:0000000000000000
CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: ffff81f02003f16c CR3: 000000005788a000 CR4: 00000000000006a0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
Process sensors-applet (pid: 3710, threadinfo ffff810057972000, task 
ffff810062ace9e0)
Stack:  ffff810057973ca8 0000000000000017 ffff81002003f0d0 000000176767e000
  ffff8100036f7022 ffffffff8047a695 ffff81002003f0e0 0000000000000001
  ffff810057973e48 ffff810057973e48 ffff810057973ca8 ffff810057973cb8
Call Trace:
  [<ffffffff8047a695>] ? skb_release_data+0x85/0xd0
  [<ffffffff802a2b95>] do_lookup+0x35/0x220
  [<ffffffff802a2fd2>] __link_path_walk+0x252/0x1010
  [<ffffffff8022b4d0>] ? default_wake_function+0x0/0x10
  [<ffffffff802a3dfe>] path_walk+0x6e/0xe0
  [<ffffffff802a40c2>] do_path_lookup+0xa2/0x240
  [<ffffffff802a45c7>] __path_lookup_intent_open+0x67/0xd0
  [<ffffffff802a463c>] path_lookup_open+0xc/0x10
  [<ffffffff802a558a>] do_filp_open+0xaa/0x990
  [<ffffffff80281778>] ? unmap_region+0x138/0x160
  [<ffffffff80296aec>] ? get_unused_fd_flags+0x8c/0x140
  [<ffffffff80296c16>] do_sys_open+0x76/0x110
  [<ffffffff80296cdb>] sys_open+0x1b/0x20
  [<ffffffff8020b88b>] system_call_after_swapgs+0x7b/0x80


Code: 89 e0 48 8b 55 b0 fe 02 eb ae 0f 1f 40 00 8b 45 bc 41 39 44 24 34 75 8d 48 
8b 55 a8 49 8b 74 24 38 48 39 d2 48 8b 7d c0 48 89 d1 <f3> a6 0f 85 72 ff ff ff 
eb bb 90 55 48 89 e5 41 55 49 89 fd 41
RIP  [<ffffffff802ad7d5>] __d_lookup+0x155/0x160
  RSP <ffff810057973b98>
CR2: ffff81f02003f16c
---[ end trace 9c63388ed58b7c09 ]---

Here the qstr->name used in memcmp seems to be freed or somewhat:
         .loc 1 1280 0
         movq    -88(%rbp), %rdx #,
         movq    56(%r12), %rsi  # <variable>.d_name.name, <variable>.d_name.name
         cmpq    %rdx, %rdx      #,
         movq    -64(%rbp), %rdi # str, str
         movq    %rdx, %rcx      #, len
.LVL394:
HERE    repz cmpsb


^ permalink raw reply	[flat|nested] 71+ messages in thread

* Re: 2.6.25-rc8-mm2: IP: [<ffffffff802868f9>] __kmalloc+0x69/0x110
  2008-04-13 20:44 ` 2.6.25-rc8-mm2: IP: [<ffffffff802868f9>] __kmalloc+0x69/0x110 Alexey Dobriyan
  2008-04-13 20:53   ` Andrew Morton
@ 2008-04-14  9:51   ` Pekka J Enberg
  2008-04-14 16:01   ` Pekka Enberg
  2 siblings, 0 replies; 71+ messages in thread
From: Pekka J Enberg @ 2008-04-14  9:51 UTC (permalink / raw
  To: Alexey Dobriyan; +Cc: Andrew Morton, linux-kernel, clameter

On Mon, 14 Apr 2008, Alexey Dobriyan wrote:
> Grrr, I was hunting for oopses in dup_fd and near that were plaguing one
> box here for far too long, and hit below.
> 
> What happened if freshly booted box (probably not all init scripts finished),
> X already started. ssh from another box and reboot from session.

If this is easily reproducable, I would appreciate if you could give the 
'for-linus' branch of my tree a spin:

git://git.kernel.org/pub/scm/linux/kernel/git/penberg/slab-2.6.git

			Pekka

^ permalink raw reply	[flat|nested] 71+ messages in thread

* Re: 2.6.25-rc8-mm2: IP: [<ffffffff802868f9>] __kmalloc+0x69/0x110
  2008-04-13 20:44 ` 2.6.25-rc8-mm2: IP: [<ffffffff802868f9>] __kmalloc+0x69/0x110 Alexey Dobriyan
  2008-04-13 20:53   ` Andrew Morton
  2008-04-14  9:51   ` Pekka J Enberg
@ 2008-04-14 16:01   ` Pekka Enberg
  2008-04-14 17:53     ` Christoph Lameter
  2 siblings, 1 reply; 71+ messages in thread
From: Pekka Enberg @ 2008-04-14 16:01 UTC (permalink / raw
  To: Alexey Dobriyan; +Cc: Andrew Morton, linux-kernel, clameter

On Sun, Apr 13, 2008 at 11:44 PM, Alexey Dobriyan <adobriyan@gmail.com> wrote:
> Grrr, I was hunting for oopses in dup_fd and near that were plaguing one
>  box here for far too long, and hit below.
>
>  What happened if freshly booted box (probably not all init scripts finished),
>  X already started. ssh from another box and reboot from session.
>
>  (gdb) p __kmalloc
>  $1 = {void *(size_t, gfp_t)} 0xffffffff80286890 <__kmalloc>
>  (gdb) l *(0xffffffff80286890 + 0x69)
>  0xffffffff802868f9 is in __kmalloc (mm/slub.c:1663).
>  1658
>  1659                    object = __slab_alloc(s, gfpflags, node, addr, c);
>  1660
>  1661            else {
>  1662                    object = c->freelist;
>  1663      ===>          c->freelist = object[c->offset];   <===
>  1664                    stat(c, ALLOC_FASTPATH);
>  1665            }
>  1666            local_irq_restore(flags);
>
>
>
>  BUG: unable to handle kernel paging request at 0000000500000500
>  IP: [<ffffffff802868f9>] __kmalloc+0x69/0x110
>  PGD 17e04a067 PUD 0
>  Oops: 0000 [1] SMP DEBUG_PAGEALLOC
>  last sysfs file: /sys/devices/pci0000:00/0000:00:1e.0/0000:05:02.0/resource
>  CPU 1
>  Modules linked in: nf_conntrack_irc ipt_MASQUERADE iptable_nat nf_nat nf_conntrack_ipv4 xt_state nf_conntrack iptable_filter ip_tables x_tables usblp ehci_hcd uhci_hcd usbcore sr_mod cdrom
>  Pid: 4966, comm: depscan.sh Not tainted 2.6.25-rc8-mm2 #20
>  RIP: 0010:[<ffffffff802868f9>]  [<ffffffff802868f9>] __kmalloc+0x69/0x110
>  RSP: 0018:ffff81017cba9c68  EFLAGS: 00010006
>  RAX: 0000000000000000 RBX: ffffffff805c3950 RCX: ffff81017e7bb278
>  RDX: ffff81017c868000 RSI: 0000000000000001 RDI: ffffffff802868db
>  RBP: ffff81017cba9c98 R08: 0000000000000000 R09: 0000000000000001
>  R10: 0000000005050561 R11: 00000000036c00b1 R12: 0000000500000500
>  R13: 0000000000000282 R14: 00000000000080d0 R15: ffff810001070360
>  FS:  00007fc9d17276f0(0000) GS:ffff81017fc44600(0000) knlGS:0000000000000000
>  CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
>  CR2: 0000000500000500 CR3: 000000017c9c2000 CR4: 00000000000006e0
>  DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
>  DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
>  Process depscan.sh (pid: 4966, threadinfo ffff81017cba8000, task ffff81017c868000)
>  Stack:  ffffffff802d4a42 ffff81017e7bb278 ffff81017e7bb278 00000000fe5c5c7c
>   000000000cb4c2b8 ffff81017efdc8c0 ffff81017cba9cd8 ffffffff802d4a42
>   ffff81017cba9cd8 ffff81017e7bb278 ffff81017f82e2a0 ffff81017cba9da8
>  Call Trace:
>   [<ffffffff802d4a42>] ? ext3_htree_store_dirent+0x32/0x120
>   [<ffffffff802d4a42>] ext3_htree_store_dirent+0x32/0x120
>   [<ffffffff802dba25>] htree_dirblock_to_tree+0x105/0x170
>   [<ffffffff802de30d>] ext3_htree_fill_tree+0x7d/0x220
>   [<ffffffff80252d59>] ? trace_hardirqs_on_caller+0xc9/0x150
>   [<ffffffff802d50f4>] ? ext3_readdir+0x5c4/0x630
>   [<ffffffff802d4c74>] ext3_readdir+0x144/0x630
>   [<ffffffff802975f0>] ? filldir+0x0/0xe0
>   [<ffffffff8045475a>] ? __mutex_lock_common+0x22a/0x330
>   [<ffffffff80297741>] ? vfs_readdir+0x71/0xc0
>   [<ffffffff802975f0>] ? filldir+0x0/0xe0
>   [<ffffffff802975f0>] ? filldir+0x0/0xe0
>   [<ffffffff80297773>] vfs_readdir+0xa3/0xc0
>   [<ffffffff80297822>] sys_getdents+0x92/0xd0
>   [<ffffffff8020b4cb>] system_call_after_swapgs+0x7b/0x80
>
>
>  Code: 48 89 45 d0 9c 41 5d fa e8 f5 a5 fc ff 65 8b 04 25 24 00 00 00 48 98 4c 8b bc c3 c8 00 00 00 4d 8b 27 4d 85 e4 74 7a 41 8b 47 14 <49> 8b 04 c4 49 89 07 41 f7 c5 00 02 00 00 75 37 41 55 9d e8 bf
>  RIP  [<ffffffff802868f9>] __kmalloc+0x69/0x110
>   RSP <ffff81017cba9c68>
>  CR2: 0000000500000500

Looks like freelist corruption where c->freelist is 0x0000000500000500
and c->offset is zero... Christoph?

>  # CONFIG_DEBUG_DRIVER is not set
>  # CONFIG_DEBUG_DEVRES is not set
>  # CONFIG_DEBUG_FS is not set
>  CONFIG_DEBUG_KERNEL=y
>  # CONFIG_DEBUG_SHIRQ is not set
>  CONFIG_DEBUG_OBJECTS=y
>  # CONFIG_DEBUG_OBJECTS_SELFTEST is not set
>  CONFIG_DEBUG_OBJECTS_FREE=y
>  CONFIG_DEBUG_OBJECTS_TIMERS=y
>  CONFIG_DEBUG_RT_MUTEXES=y
>  CONFIG_DEBUG_PI_LIST=y
>  CONFIG_DEBUG_SPINLOCK=y
>  CONFIG_DEBUG_MUTEXES=y
>  CONFIG_DEBUG_LOCK_ALLOC=y
>  # CONFIG_DEBUG_LOCKDEP is not set
>  CONFIG_DEBUG_SPINLOCK_SLEEP=y
>  # CONFIG_DEBUG_LOCKING_API_SELFTESTS is not set
>  # CONFIG_DEBUG_KOBJECT is not set
>  CONFIG_DEBUG_BUGVERBOSE=y
>  CONFIG_DEBUG_INFO=y
>  CONFIG_DEBUG_VM=y
>  CONFIG_DEBUG_WRITECOUNT=y
>  CONFIG_DEBUG_LIST=y
>  CONFIG_DEBUG_SG=y
>  # CONFIG_DEBUG_SYNCHRO_TEST is not set
>  # CONFIG_DEBUG_STACKOVERFLOW is not set
>  # CONFIG_DEBUG_STACK_USAGE is not set
>  CONFIG_DEBUG_PAGEALLOC=y
>  CONFIG_DEBUG_PER_CPU_MAPS=y
>  CONFIG_DEBUG_RODATA=y
>  CONFIG_DEBUG_RODATA_TEST=y
>  # CONFIG_DEBUG_NX_TEST is not set

I assume you have CONFIG_SLUB_DEBUG enabled but it was left out by the grep?

^ permalink raw reply	[flat|nested] 71+ messages in thread

* Re: 2.6.25-rc8-mm2: IP: [<ffffffff802868f9>] __kmalloc+0x69/0x110
  2008-04-13 20:53   ` Andrew Morton
@ 2008-04-14 16:18     ` Pekka Enberg
  2008-04-14 18:00     ` Christoph Lameter
  1 sibling, 0 replies; 71+ messages in thread
From: Pekka Enberg @ 2008-04-14 16:18 UTC (permalink / raw
  To: Andrew Morton; +Cc: Alexey Dobriyan, linux-kernel, clameter

On Sun, Apr 13, 2008 at 11:53 PM, Andrew Morton
<akpm@linux-foundation.org> wrote:
>  Yes, I don't think the sub changes are ready for prime-time.

Indeed. I now dropped the SLUB defragmentation patches from the
'for-mm' branch so Andrew can you please pull the new branch to -mm?

^ permalink raw reply	[flat|nested] 71+ messages in thread

* Re: 2.6.25-rc8-mm2: IP: [<ffffffff802868f9>] __kmalloc+0x69/0x110
  2008-04-14 16:01   ` Pekka Enberg
@ 2008-04-14 17:53     ` Christoph Lameter
  2008-04-14 17:57       ` Christoph Lameter
  2008-04-14 18:32       ` Alexey Dobriyan
  0 siblings, 2 replies; 71+ messages in thread
From: Christoph Lameter @ 2008-04-14 17:53 UTC (permalink / raw
  To: Pekka Enberg; +Cc: Alexey Dobriyan, Andrew Morton, linux-kernel

On Mon, 14 Apr 2008, Pekka Enberg wrote:

> >  Code: 48 89 45 d0 9c 41 5d fa e8 f5 a5 fc ff 65 8b 04 25 24 00 00 00 48 98 4c 8b bc c3 c8 00 00 00 4d 8b 27 4d 85 e4 74 7a 41 8b 47 14 <49> 8b 04 c4 49 89 07 41 f7 c5 00 02 00 00 75 37 41 55 9d e8 bf
> >  RIP  [<ffffffff802868f9>] __kmalloc+0x69/0x110
> >   RSP <ffff81017cba9c68>
> >  CR2: 0000000500000500
> 
> Looks like freelist corruption where c->freelist is 0x0000000500000500
> and c->offset is zero... Christoph?

c->offset is zero is okay. Could be a object freepointer corruption 
because the first word of the object is overwritten after free. You 
need to run with slub_debug on the commandline or CONFIG_SLUB_DEBUG_ON to 
debug this.

Anyone know what the possible meaning of 0x0000000500000500 is? I do not 
see anything in poison.h.


^ permalink raw reply	[flat|nested] 71+ messages in thread

* Re: 2.6.25-rc8-mm2: IP: [<ffffffff802868f9>] __kmalloc+0x69/0x110
  2008-04-14 17:53     ` Christoph Lameter
@ 2008-04-14 17:57       ` Christoph Lameter
  2008-04-14 18:32       ` Alexey Dobriyan
  1 sibling, 0 replies; 71+ messages in thread
From: Christoph Lameter @ 2008-04-14 17:57 UTC (permalink / raw
  To: Pekka Enberg; +Cc: Alexey Dobriyan, Andrew Morton, linux-kernel

500000000 is 21474836480. So 21GB boundary? Some sort of device?



^ permalink raw reply	[flat|nested] 71+ messages in thread

* Re: 2.6.25-rc8-mm2: IP: [<ffffffff802868f9>] __kmalloc+0x69/0x110
  2008-04-13 20:53   ` Andrew Morton
  2008-04-14 16:18     ` Pekka Enberg
@ 2008-04-14 18:00     ` Christoph Lameter
  1 sibling, 0 replies; 71+ messages in thread
From: Christoph Lameter @ 2008-04-14 18:00 UTC (permalink / raw
  To: Andrew Morton; +Cc: Alexey Dobriyan, linux-kernel, penberg

On Sun, 13 Apr 2008, Andrew Morton wrote:

> Yes, I don't think the sub changes are ready for prime-time.

At this point there is no tie in with slub changes. This is 
freepointer corruption that is typical for writing to the object after 
free it. Enabling slub debugging is needed to figure out when the object 
was overwritten.



^ permalink raw reply	[flat|nested] 71+ messages in thread

* Re: 2.6.25-rc8-mm2: IP: [<ffffffff802868f9>] __kmalloc+0x69/0x110
  2008-04-14 17:53     ` Christoph Lameter
  2008-04-14 17:57       ` Christoph Lameter
@ 2008-04-14 18:32       ` Alexey Dobriyan
  2008-04-14 19:56         ` 2.6.25-rc8-mm2: FIX kmalloc-2048 (was Re: 2.6.25-rc8-mm2: IP: [<ffffffff802868f9>] __kmalloc+0x69/0x110) Alexey Dobriyan
  1 sibling, 1 reply; 71+ messages in thread
From: Alexey Dobriyan @ 2008-04-14 18:32 UTC (permalink / raw
  To: Christoph Lameter; +Cc: Pekka Enberg, Andrew Morton, linux-kernel

On Mon, Apr 14, 2008 at 10:53:12AM -0700, Christoph Lameter wrote:
> On Mon, 14 Apr 2008, Pekka Enberg wrote:
> 
> > >  Code: 48 89 45 d0 9c 41 5d fa e8 f5 a5 fc ff 65 8b 04 25 24 00 00 00 48 98 4c 8b bc c3 c8 00 00 00 4d 8b 27 4d 85 e4 74 7a 41 8b 47 14 <49> 8b 04 c4 49 89 07 41 f7 c5 00 02 00 00 75 37 41 55 9d e8 bf
> > >  RIP  [<ffffffff802868f9>] __kmalloc+0x69/0x110
> > >   RSP <ffff81017cba9c68>
> > >  CR2: 0000000500000500
> > 
> > Looks like freelist corruption where c->freelist is 0x0000000500000500
> > and c->offset is zero... Christoph?
> 
> c->offset is zero is okay. Could be a object freepointer corruption 
> because the first word of the object is overwritten after free. You 
> need to run with slub_debug on the commandline or CONFIG_SLUB_DEBUG_ON to 
> debug this.

Doh! I forgot SLUB debugging again. Apologies and let me reproduce some more.

> Anyone know what the possible meaning of 0x0000000500000500 is? I do not 
> see anything in poison.h.


^ permalink raw reply	[flat|nested] 71+ messages in thread

* 2.6.25-rc8-mm2: FIX kmalloc-2048 (was Re: 2.6.25-rc8-mm2: IP: [<ffffffff802868f9>] __kmalloc+0x69/0x110)
  2008-04-14 18:32       ` Alexey Dobriyan
@ 2008-04-14 19:56         ` Alexey Dobriyan
  2008-04-14 20:05           ` Christoph Lameter
  0 siblings, 1 reply; 71+ messages in thread
From: Alexey Dobriyan @ 2008-04-14 19:56 UTC (permalink / raw
  To: Christoph Lameter; +Cc: Pekka Enberg, Andrew Morton, linux-kernel, netdev

I can reproduce semi-reliably (by kernel standards) corruption in
kmalloc-2048. No idea if this can explain all "struct file" related
oopses I saw, or SLUB free pointer corruption Pekka and Christoph are
looking into.

8139too and atl1 drivers are in use. 8139too connects to outer world,
atl1 -- to laptop collecting netconsole logs. However, I never managed to
collect late oopses with netconsole even if init scripts which are
shutting down interfaces are disabled. :-(



Transcribed from photo:

8000 flags=0x8000000000002082
INFO: Object 0xffff81017ff9d2d0 @offset=21200 fp=0xffff81017ff9ca88

Bytes b4 0xffff81017ff9d2c0:  62 ea ff ff 00 00 00 00 5a 5a 5a 5a 5a 5a 5a 5a
  Object 0xffff81017ff9d2d0:  6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b
  Object 0xffff81017ff9d2e0:  6b 6b 00 18 f3 a2 9f 90 00 1b 38 af 22 49 08 00
  Object 0xffff81017ff9d2f0:  45 10 00 4c ff 59 40 00 40 11 86 ac c0 a8 00 2a
  Object 0xffff81017ff9d300:  50 fa a2 be 91 43 00 7b 00 38 54 d4 23 00 00 00
  Object 0xffff81017ff9d310:  00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
  Object 0xffff81017ff9d320:  00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
  Object 0xffff81017ff9d330:  00 00 00 00 4c ff 10 44 74 7f 6f 9d e4 c8 a2 4f
  Object 0xffff81017ff9d340:  00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
 Redzone 0xffff81017ff9dad0:  bb bb bb bb bb bb bb bb
 Padding 0xffff81017ff9db10:  5a 5a 5a 5a 5a 5a 5a 5a

 Pid: 6168, comm: reboot Not tainted 2.6.25-rc8-mm2 #28

 Call Trace:
	print_trailer
	check_bytes_and_report
	check_object
	__free_slab
	discard_slab
	__slab_free
	? skb_release_data
	kfree
	? skb_release_data
	skb_release_all
	__kfre_skb
	kfree_skb
	atl1_clean_rx_ring
	atl1_down
	atl1_close
	dev_close
	dev_change_flags
	devinet_ioctl
	? trace_hardirqs_on
	inet_ioctl
	sock_ioctl
	vfs_ioctl
	do_vfs_ioctl
	sys_ioctl
	system_call_after_swapgs

FIX kmalloc-2048: Restoring 0xffff81017ff9d2e2-0xffff81017ff9d8d9=0x6b


^ permalink raw reply	[flat|nested] 71+ messages in thread

* Re: 2.6.25-rc8-mm2: FIX kmalloc-2048 (was Re: 2.6.25-rc8-mm2: IP: [<ffffffff802868f9>] __kmalloc+0x69/0x110)
  2008-04-14 19:56         ` 2.6.25-rc8-mm2: FIX kmalloc-2048 (was Re: 2.6.25-rc8-mm2: IP: [<ffffffff802868f9>] __kmalloc+0x69/0x110) Alexey Dobriyan
@ 2008-04-14 20:05           ` Christoph Lameter
  2008-04-19 11:17             ` Alexey Dobriyan
  0 siblings, 1 reply; 71+ messages in thread
From: Christoph Lameter @ 2008-04-14 20:05 UTC (permalink / raw
  To: Alexey Dobriyan; +Cc: Pekka Enberg, Andrew Morton, linux-kernel, netdev

On Mon, 14 Apr 2008, Alexey Dobriyan wrote:

> I can reproduce semi-reliably (by kernel standards) corruption in
> kmalloc-2048. No idea if this can explain all "struct file" related
> oopses I saw, or SLUB free pointer corruption Pekka and Christoph are
> looking into.

The slub free pointer corruption is usually a result of the overwrites.

> Bytes b4 0xffff81017ff9d2c0:  62 ea ff ff 00 00 00 00 5a 5a 5a 5a 5a 5a 5a 5a
>   Object 0xffff81017ff9d2d0:  6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b
>   Object 0xffff81017ff9d2e0:  6b 6b 00 18 f3 a2 9f 90 00 1b 38 af 22 49 08 00
>   Object 0xffff81017ff9d2f0:  45 10 00 4c ff 59 40 00 40 11 86 ac c0 a8 00 2a
>   Object 0xffff81017ff9d300:  50 fa a2 be 91 43 00 7b 00 38 54 d4 23 00 00 00
>   Object 0xffff81017ff9d310:  00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
>   Object 0xffff81017ff9d320:  00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
>   Object 0xffff81017ff9d330:  00 00 00 00 4c ff 10 44 74 7f 6f 9d e4 c8 a2 4f
>   Object 0xffff81017ff9d340:  00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
>  Redzone 0xffff81017ff9dad0:  bb bb bb bb bb bb bb bb
>  Padding 0xffff81017ff9db10:  5a 5a 5a 5a 5a 5a 5a 5a
> 
> FIX kmalloc-2048: Restoring 0xffff81017ff9d2e2-0xffff81017ff9d8d9=0x6b

Looks like skb corruption. Would be helpful to have the complete output 
though. Does the data in the restored range trigger any memories?




^ permalink raw reply	[flat|nested] 71+ messages in thread

* Re: 2.6.25-rc8-mm2: Observed problems: Not a detailed bug report
  2008-04-11  3:33 2.6.25-rc8-mm2 Andrew Morton
                   ` (10 preceding siblings ...)
  2008-04-14  8:07 ` BUG at __dentry_open [Was: 2.6.25-rc8-mm2] Jiri Slaby
@ 2008-04-15 17:00 ` Zan Lynx
  2008-04-15 18:42   ` Andrew Morton
  11 siblings, 1 reply; 71+ messages in thread
From: Zan Lynx @ 2008-04-15 17:00 UTC (permalink / raw
  To: Andrew Morton; +Cc: linux-kernel

[-- Attachment #1: Type: text/plain, Size: 947 bytes --]

On Thu, 2008-04-10 at 20:33 -0700, Andrew Morton wrote:
> ftp://ftp.kernel.org/pub/linux/kernel/people/akpm/patches/2.6/2.6.25-rc8/2.6.25-rc8-mm2/

I figured that I would post these without details just so they're out
there.  I've had trouble getting netconsole to dump anything when I need
it to.  I've also had trouble convincing myself to spend the off-work
hours I need to narrow it down.

The general environment is a AMD64 laptop running 64-bit Gentoo, GCC
4.3.

Problem #1:
Compiling with the stack protector option crashes right after initramfs
loads.  I think it might be dying while attempting to mount root.
Simple work around: Don't compile with stack protector.

Problem #2:
Resume from STR.  The laptop suspends and goes into suspend state with
the blinking power indicator.  On resume, it simply boots normally
instead of resuming.  No idea how to debug this one.  Bisect I guess.
-- 
Zan Lynx <zlynx@acm.org>

[-- Attachment #2: This is a digitally signed message part --]
[-- Type: application/pgp-signature, Size: 197 bytes --]

^ permalink raw reply	[flat|nested] 71+ messages in thread

* Re: 2.6.25-rc8-mm2: Observed problems: Not a detailed bug report
  2008-04-15 17:00 ` 2.6.25-rc8-mm2: Observed problems: Not a detailed bug report Zan Lynx
@ 2008-04-15 18:42   ` Andrew Morton
  2008-04-15 20:12     ` Rafael J. Wysocki
  0 siblings, 1 reply; 71+ messages in thread
From: Andrew Morton @ 2008-04-15 18:42 UTC (permalink / raw
  To: Zan Lynx; +Cc: linux-kernel, Ingo Molnar

On Tue, 15 Apr 2008 11:00:29 -0600 Zan Lynx <zlynx@acm.org> wrote:

> On Thu, 2008-04-10 at 20:33 -0700, Andrew Morton wrote:
> > ftp://ftp.kernel.org/pub/linux/kernel/people/akpm/patches/2.6/2.6.25-rc8/2.6.25-rc8-mm2/
> 
> I figured that I would post these without details just so they're out
> there.  I've had trouble getting netconsole to dump anything when I need
> it to.  I've also had trouble convincing myself to spend the off-work
> hours I need to narrow it down.
> 
> The general environment is a AMD64 laptop running 64-bit Gentoo, GCC
> 4.3.
> 
> Problem #1:
> Compiling with the stack protector option crashes right after initramfs
> loads.  I think it might be dying while attempting to mount root.
> Simple work around: Don't compile with stack protector.

Yes, there have been a couple of reports of stack-protector problems.

> Problem #2:
> Resume from STR.  The laptop suspends and goes into suspend state with
> the blinking power indicator.  On resume, it simply boots normally
> instead of resuming.  No idea how to debug this one.

hm.  I figured that if suspend worked on the Vaio, it'll work on
everything ;)

>  Bisect I guess.

If you have the time, please.

^ permalink raw reply	[flat|nested] 71+ messages in thread

* Re: 2.6.25-rc8-mm2: Observed problems: Not a detailed bug report
  2008-04-15 18:42   ` Andrew Morton
@ 2008-04-15 20:12     ` Rafael J. Wysocki
  0 siblings, 0 replies; 71+ messages in thread
From: Rafael J. Wysocki @ 2008-04-15 20:12 UTC (permalink / raw
  To: Andrew Morton, Zan Lynx; +Cc: linux-kernel, Ingo Molnar

On Tuesday, 15 of April 2008, Andrew Morton wrote:
> On Tue, 15 Apr 2008 11:00:29 -0600 Zan Lynx <zlynx@acm.org> wrote:
> 
> > On Thu, 2008-04-10 at 20:33 -0700, Andrew Morton wrote:
> > > ftp://ftp.kernel.org/pub/linux/kernel/people/akpm/patches/2.6/2.6.25-rc8/2.6.25-rc8-mm2/
> > 
> > I figured that I would post these without details just so they're out
> > there.  I've had trouble getting netconsole to dump anything when I need
> > it to.  I've also had trouble convincing myself to spend the off-work
> > hours I need to narrow it down.
> > 
> > The general environment is a AMD64 laptop running 64-bit Gentoo, GCC
> > 4.3.
> > 
> > Problem #1:
> > Compiling with the stack protector option crashes right after initramfs
> > loads.  I think it might be dying while attempting to mount root.
> > Simple work around: Don't compile with stack protector.
> 
> Yes, there have been a couple of reports of stack-protector problems.
> 
> > Problem #2:
> > Resume from STR.  The laptop suspends and goes into suspend state with
> > the blinking power indicator.  On resume, it simply boots normally
> > instead of resuming.  No idea how to debug this one.
> 
> hm.  I figured that if suspend worked on the Vaio, it'll work on
> everything ;)
> 
> >  Bisect I guess.
> 
> If you have the time, please.

Zan, can you please try the latest x86.git tree first?  The bug may be fixed
there.  If that still fails, please bisect, if you can.

Thanks,
Rafael

^ permalink raw reply	[flat|nested] 71+ messages in thread

* Re: 2.6.25-rc8-mm2: FIX kmalloc-2048 (was Re: 2.6.25-rc8-mm2: IP: [<ffffffff802868f9>] __kmalloc+0x69/0x110)
  2008-04-14 20:05           ` Christoph Lameter
@ 2008-04-19 11:17             ` Alexey Dobriyan
  2008-04-19 14:45               ` atl1 64-bit => 32-bit DMA borkage (reproducible, bisected) Alexey Dobriyan
  0 siblings, 1 reply; 71+ messages in thread
From: Alexey Dobriyan @ 2008-04-19 11:17 UTC (permalink / raw
  To: Christoph Lameter; +Cc: Pekka Enberg, Andrew Morton, linux-kernel, netdev

On Mon, Apr 14, 2008 at 01:05:09PM -0700, Christoph Lameter wrote:
> On Mon, 14 Apr 2008, Alexey Dobriyan wrote:
> 
> > I can reproduce semi-reliably (by kernel standards) corruption in
> > kmalloc-2048. No idea if this can explain all "struct file" related
> > oopses I saw, or SLUB free pointer corruption Pekka and Christoph are
> > looking into.
> 
> The slub free pointer corruption is usually a result of the overwrites.
> 
> > Bytes b4 0xffff81017ff9d2c0:  62 ea ff ff 00 00 00 00 5a 5a 5a 5a 5a 5a 5a 5a
> >   Object 0xffff81017ff9d2d0:  6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b
> >   Object 0xffff81017ff9d2e0:  6b 6b 00 18 f3 a2 9f 90 00 1b 38 af 22 49 08 00
> >   Object 0xffff81017ff9d2f0:  45 10 00 4c ff 59 40 00 40 11 86 ac c0 a8 00 2a
> >   Object 0xffff81017ff9d300:  50 fa a2 be 91 43 00 7b 00 38 54 d4 23 00 00 00
> >   Object 0xffff81017ff9d310:  00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
> >   Object 0xffff81017ff9d320:  00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
> >   Object 0xffff81017ff9d330:  00 00 00 00 4c ff 10 44 74 7f 6f 9d e4 c8 a2 4f
> >   Object 0xffff81017ff9d340:  00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
> >  Redzone 0xffff81017ff9dad0:  bb bb bb bb bb bb bb bb
> >  Padding 0xffff81017ff9db10:  5a 5a 5a 5a 5a 5a 5a 5a
> > 
> > FIX kmalloc-2048: Restoring 0xffff81017ff9d2e2-0xffff81017ff9d8d9=0x6b
> 
> Looks like skb corruption. Would be helpful to have the complete output 
> though. Does the data in the restored range trigger any memories?

No.

I'm currently tracing this bug and 2.6.24 also has it. :-(


^ permalink raw reply	[flat|nested] 71+ messages in thread

* atl1 64-bit => 32-bit DMA borkage (reproducible, bisected)
  2008-04-19 11:17             ` Alexey Dobriyan
@ 2008-04-19 14:45               ` Alexey Dobriyan
  2008-04-20  2:54                 ` Jay Cliburn
  0 siblings, 1 reply; 71+ messages in thread
From: Alexey Dobriyan @ 2008-04-19 14:45 UTC (permalink / raw
  To: Luca Tettamanti, Chris Snook, Jay Cliburn, Jeff Garzik
  Cc: Pekka Enberg, Andrew Morton, linux-kernel, netdev,
	Christoph Lameter, torvalds

On Sat, Apr 19, 2008 at 03:17:19PM +0400, Alexey Dobriyan wrote:
> > > Bytes b4 0xffff81017ff9d2c0:  62 ea ff ff 00 00 00 00 5a 5a 5a 5a 5a 5a 5a 5a
> > >   Object 0xffff81017ff9d2d0:  6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b
> > >   Object 0xffff81017ff9d2e0:  6b 6b 00 18 f3 a2 9f 90 00 1b 38 af 22 49 08 00
> > >   Object 0xffff81017ff9d2f0:  45 10 00 4c ff 59 40 00 40 11 86 ac c0 a8 00 2a
> > >   Object 0xffff81017ff9d300:  50 fa a2 be 91 43 00 7b 00 38 54 d4 23 00 00 00
> > >   Object 0xffff81017ff9d310:  00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
> > >   Object 0xffff81017ff9d320:  00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
> > >   Object 0xffff81017ff9d330:  00 00 00 00 4c ff 10 44 74 7f 6f 9d e4 c8 a2 4f
> > >   Object 0xffff81017ff9d340:  00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
> > >  Redzone 0xffff81017ff9dad0:  bb bb bb bb bb bb bb bb
> > >  Padding 0xffff81017ff9db10:  5a 5a 5a 5a 5a 5a 5a 5a
> > > 
> > > FIX kmalloc-2048: Restoring 0xffff81017ff9d2e2-0xffff81017ff9d8d9=0x6b

OK, nailed it.

It's commit 5f08e46b621a769e52a9545a23ab1d5fb2aec1d4 aka "atl1: disable broken 64-bit DMA".

With this commit in tree, I can reproduce either
a) kmalloc-2048 corruption after initscripts shutdown eth0
	http://marc.info/?l=linux-kernel&m=120820360221261&w=2

b) or oopses at filp_close() first reported long ago
	(sorry, can't find that email)

c) or hard hang after initscripts shutdown eth0 with even SysRq not working.
	http://marc.info/?l=linux-kernel&m=120795046008115&w=2

I have two boxes one with atl1, 4G RAM with 2G remapped after 4G boundary, another
with r8169 connected with just ethernet cable. NICs agree on 1Gbps speed.

So, it's enough to scp 200 MB git archive and immediately start
rebooting sequence for horrors described above to appear. It's not 100%
reproducible but more like 90%.

I tested 10 times kernel one commit before and it doesn't have these
issues and reboots reliably.

CONFIG_IOMMU is in use, dmesg, lspci, /proc/mtrr below:

03:00.0 Ethernet controller [0200]: Attansic Technology Corp. L1 Gigabit Ethernet Adapter [1969:1048] (rev b0)
	Subsystem: ASUSTeK Computer Inc. Unknown device [1043:8226]
	Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx+
	Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
	Latency: 0, Cache Line Size: 32 bytes
	Interrupt: pin A routed to IRQ 319
	Region 0: Memory at fe9c0000 (64-bit, non-prefetchable) [size=256K]
	Expansion ROM at fe9a0000 [disabled] [size=128K]
	Capabilities: [40] Power Management version 2
		Flags: PMEClk- DSI- D1- D2- AuxCurrent=0mA PME(D0-,D1-,D2-,D3hot+,D3cold+)
		Status: D0 PME-Enable- DSel=0 DScale=0 PME-
	Capabilities: [48] Message Signalled Interrupts: Mask- 64bit+ Queue=0/0 Enable+
		Address: 00000000fee0300c  Data: 4161
	Capabilities: [58] Express (v1) Endpoint, MSI 00
		DevCap:	MaxPayload 128 bytes, PhantFunc 0, Latency L0s <4us, L1 unlimited
			ExtTag- AttnBtn+ AttnInd+ PwrInd+ RBE- FLReset-
		DevCtl:	Report errors: Correctable- Non-Fatal- Fatal- Unsupported-
			RlxdOrd- ExtTag- PhantFunc- AuxPwr- NoSnoop-
			MaxPayload 128 bytes, MaxReadReq 512 bytes
		DevSta:	CorrErr+ UncorrErr+ FatalErr- UnsuppReq+ AuxPwr+ TransPend-
		LnkCap:	Port #0, Speed 2.5GT/s, Width x1, ASPM L0s, Latency L0 unlimited, L1 unlimited
			ClockPM- Suprise- LLActRep- BwNot-
		LnkCtl:	ASPM Disabled; RCB 64 bytes Disabled- Retrain- CommClk+
			ExtSynch- ClockPM- AutWidDis- BWInt- AutBWInt-
		LnkSta:	Speed 2.5GT/s, Width x1, TrErr- Train- SlotClk+ DLActive- BWMgmt- ABWMgmt-
	Capabilities: [6c] Vital Product Data <?>
	Kernel driver in use: atl1
00: 69 19 48 10 06 04 10 00 b0 00 00 02 08 00 00 00
10: 04 00 9c fe 00 00 00 00 00 00 00 00 00 00 00 00
20: 00 00 00 00 00 00 00 00 00 00 00 00 43 10 26 82
30: 00 00 9a fe 40 00 00 00 00 00 00 00 0a 01 00 00
40: 01 48 02 c0 00 00 00 00 05 58 81 00 0c 30 e0 fe
50: 00 00 00 00 61 41 00 00 10 6c 01 00 80 7f 00 00
60: 00 20 1b 00 11 f4 03 00 40 00 11 10 03 00 28 81
70: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
80: 00 00 00 00 69 19 48 10 00 00 00 00 00 00 00 00
90: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
a0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
b0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
c0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
d0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
e0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
f0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00


reg00: base=0x80000000 (2048MB), size=2048MB: uncachable, count=1
reg01: base=0x00000000 (   0MB), size=4096MB: write-back, count=1
reg02: base=0x100000000 (4096MB), size=2048MB: write-back, count=1


Linux version 2.6.23-rc6 (ad@martell) (gcc version 4.1.2 (Gentoo 4.1.2 p1.0.2)) #14 SMP PREEMPT Sat Apr 19 17:46:31 MSD 2008
Command line: root=/dev/sda2 netconsole=@192.168.0.1/eth0,9353@192.168.0.42/00:1b:38:af:22:49 ignore_loglevel
BIOS-provided physical RAM map:
 BIOS-e820: 0000000000000000 - 000000000009fc00 (usable)
 BIOS-e820: 000000000009fc00 - 00000000000a0000 (reserved)
 BIOS-e820: 00000000000e4000 - 0000000000100000 (reserved)
 BIOS-e820: 0000000000100000 - 000000007ff90000 (usable)
 BIOS-e820: 000000007ff90000 - 000000007ff9e000 (ACPI data)
 BIOS-e820: 000000007ff9e000 - 000000007ffe0000 (ACPI NVS)
 BIOS-e820: 000000007ffe0000 - 0000000080000000 (reserved)
 BIOS-e820: 00000000fee00000 - 00000000fee01000 (reserved)
 BIOS-e820: 00000000ffb00000 - 0000000100000000 (reserved)
 BIOS-e820: 0000000100000000 - 0000000180000000 (usable)
Entering add_active_range(0, 0, 159) 0 entries of 256 used
Entering add_active_range(0, 256, 524176) 1 entries of 256 used
Entering add_active_range(0, 1048576, 1572864) 2 entries of 256 used
end_pfn_map = 1572864
DMI 2.4 present.
ACPI: RSDP 000FA980, 0024 (r2 ACPIAM)
ACPI: XSDT 7FF90100, 0054 (r1 KOZIRO FRONTIER  2000707 MSFT       97)
ACPI: FACP 7FF90290, 00F4 (r3 MSTEST OEMFACP   2000707 MSFT       97)
ACPI: DSDT 7FF905C0, 8FA9 (r1  A0637 A0637000        0 INTL 20060113)
ACPI: FACS 7FF9E000, 0040
ACPI: APIC 7FF90390, 006C (r1 MSTEST OEMAPIC   2000707 MSFT       97)
ACPI: MCFG 7FF90400, 003C (r1 MSTEST OEMMCFG   2000707 MSFT       97)
ACPI: SLIC 7FF90440, 0176 (r1 KOZIRO FRONTIER  2000707 MSFT       97)
ACPI: OEMB 7FF9E040, 007B (r1 MSTEST AMI_OEM   2000707 MSFT       97)
ACPI: HPET 7FF99570, 0038 (r1 MSTEST OEMHPET   2000707 MSFT       97)
Entering add_active_range(0, 0, 159) 0 entries of 256 used
Entering add_active_range(0, 256, 524176) 1 entries of 256 used
Entering add_active_range(0, 1048576, 1572864) 2 entries of 256 used
Zone PFN ranges:
  DMA             0 ->     4096
  DMA32        4096 ->  1048576
  Normal    1048576 ->  1572864
Movable zone start PFN for each node
early_node_map[3] active PFN ranges
    0:        0 ->      159
    0:      256 ->   524176
    0:  1048576 ->  1572864
On node 0 totalpages: 1048367
  DMA zone: 56 pages used for memmap
  DMA zone: 2000 pages reserved
  DMA zone: 1943 pages, LIFO batch:0
  DMA32 zone: 14280 pages used for memmap
  DMA32 zone: 505800 pages, LIFO batch:31
  Normal zone: 7168 pages used for memmap
  Normal zone: 517120 pages, LIFO batch:31
  Movable zone: 0 pages used for memmap
ACPI: Local APIC address 0xfee00000
ACPI: LAPIC (acpi_id[0x01] lapic_id[0x00] enabled)
Processor #0 (Bootup-CPU)
ACPI: LAPIC (acpi_id[0x02] lapic_id[0x01] enabled)
Processor #1
ACPI: LAPIC (acpi_id[0x03] lapic_id[0x82] disabled)
ACPI: LAPIC (acpi_id[0x04] lapic_id[0x83] disabled)
ACPI: IOAPIC (id[0x02] address[0xfec00000] gsi_base[0])
IOAPIC[0]: apic_id 2, address 0xfec00000, GSI 0-23
ACPI: INT_SRC_OVR (bus 0 bus_irq 0 global_irq 2 dfl dfl)
ACPI: INT_SRC_OVR (bus 0 bus_irq 9 global_irq 9 high level)
ACPI: IRQ0 used by override.
ACPI: IRQ2 used by override.
ACPI: IRQ9 used by override.
Setting APIC routing to flat
ACPI: HPET id: 0x8086a202 base: 0xfed00000
Using ACPI (MADT) for SMP configuration information
Allocating PCI resources starting at 88000000 (gap: 80000000:7ee00000)
PERCPU: Allocating 29912 bytes of per cpu data
Built 1 zonelists in Zone order.  Total pages: 1024863
Kernel command line: root=/dev/sda2 netconsole=@192.168.0.1/eth0,9353@192.168.0.42/00:1b:38:af:22:49 ignore_loglevel
netconsole: local port 6665
netconsole: local IP 192.168.0.1
netconsole: interface eth0
netconsole: remote port 9353
netconsole: remote IP 192.168.0.42
netconsole: remote ethernet address 00:1b:38:af:22:49
debug: ignoring loglevel setting.
Initializing CPU#0
PID hash table entries: 4096 (order: 12, 32768 bytes)
Extended CMOS year: 2000
time.c: Detected 2135.040 MHz processor.
Console: colour VGA+ 80x25
console [tty0] enabled
Lock dependency validator: Copyright (c) 2006 Red Hat, Inc., Ingo Molnar
... MAX_LOCKDEP_SUBCLASSES:    8
... MAX_LOCK_DEPTH:          30
... MAX_LOCKDEP_KEYS:        2048
... CLASSHASH_SIZE:           1024
... MAX_LOCKDEP_ENTRIES:     8192
... MAX_LOCKDEP_CHAINS:      16384
... CHAINHASH_SIZE:          8192
 memory used by lock dependency info: 1648 kB
 per task-struct memory footprint: 1680 bytes
Dentry cache hash table entries: 524288 (order: 10, 4194304 bytes)
Inode-cache hash table entries: 262144 (order: 9, 2097152 bytes)
Checking aperture...
PCI-DMA: Using software bounce buffering for IO (SWIOTLB)
Placing software IO TLB between 0x161b000 - 0x561b000
Memory: 4026652k/6291456k available (2330k kernel code, 166608k reserved, 1300k data, 200k init)
SLUB: Genslabs=22, HWalign=64, Order=0-1, MinObjects=4, CPUs=2, Nodes=1
Calibrating delay using timer specific routine.. 4273.23 BogoMIPS (lpj=2136619)
Mount-cache hash table entries: 256
CPU: L1 I cache: 32K, L1 D cache: 32K
CPU: L2 cache: 2048K
using mwait in idle threads.
CPU: Physical Processor ID: 0
CPU: Processor Core ID: 0
CPU0: Thermal monitoring enabled (TM2)
Freeing SMP alternatives: 19k freed
ACPI: Core revision 20070126
Using local APIC timer interrupts.
result 16679993
Detected 16.679 MHz APIC timer.
lockdep: not fixing up alternatives.
Booting processor 1/2 APIC 0x1
Initializing CPU#1
Calibrating delay using timer specific routine.. 4270.09 BogoMIPS (lpj=2135045)
CPU: L1 I cache: 32K, L1 D cache: 32K
CPU: L2 cache: 2048K
CPU: Physical Processor ID: 0
CPU: Processor Core ID: 1
CPU1: Thermal monitoring enabled (TM2)
Intel(R) Core(TM)2 CPU          6400  @ 2.13GHz stepping 02
checking TSC synchronization [CPU#0 -> CPU#1]: passed.
Brought up 2 CPUs
NET: Registered protocol family 16
No dock devices found.
ACPI: bus type pci registered
PCI: Using configuration type 1
ACPI: EC: Look up EC in DSDT
ACPI: Interpreter enabled
ACPI: Using IOAPIC for interrupt routing
ACPI: PCI Root Bridge [PCI0] (0000:00)
PCI quirk: region 0800-087f claimed by ICH6 ACPI/GPIO/TCO
PCI quirk: region 0480-04bf claimed by ICH6 GPIO
PCI: Transparent bridge - 0000:00:1e.0
ACPI: PCI Interrupt Routing Table [\_SB_.PCI0._PRT]
ACPI: PCI Interrupt Routing Table [\_SB_.PCI0.P0P2._PRT]
ACPI: PCI Interrupt Routing Table [\_SB_.PCI0.P0P1._PRT]
ACPI: PCI Interrupt Routing Table [\_SB_.PCI0.P0P4._PRT]
ACPI: PCI Interrupt Routing Table [\_SB_.PCI0.P0P7._PRT]
ACPI: PCI Interrupt Routing Table [\_SB_.PCI0.P0P8._PRT]
ACPI: PCI Interrupt Link [LNKA] (IRQs 3 4 5 6 7 10 *11 12 14 15)
ACPI: PCI Interrupt Link [LNKB] (IRQs 3 4 *5 6 7 10 11 12 14 15)
ACPI: PCI Interrupt Link [LNKC] (IRQs 3 4 5 6 7 10 11 12 14 *15)
ACPI: PCI Interrupt Link [LNKD] (IRQs 3 4 5 6 7 *10 11 12 14 15)
ACPI: PCI Interrupt Link [LNKE] (IRQs 3 4 5 6 7 10 11 12 14 15) *0, disabled.
ACPI: PCI Interrupt Link [LNKF] (IRQs 3 4 5 6 7 10 11 12 14 15) *0, disabled.
ACPI: PCI Interrupt Link [LNKG] (IRQs *3 4 5 6 7 10 11 12 14 15)
ACPI: PCI Interrupt Link [LNKH] (IRQs 3 4 5 6 7 10 11 12 *14 15)
ACPI Warning (tbutils-0217): Incorrect checksum in table [OEMB] -  08, should be 03 [20070126]
Linux Plug and Play Support v0.97 (c) Adam Belay
pnp: PnP ACPI init
ACPI: bus type pnp registered
pnp: PnP ACPI: found 15 devices
ACPI: ACPI bus type pnp unregistered
SCSI subsystem initialized
libata version 2.21 loaded.
PCI: Using ACPI for IRQ routing
PCI: If a device doesn't work, try "pci=routeirq".  If it helps, post a report
PCI-GART: No AMD northbridge found.
Time: tsc clocksource has been installed.
pnp: 00:01: iomem range 0xfed14000-0xfed19fff has been reserved
pnp: 00:07: ioport range 0x290-0x297 has been reserved
pnp: 00:08: iomem range 0xffafe000-0xffb0cbff could not be reserved
pnp: 00:08: iomem range 0xffb00000-0xffbfffff could not be reserved
pnp: 00:08: iomem range 0xfed1c000-0xfed1ffff has been reserved
pnp: 00:08: iomem range 0xfed20000-0xfed8ffff has been reserved
pnp: 00:0b: iomem range 0xfec00000-0xfec00fff has been reserved
pnp: 00:0b: iomem range 0xfee00000-0xfee00fff could not be reserved
pnp: 00:0d: iomem range 0xe0000000-0xefffffff has been reserved
pnp: 00:0e: iomem range 0x0-0x9ffff could not be reserved
pnp: 00:0e: iomem range 0xc0000-0xcffff has been reserved
pnp: 00:0e: iomem range 0xe0000-0xfffff could not be reserved
pnp: 00:0e: iomem range 0x100000-0x7fffffff could not be reserved
PCI: Bridge: 0000:00:01.0
  IO window: 9000-9fff
  MEM window: f8700000-fe7fffff
  PREFETCH window: bfe00000-dfdfffff
PCI: Bridge: 0000:00:1c.0
  IO window: disabled.
  MEM window: disabled.
  PREFETCH window: dfe00000-dfefffff
PCI: Bridge: 0000:00:1c.3
  IO window: disabled.
  MEM window: fe900000-fe9fffff
  PREFETCH window: disabled.
PCI: Bridge: 0000:00:1c.4
  IO window: a000-afff
  MEM window: fe800000-fe8fffff
  PREFETCH window: disabled.
PCI: Bridge: 0000:00:1e.0
  IO window: b000-bfff
  MEM window: fea00000-feafffff
  PREFETCH window: 88000000-880fffff
ACPI: PCI Interrupt 0000:00:01.0[A] -> GSI 16 (level, low) -> IRQ 16
PCI: Setting latency timer of device 0000:00:01.0 to 64
ACPI: PCI Interrupt 0000:00:1c.0[A] -> GSI 16 (level, low) -> IRQ 16
PCI: Setting latency timer of device 0000:00:1c.0 to 64
ACPI: PCI Interrupt 0000:00:1c.3[D] -> GSI 19 (level, low) -> IRQ 19
PCI: Setting latency timer of device 0000:00:1c.3 to 64
ACPI: PCI Interrupt 0000:00:1c.4[A] -> GSI 16 (level, low) -> IRQ 16
PCI: Setting latency timer of device 0000:00:1c.4 to 64
PCI: Setting latency timer of device 0000:00:1e.0 to 64
NET: Registered protocol family 2
IP route cache hash table entries: 131072 (order: 8, 1048576 bytes)
TCP established hash table entries: 65536 (order: 10, 4194304 bytes)
TCP bind hash table entries: 65536 (order: 9, 3670016 bytes)
TCP: Hash tables configured (established 65536 bind 65536)
TCP reno registered
io scheduler noop registered
io scheduler cfq registered (default)
Boot video device is 0000:01:00.0
Real Time Clock Driver v1.12ac
Linux agpgart interface v0.102
ACPI: PCI Interrupt 0000:03:00.0[A] -> GSI 19 (level, low) -> IRQ 19
PCI: Setting latency timer of device 0000:03:00.0 to 64
atl1 0000:03:00.0: version 2.0.7
8139too Fast Ethernet driver 0.9.28
ACPI: PCI Interrupt 0000:05:02.0[A] -> GSI 23 (level, low) -> IRQ 23
eth1: RealTek RTL8139 at 0xb800, 00:80:48:2e:06:2e, IRQ 23
eth1:  Identified 8139 chip type 'RTL-8100B/8139D'
netconsole: device eth0 not up yet, forcing it
atl1 0000:03:00.0: eth0 link is up 1000 Mbps full duplex
console [netcon0] enabled
netconsole: network logging started
ahci 0000:02:00.0: version 2.3
ACPI: PCI Interrupt 0000:02:00.0[A] -> GSI 16 (level, low) -> IRQ 16
ahci 0000:02:00.0: AHCI 0001.0000 32 slots 2 ports 3 Gbps 0x3 impl SATA mode
ahci 0000:02:00.0: flags: 64bit ncq pm led clo pmp pio slum part 
PCI: Setting latency timer of device 0000:02:00.0 to 64
scsi0 : ahci
scsi1 : ahci
ata1: SATA max UDMA/133 cmd 0xffffc20000024100 ctl 0x0000000000000000 bmdma 0x0000000000000000 irq 16
ata2: SATA max UDMA/133 cmd 0xffffc20000024180 ctl 0x0000000000000000 bmdma 0x0000000000000000 irq 16
ata1: SATA link down (SStatus 0 SControl 300)
ata2: SATA link down (SStatus 0 SControl 300)
ata_piix 0000:00:1f.2: version 2.12
ata_piix 0000:00:1f.2: MAP [ P0 P2 P1 P3 ]
ACPI: PCI Interrupt 0000:00:1f.2[B] -> GSI 19 (level, low) -> IRQ 19
PCI: Setting latency timer of device 0000:00:1f.2 to 64
scsi2 : ata_piix
scsi3 : ata_piix
ata3: SATA max UDMA/133 cmd 0x000000000001ec00 ctl 0x000000000001e882 bmdma 0x000000000001e400 irq 19
ata4: SATA max UDMA/133 cmd 0x000000000001e800 ctl 0x000000000001e482 bmdma 0x000000000001e408 irq 19
ata3.00: ATA-8: ST3750330AS, SD15, max UDMA/133
ata3.00: 1465149168 sectors, multi 16: LBA48 NCQ (depth 0/32)
ata3.01: ATA-7: ST3160811AS, 3.AAE, max UDMA/133
ata3.01: 312581808 sectors, multi 16: LBA48 NCQ (depth 0/32)
ata3.00: configured for UDMA/133
ata3.01: configured for UDMA/133
ata4.00: ATA-7: ST3250620AS, 3.AAE, max UDMA/133
ata4.00: 488397168 sectors, multi 16: LBA48 NCQ (depth 0/32)
ata4.00: configured for UDMA/133
scsi 2:0:0:0: Direct-Access     ATA      ST3750330AS      SD15 PQ: 0 ANSI: 5
sd 2:0:0:0: [sda] 1465149168 512-byte hardware sectors (750156 MB)
sd 2:0:0:0: [sda] Write Protect is off
sd 2:0:0:0: [sda] Mode Sense: 00 3a 00 00
sd 2:0:0:0: [sda] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
sd 2:0:0:0: [sda] 1465149168 512-byte hardware sectors (750156 MB)
sd 2:0:0:0: [sda] Write Protect is off
sd 2:0:0:0: [sda] Mode Sense: 00 3a 00 00
sd 2:0:0:0: [sda] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
 sda: sda1 sda2
sd 2:0:0:0: [sda] Attached SCSI disk
scsi 2:0:1:0: Direct-Access     ATA      ST3160811AS      3.AA PQ: 0 ANSI: 5
sd 2:0:1:0: [sdb] 312581808 512-byte hardware sectors (160042 MB)
sd 2:0:1:0: [sdb] Write Protect is off
sd 2:0:1:0: [sdb] Mode Sense: 00 3a 00 00
sd 2:0:1:0: [sdb] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
sd 2:0:1:0: [sdb] 312581808 512-byte hardware sectors (160042 MB)
sd 2:0:1:0: [sdb] Write Protect is off
sd 2:0:1:0: [sdb] Mode Sense: 00 3a 00 00
sd 2:0:1:0: [sdb] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
 sdb: unknown partition table
sd 2:0:1:0: [sdb] Attached SCSI disk
scsi 3:0:0:0: Direct-Access     ATA      ST3250620AS      3.AA PQ: 0 ANSI: 5
sd 3:0:0:0: [sdc] 488397168 512-byte hardware sectors (250059 MB)
sd 3:0:0:0: [sdc] Write Protect is off
sd 3:0:0:0: [sdc] Mode Sense: 00 3a 00 00
sd 3:0:0:0: [sdc] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
sd 3:0:0:0: [sdc] 488397168 512-byte hardware sectors (250059 MB)
sd 3:0:0:0: [sdc] Write Protect is off
sd 3:0:0:0: [sdc] Mode Sense: 00 3a 00 00
sd 3:0:0:0: [sdc] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
 sdc: sdc1
sd 3:0:0:0: [sdc] Attached SCSI disk
ata_piix 0000:00:1f.5: MAP [ P0 P2 P1 P3 ]
ACPI: PCI Interrupt 0000:00:1f.5[B] -> GSI 19 (level, low) -> IRQ 19
PCI: Setting latency timer of device 0000:00:1f.5 to 64
scsi4 : ata_piix
scsi5 : ata_piix
ata5: SATA max UDMA/133 cmd 0x000000000001d400 ctl 0x000000000001d082 bmdma 0x000000000001c880 irq 19
ata6: SATA max UDMA/133 cmd 0x000000000001d000 ctl 0x000000000001cc02 bmdma 0x000000000001c888 irq 19
ACPI: PCI Interrupt 0000:02:00.1[B] -> GSI 17 (level, low) -> IRQ 17
PCI: Setting latency timer of device 0000:02:00.1 to 64
scsi6 : pata_jmicron
scsi7 : pata_jmicron
ata7: PATA max UDMA/100 cmd 0x000000000001ac00 ctl 0x000000000001a882 bmdma 0x000000000001a400 irq 17
ata8: PATA max UDMA/100 cmd 0x000000000001a800 ctl 0x000000000001a482 bmdma 0x000000000001a408 irq 17
ata7.01: ATAPI: Optiarc DVD RW AD-7173A, 1-01, max UDMA/66
ata7.01: configured for UDMA/66
scsi 6:0:1:0: CD-ROM            Optiarc  DVD RW AD-7173A  1-01 PQ: 0 ANSI: 5
PNP: PS/2 Controller [PNP0303:PS2K] at 0x60,0x64 irq 1
PNP: PS/2 appears to have AUX port disabled, if this is incorrect please boot with i8042.nopnp
serio: i8042 KBD port at 0x60,0x64 irq 1
mice: PS/2 mouse device common for all mice
Advanced Linux Sound Architecture Driver Version 1.0.14 (Fri Jul 20 09:12:58 2007 UTC).
ACPI: PCI Interrupt 0000:00:1b.0[A] -> GSI 22 (level, low) -> IRQ 22
PCI: Setting latency timer of device 0000:00:1b.0 to 64
input: AT Translated Set 2 keyboard as /class/input/input0
ALSA device list:
  #0: HDA Intel at 0xfebf8000 irq 22
TCP cubic registered
NET: Registered protocol family 1
kjournald starting.  Commit interval 5 seconds
EXT3-fs: mounted filesystem with ordered data mode.
VFS: Mounted root (ext3 filesystem) readonly.
Freeing unused kernel memory: 200k freed
Write protecting the kernel read-only data: 3216k
usbcore: registered new interface driver usbfs
usbcore: registered new interface driver hub
usbcore: registered new device driver usb
ACPI: PCI Interrupt 0000:00:1a.7[C] -> GSI 18 (level, low) -> IRQ 18
PCI: Setting latency timer of device 0000:00:1a.7 to 64
ehci_hcd 0000:00:1a.7: EHCI Host Controller
ehci_hcd 0000:00:1a.7: new USB bus registered, assigned bus number 1
ehci_hcd 0000:00:1a.7: debug port 1
PCI: cache line size of 32 is not supported by device 0000:00:1a.7
ehci_hcd 0000:00:1a.7: irq 18, io mem 0xfebffc00
USB Universal Host Controller Interface driver v3.0
ehci_hcd 0000:00:1a.7: USB 2.0 started, EHCI 1.00, driver 10 Dec 2004
usb usb1: configuration #1 chosen from 1 choice
hub 1-0:1.0: USB hub found
hub 1-0:1.0: 4 ports detected
sr0: scsi3-mmc drive: 48x/48x writer dvd-ram cd/rw xa/form2 cdda tray
Uniform CD-ROM driver Revision: 3.20
sr 6:0:1:0: Attached scsi CD-ROM sr0
ACPI: PCI Interrupt 0000:00:1d.7[A] -> GSI 23 (level, low) -> IRQ 23
PCI: Setting latency timer of device 0000:00:1d.7 to 64
ehci_hcd 0000:00:1d.7: EHCI Host Controller
ehci_hcd 0000:00:1d.7: new USB bus registered, assigned bus number 2
ehci_hcd 0000:00:1d.7: debug port 1
PCI: cache line size of 32 is not supported by device 0000:00:1d.7
ehci_hcd 0000:00:1d.7: irq 23, io mem 0xfebff800
ehci_hcd 0000:00:1d.7: USB 2.0 started, EHCI 1.00, driver 10 Dec 2004
usb usb2: configuration #1 chosen from 1 choice
hub 2-0:1.0: USB hub found
hub 2-0:1.0: 6 ports detected
ACPI: PCI Interrupt 0000:00:1a.0[A] -> GSI 16 (level, low) -> IRQ 16
PCI: Setting latency timer of device 0000:00:1a.0 to 64
uhci_hcd 0000:00:1a.0: UHCI Host Controller
uhci_hcd 0000:00:1a.0: new USB bus registered, assigned bus number 3
uhci_hcd 0000:00:1a.0: irq 16, io base 0x0000dc00
usb usb3: configuration #1 chosen from 1 choice
hub 3-0:1.0: USB hub found
hub 3-0:1.0: 2 ports detected
ACPI: PCI Interrupt 0000:00:1a.1[B] -> GSI 17 (level, low) -> IRQ 17
PCI: Setting latency timer of device 0000:00:1a.1 to 64
uhci_hcd 0000:00:1a.1: UHCI Host Controller
uhci_hcd 0000:00:1a.1: new USB bus registered, assigned bus number 4
uhci_hcd 0000:00:1a.1: irq 17, io base 0x0000e000
usb usb4: configuration #1 chosen from 1 choice
hub 4-0:1.0: USB hub found
hub 4-0:1.0: 2 ports detected
ACPI: PCI Interrupt 0000:00:1d.0[A] -> GSI 23 (level, low) -> IRQ 23
PCI: Setting latency timer of device 0000:00:1d.0 to 64
uhci_hcd 0000:00:1d.0: UHCI Host Controller
uhci_hcd 0000:00:1d.0: new USB bus registered, assigned bus number 5
uhci_hcd 0000:00:1d.0: irq 23, io base 0x0000d480
usb usb5: configuration #1 chosen from 1 choice
hub 5-0:1.0: USB hub found
hub 5-0:1.0: 2 ports detected
ACPI: PCI Interrupt 0000:00:1d.1[B] -> GSI 19 (level, low) -> IRQ 19
PCI: Setting latency timer of device 0000:00:1d.1 to 64
uhci_hcd 0000:00:1d.1: UHCI Host Controller
uhci_hcd 0000:00:1d.1: new USB bus registered, assigned bus number 6
uhci_hcd 0000:00:1d.1: irq 19, io base 0x0000d800
usb usb6: configuration #1 chosen from 1 choice
hub 6-0:1.0: USB hub found
hub 6-0:1.0: 2 ports detected
ACPI: PCI Interrupt 0000:00:1d.2[C] -> GSI 18 (level, low) -> IRQ 18
PCI: Setting latency timer of device 0000:00:1d.2 to 64
uhci_hcd 0000:00:1d.2: UHCI Host Controller
uhci_hcd 0000:00:1d.2: new USB bus registered, assigned bus number 7
uhci_hcd 0000:00:1d.2: irq 18, io base 0x0000d880
usb usb7: configuration #1 chosen from 1 choice
hub 7-0:1.0: USB hub found
hub 7-0:1.0: 2 ports detected
EXT3 FS on sda2, internal journal
usbcore: registered new interface driver usblp
Adding 9775512k swap on /dev/sda1.  Priority:-1 extents:1 across:9775512k
ip_tables: (C) 2000-2006 Netfilter Core Team
Netfilter messages via NETLINK v0.30.
nf_conntrack version 0.5.0 (16384 buckets, 65536 max)
eth1: link up, 100Mbps, full-duplex, lpa 0x45E1


^ permalink raw reply	[flat|nested] 71+ messages in thread

* Re: atl1 64-bit => 32-bit DMA borkage (reproducible, bisected)
  2008-04-19 14:45               ` atl1 64-bit => 32-bit DMA borkage (reproducible, bisected) Alexey Dobriyan
@ 2008-04-20  2:54                 ` Jay Cliburn
  2008-04-20 11:14                   ` Alexey Dobriyan
  0 siblings, 1 reply; 71+ messages in thread
From: Jay Cliburn @ 2008-04-20  2:54 UTC (permalink / raw
  To: Alexey Dobriyan
  Cc: Luca Tettamanti, Chris Snook, Jeff Garzik, Pekka Enberg,
	Andrew Morton, linux-kernel, netdev, Christoph Lameter, torvalds

On Sat, 19 Apr 2008 18:45:35 +0400
Alexey Dobriyan <adobriyan@gmail.com> wrote:

> OK, nailed it.
> 
> It's commit 5f08e46b621a769e52a9545a23ab1d5fb2aec1d4 aka "atl1:
> disable broken 64-bit DMA".
> 
> With this commit in tree, I can reproduce either
> a) kmalloc-2048 corruption after initscripts shutdown eth0
> 	http://marc.info/?l=linux-kernel&m=120820360221261&w=2
> 
> b) or oopses at filp_close() first reported long ago
> 	(sorry, can't find that email)
> 
> c) or hard hang after initscripts shutdown eth0 with even SysRq not
> working. http://marc.info/?l=linux-kernel&m=120795046008115&w=2
> 
> I have two boxes one with atl1, 4G RAM with 2G remapped after 4G
> boundary, another with r8169 connected with just ethernet cable. NICs
> agree on 1Gbps speed.
> 
> So, it's enough to scp 200 MB git archive and immediately start
> rebooting sequence for horrors described above to appear. It's not
> 100% reproducible but more like 90%.

Do I understand correctly that these failures occur only while the
network interface is going down?

Jay

^ permalink raw reply	[flat|nested] 71+ messages in thread

* Re: atl1 64-bit => 32-bit DMA borkage (reproducible, bisected)
  2008-04-20 11:14                   ` Alexey Dobriyan
@ 2008-04-20 11:06                     ` Jay Cliburn
  2008-04-20 12:26                       ` Alexey Dobriyan
  0 siblings, 1 reply; 71+ messages in thread
From: Jay Cliburn @ 2008-04-20 11:06 UTC (permalink / raw
  To: Alexey Dobriyan
  Cc: Luca Tettamanti, Chris Snook, Jeff Garzik, Pekka Enberg,
	Andrew Morton, linux-kernel, netdev, Christoph Lameter, torvalds

On Sun, 20 Apr 2008 15:14:53 +0400
Alexey Dobriyan <adobriyan@gmail.com> wrote:

> On Sat, Apr 19, 2008 at 09:54:44PM -0500, Jay Cliburn wrote:
> > On Sat, 19 Apr 2008 18:45:35 +0400
> > Alexey Dobriyan <adobriyan@gmail.com> wrote:
> > 
> > > OK, nailed it.
> > > 
> > > It's commit 5f08e46b621a769e52a9545a23ab1d5fb2aec1d4 aka "atl1:
> > > disable broken 64-bit DMA".
> > > 
> > > With this commit in tree, I can reproduce either
> > > a) kmalloc-2048 corruption after initscripts shutdown eth0
> > > 	http://marc.info/?l=linux-kernel&m=120820360221261&w=2
> > > 
> > > b) or oopses at filp_close() first reported long ago
> > > 	(sorry, can't find that email)
> > > 
> > > c) or hard hang after initscripts shutdown eth0 with even SysRq
> > > not working.
> > > http://marc.info/?l=linux-kernel&m=120795046008115&w=2
> > > 
> > > I have two boxes one with atl1, 4G RAM with 2G remapped after 4G
> > > boundary, another with r8169 connected with just ethernet cable.
> > > NICs agree on 1Gbps speed.
> > > 
> > > So, it's enough to scp 200 MB git archive and immediately start
> > > rebooting sequence for horrors described above to appear. It's not
> > > 100% reproducible but more like 90%.
> > 
> > Do I understand correctly that these failures occur only while the
> > network interface is going down?
> 
> Yep. During up or running there were no problems with this card.
> 

One more question:  Does it happen whether or not you're using atl1 as
a netconsole?

^ permalink raw reply	[flat|nested] 71+ messages in thread

* Re: atl1 64-bit => 32-bit DMA borkage (reproducible, bisected)
  2008-04-20  2:54                 ` Jay Cliburn
@ 2008-04-20 11:14                   ` Alexey Dobriyan
  2008-04-20 11:06                     ` Jay Cliburn
  0 siblings, 1 reply; 71+ messages in thread
From: Alexey Dobriyan @ 2008-04-20 11:14 UTC (permalink / raw
  To: Jay Cliburn
  Cc: Luca Tettamanti, Chris Snook, Jeff Garzik, Pekka Enberg,
	Andrew Morton, linux-kernel, netdev, Christoph Lameter, torvalds

On Sat, Apr 19, 2008 at 09:54:44PM -0500, Jay Cliburn wrote:
> On Sat, 19 Apr 2008 18:45:35 +0400
> Alexey Dobriyan <adobriyan@gmail.com> wrote:
> 
> > OK, nailed it.
> > 
> > It's commit 5f08e46b621a769e52a9545a23ab1d5fb2aec1d4 aka "atl1:
> > disable broken 64-bit DMA".
> > 
> > With this commit in tree, I can reproduce either
> > a) kmalloc-2048 corruption after initscripts shutdown eth0
> > 	http://marc.info/?l=linux-kernel&m=120820360221261&w=2
> > 
> > b) or oopses at filp_close() first reported long ago
> > 	(sorry, can't find that email)
> > 
> > c) or hard hang after initscripts shutdown eth0 with even SysRq not
> > working. http://marc.info/?l=linux-kernel&m=120795046008115&w=2
> > 
> > I have two boxes one with atl1, 4G RAM with 2G remapped after 4G
> > boundary, another with r8169 connected with just ethernet cable. NICs
> > agree on 1Gbps speed.
> > 
> > So, it's enough to scp 200 MB git archive and immediately start
> > rebooting sequence for horrors described above to appear. It's not
> > 100% reproducible but more like 90%.
> 
> Do I understand correctly that these failures occur only while the
> network interface is going down?

Yep. During up or running there were no problems with this card.


^ permalink raw reply	[flat|nested] 71+ messages in thread

* Re: atl1 64-bit => 32-bit DMA borkage (reproducible, bisected)
  2008-04-20 11:06                     ` Jay Cliburn
@ 2008-04-20 12:26                       ` Alexey Dobriyan
  2008-04-20 18:37                         ` Jay Cliburn
  0 siblings, 1 reply; 71+ messages in thread
From: Alexey Dobriyan @ 2008-04-20 12:26 UTC (permalink / raw
  To: Jay Cliburn
  Cc: Luca Tettamanti, Chris Snook, Jeff Garzik, Pekka Enberg,
	Andrew Morton, linux-kernel, netdev, Christoph Lameter, torvalds

On Sun, Apr 20, 2008 at 06:06:07AM -0500, Jay Cliburn wrote:
> On Sun, 20 Apr 2008 15:14:53 +0400
> Alexey Dobriyan <adobriyan@gmail.com> wrote:
> 
> > On Sat, Apr 19, 2008 at 09:54:44PM -0500, Jay Cliburn wrote:
> > > On Sat, 19 Apr 2008 18:45:35 +0400
> > > Alexey Dobriyan <adobriyan@gmail.com> wrote:
> > > 
> > > > OK, nailed it.
> > > > 
> > > > It's commit 5f08e46b621a769e52a9545a23ab1d5fb2aec1d4 aka "atl1:
> > > > disable broken 64-bit DMA".
> > > > 
> > > > With this commit in tree, I can reproduce either
> > > > a) kmalloc-2048 corruption after initscripts shutdown eth0
> > > > 	http://marc.info/?l=linux-kernel&m=120820360221261&w=2
> > > > 
> > > > b) or oopses at filp_close() first reported long ago
> > > > 	(sorry, can't find that email)
> > > > 
> > > > c) or hard hang after initscripts shutdown eth0 with even SysRq
> > > > not working.
> > > > http://marc.info/?l=linux-kernel&m=120795046008115&w=2
> > > > 
> > > > I have two boxes one with atl1, 4G RAM with 2G remapped after 4G
> > > > boundary, another with r8169 connected with just ethernet cable.
> > > > NICs agree on 1Gbps speed.
> > > > 
> > > > So, it's enough to scp 200 MB git archive and immediately start
> > > > rebooting sequence for horrors described above to appear. It's not
> > > > 100% reproducible but more like 90%.
> > > 
> > > Do I understand correctly that these failures occur only while the
> > > network interface is going down?
> > 
> > Yep. During up or running there were no problems with this card.
> > 
> 
> One more question:  Does it happen whether or not you're using atl1 as
> a netconsole?

Without netconsole bugs happens too.


^ permalink raw reply	[flat|nested] 71+ messages in thread

* Re: atl1 64-bit => 32-bit DMA borkage (reproducible, bisected)
  2008-04-20 12:26                       ` Alexey Dobriyan
@ 2008-04-20 18:37                         ` Jay Cliburn
  2008-04-20 20:55                           ` Alexey Dobriyan
  0 siblings, 1 reply; 71+ messages in thread
From: Jay Cliburn @ 2008-04-20 18:37 UTC (permalink / raw
  To: Alexey Dobriyan
  Cc: Luca Tettamanti, Chris Snook, Jeff Garzik, Pekka Enberg,
	Andrew Morton, linux-kernel, netdev, Christoph Lameter, torvalds

On Sun, 20 Apr 2008 16:26:31 +0400
Alexey Dobriyan <adobriyan@gmail.com> wrote:

> On Sun, Apr 20, 2008 at 06:06:07AM -0500, Jay Cliburn wrote:
> > On Sun, 20 Apr 2008 15:14:53 +0400
> > Alexey Dobriyan <adobriyan@gmail.com> wrote:
> > 
> > > On Sat, Apr 19, 2008 at 09:54:44PM -0500, Jay Cliburn wrote:
> > > > On Sat, 19 Apr 2008 18:45:35 +0400
> > > > Alexey Dobriyan <adobriyan@gmail.com> wrote:
[...]
> > > > > So, it's enough to scp 200 MB git archive and immediately
> > > > > start rebooting sequence for horrors described above to
> > > > > appear. It's not 100% reproducible but more like 90%.
> > > > 
> > > > Do I understand correctly that these failures occur only while
> > > > the network interface is going down?
> > > 
> > > Yep. During up or running there were no problems with this card.
> > > 
> > 
> > One more question:  Does it happen whether or not you're using atl1
> > as a netconsole?
> 
> Without netconsole bugs happens too.
> 

I can't duplicate this error, but it's probably because my machine
doesn't have 4GB of memory.

I have one report in Febroary 2008 of another user encountering strange
oopses in 2.6.23.12 and 2.6.24 whenever he downed the interface.  I
suspect your experience is a repeat of that.

Just to be clear, you transfer about 200MB to the NIC (Rx direction),
then immediately reboot, right?  Can you duplicate the problem if you
simply ifconfig down instead of rebooting after the transfer?  

Thanks for your help.

Jay

^ permalink raw reply	[flat|nested] 71+ messages in thread

* Re: atl1 64-bit => 32-bit DMA borkage (reproducible, bisected)
  2008-04-20 18:37                         ` Jay Cliburn
@ 2008-04-20 20:55                           ` Alexey Dobriyan
  2008-04-21 18:42                             ` Chris Snook
  2008-04-22  2:08                             ` Jay Cliburn
  0 siblings, 2 replies; 71+ messages in thread
From: Alexey Dobriyan @ 2008-04-20 20:55 UTC (permalink / raw
  To: Jay Cliburn
  Cc: Luca Tettamanti, Chris Snook, Jeff Garzik, Pekka Enberg,
	Andrew Morton, linux-kernel, netdev, Christoph Lameter, torvalds

On Sun, Apr 20, 2008 at 01:37:04PM -0500, Jay Cliburn wrote:
> On Sun, 20 Apr 2008 16:26:31 +0400
> Alexey Dobriyan <adobriyan@gmail.com> wrote:
> 
> > On Sun, Apr 20, 2008 at 06:06:07AM -0500, Jay Cliburn wrote:
> > > On Sun, 20 Apr 2008 15:14:53 +0400
> > > Alexey Dobriyan <adobriyan@gmail.com> wrote:
> > > 
> > > > On Sat, Apr 19, 2008 at 09:54:44PM -0500, Jay Cliburn wrote:
> > > > > On Sat, 19 Apr 2008 18:45:35 +0400
> > > > > Alexey Dobriyan <adobriyan@gmail.com> wrote:
> [...]
> > > > > > So, it's enough to scp 200 MB git archive and immediately
> > > > > > start rebooting sequence for horrors described above to
> > > > > > appear. It's not 100% reproducible but more like 90%.
> > > > > 
> > > > > Do I understand correctly that these failures occur only while
> > > > > the network interface is going down?
> > > > 
> > > > Yep. During up or running there were no problems with this card.
> > > > 
> > > 
> > > One more question:  Does it happen whether or not you're using atl1
> > > as a netconsole?
> > 
> > Without netconsole bugs happens too.
> > 
> 
> I can't duplicate this error, but it's probably because my machine
> doesn't have 4GB of memory.
> 
> I have one report in Febroary 2008 of another user encountering strange
> oopses in 2.6.23.12 and 2.6.24 whenever he downed the interface.  I
> suspect your experience is a repeat of that.
> 
> Just to be clear, you transfer about 200MB to the NIC (Rx direction),
> then immediately reboot, right?

Yup!

> Can you duplicate the problem if you
> simply ifconfig down instead of rebooting after the transfer?  

Aha, ifconfig down is enough. Here is how reproducer looks like now:

	./sync-linux-linus && ssh core2 "sudo /sbin/ifconfig eth0 down"

where first script is basically scp(1).

Also, booting with 1G or 2G of RAM (mem=1024m) makes issue go away.

printk at dev_close() time shows that NETIF_F_HIGHDMA was not somehow
enabled.


^ permalink raw reply	[flat|nested] 71+ messages in thread

* Re: atl1 64-bit => 32-bit DMA borkage (reproducible, bisected)
  2008-04-20 20:55                           ` Alexey Dobriyan
@ 2008-04-21 18:42                             ` Chris Snook
  2008-04-21 19:56                               ` Alexey Dobriyan
  2008-04-22  2:08                             ` Jay Cliburn
  1 sibling, 1 reply; 71+ messages in thread
From: Chris Snook @ 2008-04-21 18:42 UTC (permalink / raw
  To: Alexey Dobriyan
  Cc: Jay Cliburn, Luca Tettamanti, Jeff Garzik, Pekka Enberg,
	Andrew Morton, linux-kernel, netdev, Christoph Lameter, torvalds

Alexey Dobriyan wrote:
> On Sun, Apr 20, 2008 at 01:37:04PM -0500, Jay Cliburn wrote:
>> On Sun, 20 Apr 2008 16:26:31 +0400
>> Alexey Dobriyan <adobriyan@gmail.com> wrote:
>>
>>> On Sun, Apr 20, 2008 at 06:06:07AM -0500, Jay Cliburn wrote:
>>>> On Sun, 20 Apr 2008 15:14:53 +0400
>>>> Alexey Dobriyan <adobriyan@gmail.com> wrote:
>>>>
>>>>> On Sat, Apr 19, 2008 at 09:54:44PM -0500, Jay Cliburn wrote:
>>>>>> On Sat, 19 Apr 2008 18:45:35 +0400
>>>>>> Alexey Dobriyan <adobriyan@gmail.com> wrote:
>> [...]
>>>>>>> So, it's enough to scp 200 MB git archive and immediately
>>>>>>> start rebooting sequence for horrors described above to
>>>>>>> appear. It's not 100% reproducible but more like 90%.
>>>>>> Do I understand correctly that these failures occur only while
>>>>>> the network interface is going down?
>>>>> Yep. During up or running there were no problems with this card.
>>>>>
>>>> One more question:  Does it happen whether or not you're using atl1
>>>> as a netconsole?
>>> Without netconsole bugs happens too.
>>>
>> I can't duplicate this error, but it's probably because my machine
>> doesn't have 4GB of memory.
>>
>> I have one report in Febroary 2008 of another user encountering strange
>> oopses in 2.6.23.12 and 2.6.24 whenever he downed the interface.  I
>> suspect your experience is a repeat of that.
>>
>> Just to be clear, you transfer about 200MB to the NIC (Rx direction),
>> then immediately reboot, right?
> 
> Yup!
> 
>> Can you duplicate the problem if you
>> simply ifconfig down instead of rebooting after the transfer?  
> 
> Aha, ifconfig down is enough. Here is how reproducer looks like now:
> 
> 	./sync-linux-linus && ssh core2 "sudo /sbin/ifconfig eth0 down"
> 
> where first script is basically scp(1).
> 
> Also, booting with 1G or 2G of RAM (mem=1024m) makes issue go away.
> 
> printk at dev_close() time shows that NETIF_F_HIGHDMA was not somehow
> enabled.
> 

Does the problem go away with iommu=nomerge?  If so, I suspect we're not 
properly flushing an iowrite somewhere.

-- Chris

^ permalink raw reply	[flat|nested] 71+ messages in thread

* Re: atl1 64-bit => 32-bit DMA borkage (reproducible, bisected)
  2008-04-21 18:42                             ` Chris Snook
@ 2008-04-21 19:56                               ` Alexey Dobriyan
  0 siblings, 0 replies; 71+ messages in thread
From: Alexey Dobriyan @ 2008-04-21 19:56 UTC (permalink / raw
  To: Chris Snook
  Cc: Jay Cliburn, Luca Tettamanti, Jeff Garzik, Pekka Enberg,
	Andrew Morton, linux-kernel, netdev, Christoph Lameter, torvalds

On Mon, Apr 21, 2008 at 02:42:42PM -0400, Chris Snook wrote:
> Alexey Dobriyan wrote:
>> On Sun, Apr 20, 2008 at 01:37:04PM -0500, Jay Cliburn wrote:
>>> On Sun, 20 Apr 2008 16:26:31 +0400
>>> Alexey Dobriyan <adobriyan@gmail.com> wrote:
>>>
>>>> On Sun, Apr 20, 2008 at 06:06:07AM -0500, Jay Cliburn wrote:
>>>>> On Sun, 20 Apr 2008 15:14:53 +0400
>>>>> Alexey Dobriyan <adobriyan@gmail.com> wrote:
>>>>>
>>>>>> On Sat, Apr 19, 2008 at 09:54:44PM -0500, Jay Cliburn wrote:
>>>>>>> On Sat, 19 Apr 2008 18:45:35 +0400
>>>>>>> Alexey Dobriyan <adobriyan@gmail.com> wrote:
>>> [...]
>>>>>>>> So, it's enough to scp 200 MB git archive and immediately
>>>>>>>> start rebooting sequence for horrors described above to
>>>>>>>> appear. It's not 100% reproducible but more like 90%.
>>>>>>> Do I understand correctly that these failures occur only while
>>>>>>> the network interface is going down?
>>>>>> Yep. During up or running there were no problems with this card.
>>>>>>
>>>>> One more question:  Does it happen whether or not you're using atl1
>>>>> as a netconsole?
>>>> Without netconsole bugs happens too.
>>>>
>>> I can't duplicate this error, but it's probably because my machine
>>> doesn't have 4GB of memory.
>>>
>>> I have one report in Febroary 2008 of another user encountering strange
>>> oopses in 2.6.23.12 and 2.6.24 whenever he downed the interface.  I
>>> suspect your experience is a repeat of that.
>>>
>>> Just to be clear, you transfer about 200MB to the NIC (Rx direction),
>>> then immediately reboot, right?
>> Yup!
>>> Can you duplicate the problem if you
>>> simply ifconfig down instead of rebooting after the transfer?  
>> Aha, ifconfig down is enough. Here is how reproducer looks like now:
>> 	./sync-linux-linus && ssh core2 "sudo /sbin/ifconfig eth0 down"
>> where first script is basically scp(1).
>> Also, booting with 1G or 2G of RAM (mem=1024m) makes issue go away.
>> printk at dev_close() time shows that NETIF_F_HIGHDMA was not somehow
>> enabled.
>
> Does the problem go away with iommu=nomerge?  If so, I suspect we're not 
> properly flushing an iowrite somewhere.

nomerge doesn't help.


^ permalink raw reply	[flat|nested] 71+ messages in thread

* Re: atl1 64-bit => 32-bit DMA borkage (reproducible, bisected)
  2008-04-20 20:55                           ` Alexey Dobriyan
  2008-04-21 18:42                             ` Chris Snook
@ 2008-04-22  2:08                             ` Jay Cliburn
  2008-04-22 19:02                               ` Alexey Dobriyan
  2008-04-26  0:57                               ` Jay Cliburn
  1 sibling, 2 replies; 71+ messages in thread
From: Jay Cliburn @ 2008-04-22  2:08 UTC (permalink / raw
  To: Alexey Dobriyan
  Cc: Luca Tettamanti, Chris Snook, Jeff Garzik, Pekka Enberg,
	Andrew Morton, linux-kernel, netdev, Christoph Lameter, torvalds

On Mon, 21 Apr 2008 00:55:00 +0400
Alexey Dobriyan <adobriyan@gmail.com> wrote:

> Aha, ifconfig down is enough. Here is how reproducer looks like now:
> 
> 	./sync-linux-linus && ssh core2 "sudo /sbin/ifconfig eth0
> down"
> 
> where first script is basically scp(1).
> 
> Also, booting with 1G or 2G of RAM (mem=1024m) makes issue go away.
> 
> printk at dev_close() time shows that NETIF_F_HIGHDMA was not somehow
> enabled.
> 

Alexey, can you please try this (very minimally tested) patch?

diff --git a/drivers/net/atlx/atl1.c b/drivers/net/atlx/atl1.c
index 5586fc6..07fe5c0 100644
--- a/drivers/net/atlx/atl1.c
+++ b/drivers/net/atlx/atl1.c
@@ -1115,9 +1115,6 @@ static void atl1_free_ring_resources(struct atl1_adapter *adapter)
 	struct atl1_rrd_ring *rrd_ring = &adapter->rrd_ring;
 	struct atl1_ring_header *ring_header = &adapter->ring_header;
 
-	atl1_clean_tx_ring(adapter);
-	atl1_clean_rx_ring(adapter);
-
 	kfree(tpd_ring->buffer_info);
 	pci_free_consistent(pdev, ring_header->size, ring_header->desc,
 		ring_header->dma);
@@ -3423,6 +3420,8 @@ static int atl1_set_ringparam(struct net_device *netdev,
 		adapter->rrd_ring = rrd_old;
 		adapter->tpd_ring = tpd_old;
 		adapter->ring_header = rhdr_old;
+		atl1_clean_tx_ring(adapter);
+		atl1_clean_rx_ring(adapter);
 		atl1_free_ring_resources(adapter);
 		adapter->rfd_ring = rfd_new;
 		adapter->rrd_ring = rrd_new;

^ permalink raw reply related	[flat|nested] 71+ messages in thread

* Re: atl1 64-bit => 32-bit DMA borkage (reproducible, bisected)
  2008-04-22  2:08                             ` Jay Cliburn
@ 2008-04-22 19:02                               ` Alexey Dobriyan
  2008-04-26  0:57                               ` Jay Cliburn
  1 sibling, 0 replies; 71+ messages in thread
From: Alexey Dobriyan @ 2008-04-22 19:02 UTC (permalink / raw
  To: Jay Cliburn
  Cc: Luca Tettamanti, Chris Snook, Jeff Garzik, Pekka Enberg,
	Andrew Morton, linux-kernel, netdev, Christoph Lameter, torvalds

[-- Warning: decoded text below may be mangled, UTF-8 assumed --]
[-- Attachment #1: Type: text/plain; charset=unknown-8bit, Size: 4583 bytes --]

On Mon, Apr 21, 2008 at 09:08:21PM -0500, Jay Cliburn wrote:
> On Mon, 21 Apr 2008 00:55:00 +0400
> Alexey Dobriyan <adobriyan@gmail.com> wrote:
> 
> > Aha, ifconfig down is enough. Here is how reproducer looks like now:
> > 
> > 	./sync-linux-linus && ssh core2 "sudo /sbin/ifconfig eth0
> > down"
> > 
> > where first script is basically scp(1).
> > 
> > Also, booting with 1G or 2G of RAM (mem=1024m) makes issue go away.
> > 
> > printk at dev_close() time shows that NETIF_F_HIGHDMA was not somehow
> > enabled.
> > 
> 
> Alexey, can you please try this (very minimally tested) patch?
> 
> diff --git a/drivers/net/atlx/atl1.c b/drivers/net/atlx/atl1.c
> index 5586fc6..07fe5c0 100644
> --- a/drivers/net/atlx/atl1.c
> +++ b/drivers/net/atlx/atl1.c
> @@ -1115,9 +1115,6 @@ static void atl1_free_ring_resources(struct atl1_adapter *adapter)
>  	struct atl1_rrd_ring *rrd_ring = &adapter->rrd_ring;
>  	struct atl1_ring_header *ring_header = &adapter->ring_header;
>  
> -	atl1_clean_tx_ring(adapter);
> -	atl1_clean_rx_ring(adapter);
> -
>  	kfree(tpd_ring->buffer_info);
>  	pci_free_consistent(pdev, ring_header->size, ring_header->desc,
>  		ring_header->dma);
> @@ -3423,6 +3420,8 @@ static int atl1_set_ringparam(struct net_device *netdev,
>  		adapter->rrd_ring = rrd_old;
>  		adapter->tpd_ring = tpd_old;
>  		adapter->ring_header = rhdr_old;
> +		atl1_clean_tx_ring(adapter);
> +		atl1_clean_rx_ring(adapter);
>  		atl1_free_ring_resources(adapter);

Patch doesn't help unfortunately.

BTW, below is clean corruption trace:


atl1 0000:03:00.0: eth0 link is up 1000 Mbps full duplex
=============================================================================
BUG kmalloc-2048: Poison overwritten
-----------------------------------------------------------------------------

INFO: 0xffff81017ed7a97a-0xffff81017ed7af71. First byte 0x0 instead of 0x6b
INFO: Allocated in dev_alloc_skb+0x18/0x30 age=23894 cpu=1 pid=30255
INFO: Freed in skb_release_data+0x7a/0xc0 age=20700 cpu=0 pid=0
INFO: Slab 0xffffe200053bf240 used=12 fp=0xffff81017ed7a968 flags=0x17c000000040c3
INFO: Object 0xffff81017ed7a968 @offset=10600 fp=0xffff81017ed7ca88

Bytes b4 0xffff81017ed7a958:  14 09 a7 01 01 00 00 00 5a 5a 5a 5a 5a 5a 5a 5a ..§.....ZZZZZZZZ
  Object 0xffff81017ed7a968:  6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b kkkkkkkkkkkkkkkk
  Object 0xffff81017ed7a978:  6b 6b 00 18 f3 a2 9f 90 00 1b 38 af 22 49 08 00 kk..ó¢....8¯"I..
  Object 0xffff81017ed7a988:  45 10 00 4c a4 9f 40 00 40 11 d2 fe c0 a8 00 2a E..L¤.@.@.ÒþÀ¨.*
  Object 0xffff81017ed7a998:  59 6f a8 b1 9d e9 00 7b 00 38 58 29 23 00 00 00 Yo¨±.é.{.8X)#...
  Object 0xffff81017ed7a9a8:  00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 ................
  Object 0xffff81017ed7a9b8:  00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 ................
  Object 0xffff81017ed7a9c8:  00 00 00 00 1e 31 61 fa 08 5e 9a 73 de cf ce 94 .....1aú.^.sÞÏÎ.
  Object 0xffff81017ed7a9d8:  63 64 65 66 67 68 6a 69 6b 6c 6d 6e 6f 70 71 72 cdefghjiklmnopqr
 Redzone 0xffff81017ed7b168:  bb bb bb bb bb bb bb bb                         »»»»»»»»        
 Padding 0xffff81017ed7b1a8:  5a 5a 5a 5a 5a 5a 5a 5a                         ZZZZZZZZ        
Pid: 31677, comm: ifconfig Not tainted 2.6.25-3925e6fc1f774048404fdd910b0345b06c699eb4 #5

Call Trace:
 [<ffffffff80288277>] print_trailer+0xe7/0x170
 [<ffffffff802883a5>] check_bytes_and_report+0xa5/0xd0
 [<ffffffff80288678>] check_object+0xa8/0x250
 [<ffffffff80289975>] __slab_alloc+0x535/0x690
 [<ffffffff80253f3e>] ? mark_held_locks+0x3e/0x80
 [<ffffffff803f2fd8>] ? dev_alloc_skb+0x18/0x30
 [<ffffffff8028aff6>] __kmalloc_track_caller+0xe6/0x100
 [<ffffffff803f2fd8>] ? dev_alloc_skb+0x18/0x30
 [<ffffffff803f2b8f>] __alloc_skb+0x6f/0x160
 [<ffffffff803f2fd8>] dev_alloc_skb+0x18/0x30
 [<ffffffff8036512a>] atl1_alloc_rx_buffers+0x11a/0x260
 [<ffffffff80366dc7>] atl1_up+0x77/0x750
 [<ffffffff80367a0b>] atl1_open+0x3b/0x50
 [<ffffffff803fa3fa>] dev_open+0x5a/0x90
 [<ffffffff803f8ca9>] dev_change_flags+0x99/0x1b0
 [<ffffffff8043d1d2>] devinet_ioctl+0x592/0x740
 [<ffffffff803fa229>] ? dev_ioctl+0x479/0x550
 [<ffffffff8043d891>] inet_ioctl+0x61/0x80
 [<ffffffff803eaa16>] sock_ioctl+0x56/0x240
 [<ffffffff8029b271>] vfs_ioctl+0x31/0x90
 [<ffffffff8029b343>] do_vfs_ioctl+0x73/0x2d0
 [<ffffffff8029b5ea>] sys_ioctl+0x4a/0x80
 [<ffffffff8020b54b>] system_call_after_swapgs+0x7b/0x80

FIX kmalloc-2048: Restoring 0xffff81017ed7a97a-0xffff81017ed7af71=0x6b

FIX kmalloc-2048: Marking all objects used
atl1 0000:03:00.0: eth0 link is up 1000 Mbps full duplex


^ permalink raw reply	[flat|nested] 71+ messages in thread

* Re: atl1 64-bit => 32-bit DMA borkage (reproducible, bisected)
  2008-04-22  2:08                             ` Jay Cliburn
  2008-04-22 19:02                               ` Alexey Dobriyan
@ 2008-04-26  0:57                               ` Jay Cliburn
  2008-04-28  6:42                                 ` Alexey Dobriyan
  2008-05-04 21:15                                 ` Alexey Dobriyan
  1 sibling, 2 replies; 71+ messages in thread
From: Jay Cliburn @ 2008-04-26  0:57 UTC (permalink / raw
  To: Alexey Dobriyan
  Cc: Jay Cliburn, Luca Tettamanti, Chris Snook, Jeff Garzik,
	Pekka Enberg, Andrew Morton, linux-kernel, netdev,
	Christoph Lameter, torvalds

On Mon, 21 Apr 2008 21:08:21 -0500
Jay Cliburn <jacliburn@bellsouth.net> wrote:

> 
> Alexey, can you please try this (very minimally tested) patch?

Alexey, have you found time to try this patch yet?

Thanks.

> 
> diff --git a/drivers/net/atlx/atl1.c b/drivers/net/atlx/atl1.c
> index 5586fc6..07fe5c0 100644
> --- a/drivers/net/atlx/atl1.c
> +++ b/drivers/net/atlx/atl1.c
> @@ -1115,9 +1115,6 @@ static void atl1_free_ring_resources(struct
> atl1_adapter *adapter) struct atl1_rrd_ring *rrd_ring =
> &adapter->rrd_ring; struct atl1_ring_header *ring_header =
> &adapter->ring_header; 
> -	atl1_clean_tx_ring(adapter);
> -	atl1_clean_rx_ring(adapter);
> -
>  	kfree(tpd_ring->buffer_info);
>  	pci_free_consistent(pdev, ring_header->size,
> ring_header->desc, ring_header->dma);
> @@ -3423,6 +3420,8 @@ static int atl1_set_ringparam(struct net_device
> *netdev, adapter->rrd_ring = rrd_old;
>  		adapter->tpd_ring = tpd_old;
>  		adapter->ring_header = rhdr_old;
> +		atl1_clean_tx_ring(adapter);
> +		atl1_clean_rx_ring(adapter);
>  		atl1_free_ring_resources(adapter);
>  		adapter->rfd_ring = rfd_new;
>  		adapter->rrd_ring = rrd_new;

^ permalink raw reply	[flat|nested] 71+ messages in thread

* Re: atl1 64-bit => 32-bit DMA borkage (reproducible, bisected)
  2008-04-26  0:57                               ` Jay Cliburn
@ 2008-04-28  6:42                                 ` Alexey Dobriyan
  2008-05-04 21:15                                 ` Alexey Dobriyan
  1 sibling, 0 replies; 71+ messages in thread
From: Alexey Dobriyan @ 2008-04-28  6:42 UTC (permalink / raw
  To: Jay Cliburn
  Cc: Luca Tettamanti, Chris Snook, Jeff Garzik, Pekka Enberg,
	Andrew Morton, linux-kernel, netdev, Christoph Lameter, torvalds

On Fri, Apr 25, 2008 at 07:57:43PM -0500, Jay Cliburn wrote:
> On Mon, 21 Apr 2008 21:08:21 -0500
> Jay Cliburn <jacliburn@bellsouth.net> wrote:
> 
> > 
> > Alexey, can you please try this (very minimally tested) patch?
> 
> Alexey, have you found time to try this patch yet?

I've tried it and it doesn't help.
http://marc.info/?l=linux-netdev&m=120888791230434&w=2

> > --- a/drivers/net/atlx/atl1.c
> > +++ b/drivers/net/atlx/atl1.c
> > @@ -1115,9 +1115,6 @@ static void atl1_free_ring_resources(struct
> > atl1_adapter *adapter) struct atl1_rrd_ring *rrd_ring =
> > &adapter->rrd_ring; struct atl1_ring_header *ring_header =
> > &adapter->ring_header; 
> > -	atl1_clean_tx_ring(adapter);
> > -	atl1_clean_rx_ring(adapter);
> > -
> >  	kfree(tpd_ring->buffer_info);
> >  	pci_free_consistent(pdev, ring_header->size,
> > ring_header->desc, ring_header->dma);
> > @@ -3423,6 +3420,8 @@ static int atl1_set_ringparam(struct net_device
> > *netdev, adapter->rrd_ring = rrd_old;
> >  		adapter->tpd_ring = tpd_old;
> >  		adapter->ring_header = rhdr_old;
> > +		atl1_clean_tx_ring(adapter);
> > +		atl1_clean_rx_ring(adapter);
> >  		atl1_free_ring_resources(adapter);
> >  		adapter->rfd_ring = rfd_new;
> >  		adapter->rrd_ring = rrd_new;


^ permalink raw reply	[flat|nested] 71+ messages in thread

* Re: atl1 64-bit => 32-bit DMA borkage (reproducible, bisected)
  2008-04-26  0:57                               ` Jay Cliburn
  2008-04-28  6:42                                 ` Alexey Dobriyan
@ 2008-05-04 21:15                                 ` Alexey Dobriyan
  2008-05-05  0:31                                   ` Jay Cliburn
  2008-05-06 16:02                                   ` Jay Cliburn
  1 sibling, 2 replies; 71+ messages in thread
From: Alexey Dobriyan @ 2008-05-04 21:15 UTC (permalink / raw
  To: Jay Cliburn
  Cc: Luca Tettamanti, Chris Snook, Jeff Garzik, Pekka Enberg,
	Andrew Morton, linux-kernel, netdev, Christoph Lameter, torvalds

Looking at how other netdevice drivers:

8139too and others checks netif_running() in interrupt handler.

r8169 has scary "50k$" question comment re irqs disabled after
interacting with hardware.

But the r8169 case should be fixed by atlx_irq_disable()?

Writes to REG_IMR, REG_ISR are commented in atl1_reset_hw(), why?
(I'll test that soon)

Do we have a theory why changing from 64-bit DMA mask to 32-bit mask
resurrects the bug? NIC here never showed any sort of corruption
described in commit which banned 64-bit DMA.


^ permalink raw reply	[flat|nested] 71+ messages in thread

* Re: atl1 64-bit => 32-bit DMA borkage (reproducible, bisected)
  2008-05-04 21:15                                 ` Alexey Dobriyan
@ 2008-05-05  0:31                                   ` Jay Cliburn
  2008-05-05  0:34                                     ` Jay Cliburn
  2008-05-06 16:02                                   ` Jay Cliburn
  1 sibling, 1 reply; 71+ messages in thread
From: Jay Cliburn @ 2008-05-05  0:31 UTC (permalink / raw
  To: Alexey Dobriyan
  Cc: Luca Tettamanti, Chris Snook, Jeff Garzik, Pekka Enberg,
	Andrew Morton, linux-kernel, netdev, Christoph Lameter, torvalds

On Mon, 5 May 2008 01:15:07 +0400
Alexey Dobriyan <adobriyan@gmail.com> wrote:

> Looking at how other netdevice drivers:
> 
> 8139too and others checks netif_running() in interrupt handler.
> 
> r8169 has scary "50k$" question comment re irqs disabled after
> interacting with hardware.
> 
> But the r8169 case should be fixed by atlx_irq_disable()?

Agreed.

> 
> Writes to REG_IMR, REG_ISR are commented in atl1_reset_hw(), why?

Came from the vendor that way.

> (I'll test that soon)
> 
> Do we have a theory why changing from 64-bit DMA mask to 32-bit mask
> resurrects the bug? NIC here never showed any sort of corruption
> described in commit which banned 64-bit DMA.

We had multiple reports of users who encountered repeated memory
corruption when transferring large files while running with a 64-bit DMA
mask.  Chris Snook noticed the upper 32 bits of the descriptor address
register are shared among five other registers, each containing the low
bits for one of five descriptors.  All the descriptors, therefore, have
to live within the same 4GB address space.

I'll keep poking at it as time permits through the week, but I probably
won't be able to devote a whole lot of time to it until next weekend.

^ permalink raw reply	[flat|nested] 71+ messages in thread

* Re: atl1 64-bit => 32-bit DMA borkage (reproducible, bisected)
  2008-05-05  0:31                                   ` Jay Cliburn
@ 2008-05-05  0:34                                     ` Jay Cliburn
  0 siblings, 0 replies; 71+ messages in thread
From: Jay Cliburn @ 2008-05-05  0:34 UTC (permalink / raw
  To: Jay Cliburn
  Cc: Alexey Dobriyan, Luca Tettamanti, Chris Snook, Jeff Garzik,
	Pekka Enberg, Andrew Morton, linux-kernel, netdev,
	Christoph Lameter, torvalds

On Sun, 4 May 2008 19:31:28 -0500
Jay Cliburn <jacliburn@bellsouth.net> wrote:

> All the descriptors, therefore, have to live within the same 4GB address
> space.

Make that "...within the same 2GB address space."

^ permalink raw reply	[flat|nested] 71+ messages in thread

* Re: atl1 64-bit => 32-bit DMA borkage (reproducible, bisected)
  2008-05-04 21:15                                 ` Alexey Dobriyan
  2008-05-05  0:31                                   ` Jay Cliburn
@ 2008-05-06 16:02                                   ` Jay Cliburn
  2008-05-09 19:51                                     ` Alexey Dobriyan
  1 sibling, 1 reply; 71+ messages in thread
From: Jay Cliburn @ 2008-05-06 16:02 UTC (permalink / raw
  To: Alexey Dobriyan
  Cc: Luca Tettamanti, Chris Snook, Jeff Garzik, Pekka Enberg,
	Andrew Morton, linux-kernel, netdev, Christoph Lameter, torvalds

On Mon, 5 May 2008 01:15:07 +0400
Alexey Dobriyan <adobriyan@gmail.com> wrote:

> Looking at how other netdevice drivers:
> 
> 8139too and others checks netif_running() in interrupt handler.
> 
> r8169 has scary "50k$" question comment re irqs disabled after
> interacting with hardware.
> 
> But the r8169 case should be fixed by atlx_irq_disable()?
> 
> Writes to REG_IMR, REG_ISR are commented in atl1_reset_hw(), why?
> (I'll test that soon)

I've tried all the stuff you mentioned above, and more, to prevent the
memory corruption, all to no avail.

I booted with mem=4000M and didn't hit the bug.  I diffed dmesg between
booting with mem=4000M and booting without it, and found that iommu
was being disabled when booting with full memory:

--- dmesg-4000.txt      2008-05-06 10:14:07.000000000 -0500
+++ dmesg-4096.txt      2008-05-06 10:09:19.000000000 -0500
@@ -1,5 +1,5 @@
 Linux version 2.6.26-rc1 (jcliburn@finch.hogchain.net) (gcc version 4.1.2 20070
925 (Red Hat 4.1.2-27)) #4 SMP Mon May 5 18:03:48 CDT 2008
-Command line: ro root=LABEL=/1 console=ttyS0,38400 console=tty0 slub_debug=FZPU mem=4000M
+Command line: ro root=LABEL=/1 console=ttyS0,38400 console=tty0 slub_debug=FZPU
[...]
+Looks like a VIA chipset. Disabling IOMMU. Override with iommu=allowed
[...]

So I then booted with iommu=allowed.  No errors.  Can't hit the bug to
save my life.

Why would disabling iommu cause the atl1 driver to write over poisoned
memory?

Alexey, can you please try booting with iommu=allowed and see if you
avoid the problem?

Thanks,
Jay

^ permalink raw reply	[flat|nested] 71+ messages in thread

* Re: atl1 64-bit => 32-bit DMA borkage (reproducible, bisected)
  2008-05-09 19:51                                     ` Alexey Dobriyan
@ 2008-05-09 18:56                                       ` Chris Snook
  2008-05-09 20:07                                         ` Alexey Dobriyan
  0 siblings, 1 reply; 71+ messages in thread
From: Chris Snook @ 2008-05-09 18:56 UTC (permalink / raw
  To: Alexey Dobriyan
  Cc: Jay Cliburn, Luca Tettamanti, Jeff Garzik, Pekka Enberg,
	Andrew Morton, linux-kernel, netdev, Christoph Lameter, torvalds

Alexey Dobriyan wrote:
> Hmmm, there was a wonderful oops on interface stop here when the other end
> of atl1 cable was physically unplugged (but there was traffic before):
> 
> 	atl1_down
> 	atl1_clean_rx_ring
> 	swiotlb_unmap_single
> 	swiotlb_unmap_single_attrs
> 	memcpy_c
> 

Intel chip, or AMD?

-- Chris

^ permalink raw reply	[flat|nested] 71+ messages in thread

* Re: atl1 64-bit => 32-bit DMA borkage (reproducible, bisected)
  2008-05-09 20:07                                         ` Alexey Dobriyan
@ 2008-05-09 19:38                                           ` Jay Cliburn
  2008-05-10 19:31                                             ` [PATCH] " Alexey Dobriyan
  0 siblings, 1 reply; 71+ messages in thread
From: Jay Cliburn @ 2008-05-09 19:38 UTC (permalink / raw
  To: Alexey Dobriyan
  Cc: Chris Snook, Luca Tettamanti, Jeff Garzik, Andrew Morton,
	linux-kernel, netdev

[trimmed cc list slightly]

On Sat, 10 May 2008 00:07:15 +0400
Alexey Dobriyan <adobriyan@gmail.com> wrote:

> On Fri, May 09, 2008 at 02:56:21PM -0400, Chris Snook wrote:
> > Alexey Dobriyan wrote:
> >> Hmmm, there was a wonderful oops on interface stop here when the
> >> other end of atl1 cable was physically unplugged (but there was
> >> traffic before): atl1_down
> >> 	atl1_clean_rx_ring
> >> 	swiotlb_unmap_single
> >> 	swiotlb_unmap_single_attrs
> >> 	memcpy_c
> >
> > Intel chip, or AMD?
> 
> Intel(R) Core(TM)2 CPU          6400  @ 2.13GHz
> Asus P5B-E motherboard.
> 

I see the same thing with a Socked AM2-based board (Asus M2V) with 4GB
RAM installed. The problem occurs only when SWIOTLB is active, which
happens automatically at boot (in arch/x86/kernel/pci-swiotlb.c) when
the page frame number exceeds 1048576 (corresponding to 2^32 bytes).

I thought for awhile that the problem went away with iommu=allowed, but
I was wrong.

The bug appears to be a "simple" skb write-after-free that happens only
when bounce buffers are in use, but I'll be damned if I can find the
cause of it.

<continues looking>

=============================================================================
BUG kmalloc-2048: Poison overwritten
-----------------------------------------------------------------------------

INFO: 0xffff81010004297a-0xffff810100042f71. First byte 0x0 instead of 0x6b
INFO: Allocated in dev_alloc_skb+0x16/0x2c age=5813 cpu=0 pid=3029
INFO: Freed in skb_release_data+0xa8/0xad age=201 cpu=0 pid=0
INFO: Slab 0xffffe20005801600 objects=15 used=0 fp=0xffff810100045b18 flags=0x8000000000002082
INFO: Object 0xffff810100042968 @offset=10600 fp=0xffff8101000418d8

Bytes b4 0xffff810100042958:  aa 91 fd ff 00 00 00 00 5a 5a 5a 5a 5a 5a 5a 5a �.��....ZZZZZZZZ
  Object 0xffff810100042968:  6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b kkkkkkkkkkkkkkkk
  Object 0xffff810100042978:  6b 6b 00 17 31 4e 9d 41 00 0f db bc af 14 08 00 kk..1N.A..ۼ�...
  Object 0xffff810100042988:  45 00 00 4e 87 5e 00 00 40 11 6e 82 c0 a8 01 fe E..N.^..@.n.�������.�
  Object 0xffff810100042998:  c0 a8 01 70 00 89 00 89 00 3a 3b 67 00 09 00 00 ��.p.....:;g....
  Object 0xffff8101000429a8:  00 01 00 00 00 00 00 00 20 43 4b 41 41 41 41 41 .........CKAAAAA
  Object 0xffff8101000429b8:  41 41 41 41 41 41 41 41 41 41 41 41 41 41 41 41 AAAAAAAAAAAAAAAA
  Object 0xffff8101000429c8:  41 41 41 41 41 41 41 41 41 00 00 21 00 01 f0 53 AAAAAAAAA..!..
  Object 0xffff8101000429d8:  56 17 df 3e 3b 9f b7 1f 2d 29 f0 68 cf 4d 61 97 V.�>;.�.-)�h�Ma.
 Redzone 0xffff810100043168:  bb bb bb bb bb bb bb bb                         �𰻻������        
 Padding 0xffff8101000431a8:  5a 5a 5a 5a 5a 5a 5a 5a                         ZZZZZZZZ        
Pid: 3030, comm: ifconfig Not tainted 2.6.26-rc1 #3

Call Trace:
 [<ffffffff8108cf62>] print_trailer+0x123/0x12c
 [<ffffffff8108d00f>] check_bytes_and_report+0xa4/0xcb
 [<ffffffff8108d33e>] check_object+0xca/0x212
 [<ffffffff8108d6cd>] __free_slab+0x85/0xfd
 [<ffffffff811e5dd3>] ? skb_release_data+0xa8/0xad
 [<ffffffff8108d77d>] discard_slab+0x38/0x3a
 [<ffffffff8108e172>] __slab_free+0xdb/0x2ac
 [<ffffffff8108e47a>] kfree+0xbc/0xcb
 [<ffffffff811e5dd3>] ? skb_release_data+0xa8/0xad
 [<ffffffff811e5dd3>] skb_release_data+0xa8/0xad
 [<ffffffff811e6494>] skb_release_all+0xc9/0xce
 [<ffffffff811e5c2e>] __kfree_skb+0x11/0x78
 [<ffffffff811e5cbc>] kfree_skb+0x27/0x29
 [<ffffffffa00cc3aa>] :atl1:atl1_clean_rx_ring+0x7e/0xe2
 [<ffffffffa00cc4d7>] :atl1:atl1_down+0xc9/0xce
 [<ffffffffa00cedcd>] :atl1:atl1_close+0x18/0x27
 [<ffffffff811ebe2d>] dev_close+0x57/0x72
 [<ffffffff811ebb31>] dev_change_flags+0xa8/0x164
 [<ffffffff8122f44c>] devinet_ioctl+0x26a/0x5f6
 [<ffffffff8122fc79>] inet_ioctl+0x92/0xaa
 [<ffffffff811df6d4>] sock_ioctl+0x1da/0x202
 [<ffffffff8109f252>] vfs_ioctl+0x2a/0x77
 [<ffffffff8109f501>] do_vfs_ioctl+0x262/0x27f
 [<ffffffff8109f575>] sys_ioctl+0x57/0x7a
 [<ffffffff8100bff7>] tracesys+0xd5/0xda

FIX kmalloc-2048: Restoring 0xffff81010004297a-0xffff810100042f71=0x6b

^ permalink raw reply	[flat|nested] 71+ messages in thread

* Re: atl1 64-bit => 32-bit DMA borkage (reproducible, bisected)
  2008-05-06 16:02                                   ` Jay Cliburn
@ 2008-05-09 19:51                                     ` Alexey Dobriyan
  2008-05-09 18:56                                       ` Chris Snook
  0 siblings, 1 reply; 71+ messages in thread
From: Alexey Dobriyan @ 2008-05-09 19:51 UTC (permalink / raw
  To: Jay Cliburn
  Cc: Luca Tettamanti, Chris Snook, Jeff Garzik, Pekka Enberg,
	Andrew Morton, linux-kernel, netdev, Christoph Lameter, torvalds

Hmmm, there was a wonderful oops on interface stop here when the other end
of atl1 cable was physically unplugged (but there was traffic before):

	atl1_down
	atl1_clean_rx_ring
	swiotlb_unmap_single
	swiotlb_unmap_single_attrs
	memcpy_c


^ permalink raw reply	[flat|nested] 71+ messages in thread

* Re: atl1 64-bit => 32-bit DMA borkage (reproducible, bisected)
  2008-05-09 18:56                                       ` Chris Snook
@ 2008-05-09 20:07                                         ` Alexey Dobriyan
  2008-05-09 19:38                                           ` Jay Cliburn
  0 siblings, 1 reply; 71+ messages in thread
From: Alexey Dobriyan @ 2008-05-09 20:07 UTC (permalink / raw
  To: Chris Snook
  Cc: Jay Cliburn, Luca Tettamanti, Jeff Garzik, Pekka Enberg,
	Andrew Morton, linux-kernel, netdev, Christoph Lameter, torvalds

On Fri, May 09, 2008 at 02:56:21PM -0400, Chris Snook wrote:
> Alexey Dobriyan wrote:
>> Hmmm, there was a wonderful oops on interface stop here when the other end
>> of atl1 cable was physically unplugged (but there was traffic before):
>> 	atl1_down
>> 	atl1_clean_rx_ring
>> 	swiotlb_unmap_single
>> 	swiotlb_unmap_single_attrs
>> 	memcpy_c
>
> Intel chip, or AMD?

Intel(R) Core(TM)2 CPU          6400  @ 2.13GHz
Asus P5B-E motherboard.


^ permalink raw reply	[flat|nested] 71+ messages in thread

* [PATCH] Re: atl1 64-bit => 32-bit DMA borkage (reproducible, bisected)
  2008-05-09 19:38                                           ` Jay Cliburn
@ 2008-05-10 19:31                                             ` Alexey Dobriyan
  2008-05-11  1:58                                               ` Jay Cliburn
  0 siblings, 1 reply; 71+ messages in thread
From: Alexey Dobriyan @ 2008-05-10 19:31 UTC (permalink / raw
  To: Jay Cliburn
  Cc: Chris Snook, Luca Tettamanti, Jeff Garzik, Andrew Morton,
	linux-kernel, netdev

On Fri, May 09, 2008 at 02:38:54PM -0500, Jay Cliburn wrote:
> [trimmed cc list slightly]
> 
> On Sat, 10 May 2008 00:07:15 +0400
> Alexey Dobriyan <adobriyan@gmail.com> wrote:
> 
> > On Fri, May 09, 2008 at 02:56:21PM -0400, Chris Snook wrote:
> > > Alexey Dobriyan wrote:
> > >> Hmmm, there was a wonderful oops on interface stop here when the
> > >> other end of atl1 cable was physically unplugged (but there was
> > >> traffic before): atl1_down
> > >> 	atl1_clean_rx_ring
> > >> 	swiotlb_unmap_single
> > >> 	swiotlb_unmap_single_attrs
> > >> 	memcpy_c
> > >
> > > Intel chip, or AMD?
> > 
> > Intel(R) Core(TM)2 CPU          6400  @ 2.13GHz
> > Asus P5B-E motherboard.
> > 
> 
> I see the same thing with a Socked AM2-based board (Asus M2V) with 4GB
> RAM installed. The problem occurs only when SWIOTLB is active, which
> happens automatically at boot (in arch/x86/kernel/pci-swiotlb.c) when
> the page frame number exceeds 1048576 (corresponding to 2^32 bytes).
> 
> I thought for awhile that the problem went away with iommu=allowed, but
> I was wrong.
> 
> The bug appears to be a "simple" skb write-after-free that happens only
> when bounce buffers are in use, but I'll be damned if I can find the
> cause of it.
> 
> <continues looking>

Try this patch! If scared, remove swiotlb poisoning, I'm not entirely
sure it's correct, but it makes aforementioned second oops
deterministic.

--- a/drivers/net/atlx/atl1.c
+++ b/drivers/net/atlx/atl1.c
@@ -2027,6 +2029,7 @@ rrd_ok:
 		/* Good Receive */
 		pci_unmap_page(adapter->pdev, buffer_info->dma,
 			       buffer_info->length, PCI_DMA_FROMDEVICE);
+		buffer_info->dma = 0;
 		skb = buffer_info->skb;
 		length = le16_to_cpu(rrd->xsz.xsum_sz.pkt_size);
 
diff --git a/lib/swiotlb.c b/lib/swiotlb.c
index d568894..f6165ed 100644
--- a/lib/swiotlb.c
+++ b/lib/swiotlb.c
@@ -399,12 +399,14 @@ unmap_single(struct device *hwdev, char *dma_addr, size_t size, int dir)
 	/*
 	 * First, sync the memory before unmapping the entry
 	 */
-	if (buffer && ((dir == DMA_FROM_DEVICE) || (dir == DMA_BIDIRECTIONAL)))
+	if (buffer && ((dir == DMA_FROM_DEVICE) || (dir == DMA_BIDIRECTIONAL))) {
 		/*
 		 * bounce... copy the data back into the original buffer * and
 		 * delete the bounce buffer.
 		 */
 		memcpy(buffer, dma_addr, size);
+		io_tlb_orig_addr[index] = (void *)0x9a9a9a9a9a9a9a9aUL;
+	}
 
 	/*
 	 * Return the buffer to the free list by setting the corresponding


^ permalink raw reply related	[flat|nested] 71+ messages in thread

* Re: [PATCH] Re: atl1 64-bit => 32-bit DMA borkage (reproducible, bisected)
  2008-05-10 19:31                                             ` [PATCH] " Alexey Dobriyan
@ 2008-05-11  1:58                                               ` Jay Cliburn
  0 siblings, 0 replies; 71+ messages in thread
From: Jay Cliburn @ 2008-05-11  1:58 UTC (permalink / raw
  To: Alexey Dobriyan
  Cc: Chris Snook, Luca Tettamanti, Jeff Garzik, Andrew Morton,
	linux-kernel, netdev

On Sat, 10 May 2008 23:31:07 +0400
Alexey Dobriyan <adobriyan@gmail.com> wrote:

> On Fri, May 09, 2008 at 02:38:54PM -0500, Jay Cliburn wrote:

> > The bug appears to be a "simple" skb write-after-free that happens
> > only when bounce buffers are in use, but I'll be damned if I can
> > find the cause of it.
> > 
> > <continues looking>
> 
> Try this patch! If scared, remove swiotlb poisoning, I'm not entirely
> sure it's correct, but it makes aforementioned second oops
> deterministic.

Seems to fix it for me.  Nicely done, Alexey!  Thanks!

I looked at that blasted unmap a thousand times, but never noticed the
missing buffer_info->dma clear.

I'll get input from one more tester, and if it's positive, I'll submit
this to Jeff.

Thanks again.

> 
> --- a/drivers/net/atlx/atl1.c
> +++ b/drivers/net/atlx/atl1.c
> @@ -2027,6 +2029,7 @@ rrd_ok:
>  		/* Good Receive */
>  		pci_unmap_page(adapter->pdev, buffer_info->dma,
>  			       buffer_info->length,
> PCI_DMA_FROMDEVICE);
> +		buffer_info->dma = 0;
>  		skb = buffer_info->skb;
>  		length = le16_to_cpu(rrd->xsz.xsum_sz.pkt_size);
>  
> diff --git a/lib/swiotlb.c b/lib/swiotlb.c
> index d568894..f6165ed 100644
> --- a/lib/swiotlb.c
> +++ b/lib/swiotlb.c
> @@ -399,12 +399,14 @@ unmap_single(struct device *hwdev, char
> *dma_addr, size_t size, int dir) /*
>  	 * First, sync the memory before unmapping the entry
>  	 */
> -	if (buffer && ((dir == DMA_FROM_DEVICE) || (dir ==
> DMA_BIDIRECTIONAL)))
> +	if (buffer && ((dir == DMA_FROM_DEVICE) || (dir ==
> DMA_BIDIRECTIONAL))) { /*
>  		 * bounce... copy the data back into the original
> buffer * and
>  		 * delete the bounce buffer.
>  		 */
>  		memcpy(buffer, dma_addr, size);
> +		io_tlb_orig_addr[index] = (void
> *)0x9a9a9a9a9a9a9a9aUL;
> +	}
>  
>  	/*
>  	 * Return the buffer to the free list by setting the
> corresponding
> 

^ permalink raw reply	[flat|nested] 71+ messages in thread

end of thread, other threads:[~2008-05-11  1:59 UTC | newest]

Thread overview: 71+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2008-04-11  3:33 2.6.25-rc8-mm2 Andrew Morton
2008-04-11  6:28 ` 2.6.25-rc8-mm2: boot hang after "ACPI: using IOAPIC for interrupt routing" Alexey Dobriyan
2008-04-11  6:29 ` Alexey Dobriyan
2008-04-11  6:43   ` Pekka Enberg
2008-04-11 10:35   ` Pekka Enberg
2008-04-11 21:07     ` Alexey Dobriyan
2008-04-11 21:25       ` Pekka J Enberg
2008-04-11 23:09         ` Alexey Dobriyan
2008-04-11  9:57 ` 2.6.25-rc8-mm2 KAMEZAWA Hiroyuki
2008-04-11 10:23   ` 2.6.25-rc8-mm2 KAMEZAWA Hiroyuki
2008-04-11 10:34     ` 2.6.25-rc8-mm2 Pekka Enberg
2008-04-11 10:57       ` 2.6.25-rc8-mm2 KAMEZAWA Hiroyuki
2008-04-11 11:17         ` 2.6.25-rc8-mm2 KAMEZAWA Hiroyuki
2008-04-11 13:17           ` 2.6.25-rc8-mm2 KAMEZAWA Hiroyuki
2008-04-11 14:24             ` 2.6.25-rc8-mm2 Pekka Enberg
2008-04-14  3:13               ` 2.6.25-rc8-mm2 KAMEZAWA Hiroyuki
2008-04-11 13:40 ` 2.6.25-rc8-mm2 Adrian Bunk
2008-04-12  4:22   ` 2.6.25-rc8-mm2 Andrew Morton
2008-04-12 18:43     ` 2.6.25-rc8-mm2 Aneesh Kumar K.V
2008-04-13  7:45       ` 2.6.25-rc8-mm2 Michael Schmitz
2008-04-13  8:11         ` 2.6.25-rc8-mm2 Geert Uytterhoeven
2008-04-11 23:43 ` 2.6.25-rc8-mm2: panic involving mount_block_root and down the road Alexey Dobriyan
2008-04-12  0:59   ` 2.6.25-rc8-mm2: CONFIG_ATA_SFF: " Alexey Dobriyan
2008-04-12  6:52     ` Adrian Bunk
2008-04-12 10:02       ` Jeff Garzik
2008-04-13  0:45         ` Tejun Heo
2008-04-13  0:51         ` [mm patch] select ATA_SFF Adrian Bunk
2008-04-12  5:53 ` 2.6.25-rc8-mm2 Valdis.Kletnieks
2008-04-12  6:07   ` 2.6.25-rc8-mm2 Andrew Morton
2008-04-12  8:52 ` 2.6.25-rc8-mm2 - CONFIG_PROFILE_LIKELY broke again Valdis.Kletnieks
2008-04-12  9:41 ` 2.6.25-rc8-mm2 Valdis.Kletnieks
2008-04-12 10:47 ` 2.6.25-rc8-mm2 - ftraced chews 100% of a CPU Valdis.Kletnieks
2008-04-13 20:44 ` 2.6.25-rc8-mm2: IP: [<ffffffff802868f9>] __kmalloc+0x69/0x110 Alexey Dobriyan
2008-04-13 20:53   ` Andrew Morton
2008-04-14 16:18     ` Pekka Enberg
2008-04-14 18:00     ` Christoph Lameter
2008-04-14  9:51   ` Pekka J Enberg
2008-04-14 16:01   ` Pekka Enberg
2008-04-14 17:53     ` Christoph Lameter
2008-04-14 17:57       ` Christoph Lameter
2008-04-14 18:32       ` Alexey Dobriyan
2008-04-14 19:56         ` 2.6.25-rc8-mm2: FIX kmalloc-2048 (was Re: 2.6.25-rc8-mm2: IP: [<ffffffff802868f9>] __kmalloc+0x69/0x110) Alexey Dobriyan
2008-04-14 20:05           ` Christoph Lameter
2008-04-19 11:17             ` Alexey Dobriyan
2008-04-19 14:45               ` atl1 64-bit => 32-bit DMA borkage (reproducible, bisected) Alexey Dobriyan
2008-04-20  2:54                 ` Jay Cliburn
2008-04-20 11:14                   ` Alexey Dobriyan
2008-04-20 11:06                     ` Jay Cliburn
2008-04-20 12:26                       ` Alexey Dobriyan
2008-04-20 18:37                         ` Jay Cliburn
2008-04-20 20:55                           ` Alexey Dobriyan
2008-04-21 18:42                             ` Chris Snook
2008-04-21 19:56                               ` Alexey Dobriyan
2008-04-22  2:08                             ` Jay Cliburn
2008-04-22 19:02                               ` Alexey Dobriyan
2008-04-26  0:57                               ` Jay Cliburn
2008-04-28  6:42                                 ` Alexey Dobriyan
2008-05-04 21:15                                 ` Alexey Dobriyan
2008-05-05  0:31                                   ` Jay Cliburn
2008-05-05  0:34                                     ` Jay Cliburn
2008-05-06 16:02                                   ` Jay Cliburn
2008-05-09 19:51                                     ` Alexey Dobriyan
2008-05-09 18:56                                       ` Chris Snook
2008-05-09 20:07                                         ` Alexey Dobriyan
2008-05-09 19:38                                           ` Jay Cliburn
2008-05-10 19:31                                             ` [PATCH] " Alexey Dobriyan
2008-05-11  1:58                                               ` Jay Cliburn
2008-04-14  8:07 ` BUG at __dentry_open [Was: 2.6.25-rc8-mm2] Jiri Slaby
2008-04-15 17:00 ` 2.6.25-rc8-mm2: Observed problems: Not a detailed bug report Zan Lynx
2008-04-15 18:42   ` Andrew Morton
2008-04-15 20:12     ` Rafael J. Wysocki

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.