| Anonymous | Login | Signup for a new account | 2009-11-22 06:42 UTC |
| Main | My View | View Issues | Roadmap | Docs |
| Viewing Issue Simple Details [ Jump to Notes ] | [ View Advanced ] [ Issue History ] [ Print ] | |||||||||||
| ID | Category | Severity | Reproducibility | Date Submitted | Last Update | |||||||
| 0003968 | [CentOS-5] kernel | crash | always | 2009-11-02 23:14 | 2009-11-18 14:52 | |||||||
| Reporter | ps7776 | View Status | public | |||||||||
| Assigned To | ||||||||||||
| Priority | normal | Resolution | open | |||||||||
| Status | new | Product Version | 5.3 | |||||||||
| Summary | 0003968: low memory exhaustion cases oom-killer to be invoked | |||||||||||
| Description |
Fairly heavily loaded server with 8 CPUs, 16 GB of RAM, 64 GB of swap running lots of memory intensive stuff plus httpd, nfs, mysql, dovecot etc. For legacy reasons we are using 32 bit PAE kernels ( which may be part of the problem ...) . System has been very stable ( normal uptime ~ 90 days ) but started crashing every few days after upgrading to 2.26.18-164 kernel. Basically what would happen is that the kernel would run fine for a few days rarely touching swap at all with LowFree hovering around 9M . However, any spike in demand for low memory driving LowFree below ~8K would start a chain reaction with oom-killer being invoked and the kernel eventually deadlocking with LowFree = 0 . The kernel was unable to free up enough low memory to bounce low memory to high memory. Increasing swappiness to close to 100 had no effect - the kernel rarely used swap. Setting overcommit_memory to 2 had no effect either and lowering lowmem_reserve_ratio ( to fex. 64 64 8 ) didn't result in any increase in LowFree Downgrading to 2.18.18-128.1.16 appears to improve the situation ( no crash yet ... ) in that LowFree is kept much, much larger ( around 320M ). So - what changed in the low memory allocation from 2.6.18-128 to 2.18.6-164 ? |
|||||||||||
| Additional Information |
|
|||||||||||
| Tags | No tags attached. | |||||||||||
| Attached Files | ||||||||||||
|
|
||||||||||||
Notes |
|
|
(0010239) jhughes@hughesjr.com (administrator) 2009-11-02 23:25 |
I have had issues with VMWare host server and running out of memory, maybe try setting this variable in sysctl.conf: vm.min_free_kbytes=307200 That will try to maintain 300MB of free RAM. |
|
(0010279) ps7776 (reporter) 2009-11-04 15:37 |
After running the 2.6.18-128 PAE kernel for a while I now see that it actually behaves the same way as -164 kernel - at some point it re-balances memory such that LowFree is reduced to around 8K . However, increasing vm.min_free_kbytes from the default ~4K to 32K has a definite effect on LowFree . Maybe this isn't really a bug - it may just be a question of high demands from the hardware ( 3 gigabit ethernet cards + a fast RAID array ) capable of sustained rates of 80 Mb/s . Hmmm ... |
|
(0010280) smooge (developer) 2009-11-04 15:56 |
PAE kernels do weird things to get an architecture that is 4GB limited to work on more than that. It gets even weirder depending what the underlying CPU is (the new Nehamim allocates memory in 2GB increments per MPU which means your box works better in 6GB increments versus 4 GB like older CPUs) You might want to look at slabtop to get an idea of what is using that memory. |
|
(0010286) ps7776 (reporter) 2009-11-04 21:44 |
Right - I know that the PAE kernel is a bit of a kludge and the long term plan is to migrate everything to 64 bits. This box has two Quad-Core ( 2532 ) AMD Opterons. I'm not sure what happens when things go wrong but during "normal" running nfs4d dominates by far : 429396 429396 100% 0.34K 39036 11 156144K nfsd4_stateowners 691560 512891 74% 0.05K 9605 72 38420K buffer_head 72416 64012 88% 0.48K 9052 8 36208K ext3_inode_cache 404443 404440 99% 0.07K 7631 53 30524K nfsd4_stateids 165039 162169 98% 0.13K 5691 29 22764K dentry_cache 59402 59402 100% 0.27K 4243 14 16972K radix_tree_node 435502 435396 99% 0.03K 3854 113 15416K size-32 729 685 93% 4.00K 729 1 2916K pmd 604 589 97% 4.00K 604 1 2416K size-4096 17468 16368 93% 0.09K 397 44 1588K vm_area_struct 670 653 97% 2.00K 335 2 1340K size-2048 6240 5337 85% 0.19K 312 20 1248K filp 1568 1527 97% 0.50K 196 8 784K size-512 10089 5999 59% 0.06K 171 59 684K size-64 1683 1490 88% 0.33K 153 11 612K inode_cache 1617 1585 98% 0.35K 147 11 588K proc_inode_cache 423 420 99% 1.33K 141 3 564K task_struct 372 372 100% 1.31K 124 3 496K sighand_cache 392 392 100% 1.00K 98 4 392K size-1024 2730 2445 89% 0.12K 91 30 364K size-128 1170 1127 96% 0.25K 78 15 312K size-256 5928 5856 98% 0.05K 76 78 304K sysfs_dir_cache 73 65 89% 4.00K 73 1 292K jbd_4k 432 399 92% 0.44K 48 9 192K signal_cache 432 380 87% 0.43K 48 9 192K shmem_inode_cache 264 256 96% 0.68K 24 11 192K raid5/md0 |
|
(0010287) smooge (developer) 2009-11-04 22:17 |
Another thing that came up in LWN today was turning off the kernel overcommit. The fact that the 'vm.overcommit_memory = 2' didnt fix things kind of makes it more PAE like. Are you using NFSv4+ on the system? |
|
(0010289) ps7776 (reporter) 2009-11-04 22:41 |
Yes - see the note I added a few minutes ago. There are some twenty other boxes auto mounting directories at any given time . |
|
(0010330) ps7776 (reporter) 2009-11-10 18:01 |
I tried setting min_free_kbytes to 32768 and the 2.6.18-128 kernel ran for 7 days before going on its oom-killer spree . What puzzles me is that there isn't much difference in /proc/meminfo over the Note: 0000030 second interval that the first ( ~720 MB ) job was killed : Before : After : MemTotal: 16635504 kB MemTotal: 16635504 kB MemFree: 12062112 kB MemFree: 13484684 kB Buffers: 1900 kB Buffers: 1848 kB Cached: 2052840 kB Cached: 1340096 kB SwapCached: 0 kB SwapCached: 0 kB Active: 2533416 kB Active: 1600648 kB Inactive: 1254116 kB Inactive: 764248 kB HighTotal: 15859520 kB HighTotal: 15859520 kB HighFree: 12024792 kB HighFree: 13447952 kB LowTotal: 775984 kB LowTotal: 775984 kB LowFree: 37320 kB LowFree: 36732 kB SwapTotal: 61697624 kB SwapTotal: 61697624 kB SwapFree: 61697492 kB SwapFree: 61697492 kB Dirty: 288 kB Dirty: 252 kB Writeback: 12 kB Writeback: 0 kB AnonPages: 1732912 kB AnonPages: 1023000 kB Mapped: 72896 kB Mapped: 86956 kB Slab: 723336 kB Slab: 723472 kB PageTables: 12284 kB PageTables: 11136 kB NFS_Unstable: 0 kB NFS_Unstable: 0 kB Bounce: 0 kB Bounce: 0 kB CommitLimit: 70015376 kB CommitLimit: 70015376 kB Committed_AS: 2271788 kB Committed_AS: 1560112 kB VmallocTotal: 116728 kB VmallocTotal: 116728 kB VmallocUsed: 33656 kB VmallocUsed: 33656 kB VmallocChunk: 82492 kB VmallocChunk: 82492 kB HugePages_Total: 0 HugePages_Total: 0 HugePages_Free: 0 HugePages_Free: 0 HugePages_Rsvd: 0 HugePages_Rsvd: 0 Hugepagesize: 2048 kB Hugepagesize: 2048 kB In fact, there is no difference in low memory. Here is the output from /var/log/messages Nov 10 00:00:07 owl kernel: sshd invoked oom-killer: gfp_mask=0xd0, order=0, oomkilladj=0 Nov 10 00:00:07 owl kernel: [<c0457add>] out_of_memory+0x72/0x1a5 Nov 10 00:00:07 owl kernel: [<c0458fcf>] __alloc_pages+0x216/0x297 Nov 10 00:00:07 owl kernel: [<c046e31a>] cache_alloc_refill+0x26d/0x450 Nov 10 00:00:07 owl kernel: [<c046e0a3>] kmem_cache_alloc+0x41/0x4b Nov 10 00:00:07 owl kernel: [<c047e920>] getname+0x1a/0xb0 Nov 10 00:00:07 owl kernel: [<c04706ad>] do_sys_open+0x12/0xae Nov 10 00:00:07 owl kernel: [<c0470776>] sys_open+0x16/0x18 Nov 10 00:00:07 owl kernel: [<c0404ead>] sysenter_past_esp+0x56/0x79 Nov 10 00:00:07 owl kernel: ======================= Nov 10 00:00:07 owl kernel: Mem-info: Nov 10 00:00:07 owl kernel: DMA per-cpu: Nov 10 00:00:07 owl kernel: cpu 0 hot: high 0, batch 1 used:0 Nov 10 00:00:07 owl kernel: cpu 0 cold: high 0, batch 1 used:0 Nov 10 00:00:07 owl kernel: cpu 1 hot: high 0, batch 1 used:0 Nov 10 00:00:07 owl kernel: cpu 1 cold: high 0, batch 1 used:0 Nov 10 00:00:07 owl kernel: cpu 2 hot: high 0, batch 1 used:0 Nov 10 00:00:07 owl kernel: cpu 2 cold: high 0, batch 1 used:0 Nov 10 00:00:07 owl kernel: cpu 3 hot: high 0, batch 1 used:0 Nov 10 00:00:07 owl kernel: cpu 3 cold: high 0, batch 1 used:0 Nov 10 00:00:07 owl kernel: cpu 4 hot: high 0, batch 1 used:0 Nov 10 00:00:07 owl kernel: cpu 4 cold: high 0, batch 1 used:0 Nov 10 00:00:07 owl kernel: cpu 5 hot: high 0, batch 1 used:0 Nov 10 00:00:07 owl kernel: cpu 5 cold: high 0, batch 1 used:0 Nov 10 00:00:07 owl kernel: cpu 6 hot: high 0, batch 1 used:0 Nov 10 00:00:07 owl kernel: cpu 6 cold: high 0, batch 1 used:0 Nov 10 00:00:07 owl kernel: cpu 7 hot: high 0, batch 1 used:0 Nov 10 00:00:07 owl kernel: cpu 7 cold: high 0, batch 1 used:0 Nov 10 00:00:07 owl kernel: DMA32 per-cpu: empty Nov 10 00:00:07 owl kernel: Normal per-cpu: Nov 10 00:00:07 owl kernel: cpu 0 hot: high 186, batch 31 used:68 Nov 10 00:00:07 owl kernel: cpu 0 cold: high 62, batch 15 used:58 Nov 10 00:00:07 owl kernel: cpu 1 hot: high 186, batch 31 used:31 Nov 10 00:00:08 owl kernel: cpu 1 cold: high 62, batch 15 used:56 Nov 10 00:00:08 owl kernel: cpu 2 hot: high 186, batch 31 used:30 Nov 10 00:00:08 owl kernel: cpu 2 cold: high 62, batch 15 used:53 Nov 10 00:00:08 owl kernel: cpu 3 hot: high 186, batch 31 used:29 Nov 10 00:00:08 owl kernel: cpu 3 cold: high 62, batch 15 used:59 Nov 10 00:00:08 owl kernel: cpu 4 hot: high 186, batch 31 used:53 Nov 10 00:00:08 owl kernel: cpu 4 cold: high 62, batch 15 used:60 Nov 10 00:00:08 owl kernel: cpu 5 hot: high 186, batch 31 used:28 Nov 10 00:00:08 owl kernel: cpu 5 cold: high 62, batch 15 used:47 Nov 10 00:00:08 owl kernel: cpu 6 hot: high 186, batch 31 used:174 Nov 10 00:00:08 owl kernel: cpu 6 cold: high 62, batch 15 used:48 Nov 10 00:00:08 owl kernel: cpu 7 hot: high 186, batch 31 used:21 Nov 10 00:00:08 owl kernel: cpu 7 cold: high 62, batch 15 used:50 Nov 10 00:00:08 owl kernel: HighMem per-cpu: Nov 10 00:00:08 owl kernel: cpu 0 hot: high 186, batch 31 used:111 Nov 10 00:00:08 owl kernel: cpu 0 cold: high 62, batch 15 used:8 Nov 10 00:00:08 owl kernel: cpu 1 hot: high 186, batch 31 used:180 Nov 10 00:00:08 owl kernel: cpu 1 cold: high 62, batch 15 used:8 Nov 10 00:00:08 owl kernel: cpu 2 hot: high 186, batch 31 used:27 Nov 10 00:00:08 owl kernel: cpu 2 cold: high 62, batch 15 used:10 Nov 10 00:00:08 owl kernel: cpu 3 hot: high 186, batch 31 used:26 Nov 10 00:00:08 owl kernel: cpu 3 cold: high 62, batch 15 used:14 Nov 10 00:00:08 owl kernel: cpu 4 hot: high 186, batch 31 used:162 Nov 10 00:00:08 owl kernel: cpu 4 cold: high 62, batch 15 used:9 Nov 10 00:00:08 owl kernel: cpu 5 hot: high 186, batch 31 used:11 Nov 10 00:00:08 owl kernel: cpu 5 cold: high 62, batch 15 used:7 Nov 10 00:00:08 owl kernel: cpu 6 hot: high 186, batch 31 used:133 Nov 10 00:00:08 owl kernel: cpu 6 cold: high 62, batch 15 used:8 Nov 10 00:00:08 owl kernel: cpu 7 hot: high 186, batch 31 used:31 Nov 10 00:00:08 owl kernel: cpu 7 cold: high 62, batch 15 used:0 Nov 10 00:00:08 owl kernel: Free pages: 12019504kB (11983320kB HighMem) Nov 10 00:00:08 owl kernel: Active:636501 inactive:320862 dirty:0 writeback:0 unstable:0 free:3004876 slab:180927 Nov 10 00:00:08 owl kernel: DMA free:4104kB min:584kB low:728kB high:876kB active:0kB inactive:36kB present:16384kB pages_scanned:155 all_unreclaimable? yes Nov 10 00:00:08 owl kernel: lowmem_reserve[]: 0 0 880 17008 Nov 10 00:00:08 owl kernel: DMA32 free:0kB min:0kB low:0kB high:0kB active:0kB inactive:0kB present:0kB pages_scanned:0 all_unreclaimable? no Nov 10 00:00:08 owl kernel: lowmem_reserve[]: 0 0 880 17008 Nov 10 00:00:08 owl kernel: Normal free:32080kB min:32180kB low:40224kB high:48268kB active:1252kB inactive:1128kB present:901120kB pages_scanned:3486 all_unreclaimable? yes Nov 10 00:00:08 owl kernel: lowmem_reserve[]: 0 0 0 129024 Nov 10 00:00:08 owl kernel: HighMem free:11983320kB min:512kB low:147968kB high:295424kB active:2544756kB inactive:1282284kB present:16515072kB pages_scanned:0 all_unreclaimable? no Nov 10 00:00:08 owl kernel: lowmem_reserve[]: 0 0 0 0 Nov 10 00:00:08 owl kernel: DMA: 0*4kB 1*8kB 14*16kB 11*32kB 1*64kB 1*128kB 1*256kB 0*512kB 1*1024kB 1*2048kB 0*4096kB = 4104kB Nov 10 00:00:08 owl kernel: DMA32: empty Nov 10 00:00:08 owl kernel: Normal: 0*4kB 0*8kB 365*16kB 670*32kB 19*64kB 0*128kB 0*256kB 1*512kB 1*1024kB 1*2048kB 0*4096kB = 32080kB Nov 10 00:00:08 owl kernel: HighMem: 0*4kB 31751*8kB 33178*16kB 50726*32kB 22935*64kB 9125*128kB 9573*256kB 4843*512kB 1290*1024kB 258*2048kB 39*4096kB = 11983320kB Nov 10 00:00:08 owl kernel: 520978 pagecache pages Nov 10 00:00:08 owl kernel: Swap cache: add 51, delete 51, find 0/0, race 0+0 Nov 10 00:00:08 owl kernel: Free swap = 61697492kB Nov 10 00:00:08 owl kernel: Total swap = 61697624kB Nov 10 00:00:08 owl kernel: Free swap: 61697492kB Nov 10 00:00:08 owl kernel: 4358144 pages of RAM Nov 10 00:00:08 owl kernel: 4128768 pages of HIGHMEM Nov 10 00:00:08 owl kernel: 199268 reserved pages Nov 10 00:00:08 owl kernel: 559738 pages shared Nov 10 00:00:08 owl kernel: 0 pages swap cached Nov 10 00:00:08 owl kernel: 57 pages dirty Nov 10 00:00:08 owl kernel: 27 pages writeback Nov 10 00:00:08 owl kernel: 21378 pages mapped Nov 10 00:00:08 owl kernel: 180927 pages slab mapped-file:21378 mapped-anon:436264 pagetables:3153Nov 10 00:00:08 owl kernel: lowmem_reserve[]: 0 0 880 17008 Nov 10 00:00:08 owl kernel: DMA32 free:0kB min:0kB low:0kB high:0kB active:0kB inactive:0kB present:0kB pages_scanned:0 all_unreclaimable? no Nov 10 00:00:08 owl kernel: lowmem_reserve[]: 0 0 880 17008 Nov 10 00:00:08 owl kernel: Normal free:32080kB min:32180kB low:40224kB high:48268kB active:1252kB inactive:1128kB prese nt:901120kB pages_scanned:3486 all_unreclaimable? yes Nov 10 00:00:08 owl kernel: lowmem_reserve[]: 0 0 0 129024 Nov 10 00:00:08 owl kernel: HighMem free:11983320kB min:512kB low:147968kB high:295424kB active:2544756kB inactive:1282284kB present:16515072kB pages_scanned:0 all_unreclaimable? no Nov 10 00:00:08 owl kernel: lowmem_reserve[]: 0 0 0 0 Nov 10 00:00:08 owl kernel: DMA: 0*4kB 1*8kB 14*16kB 11*32kB 1*64kB 1*128kB 1*256kB 0*512kB 1*1024kB 1*2048kB 0*4096kB = 4104kB Nov 10 00:00:08 owl kernel: DMA32: empty Nov 10 00:00:08 owl kernel: Normal: 0*4kB 0*8kB 365*16kB 670*32kB 19*64kB 0*128kB 0*256kB 1*512kB 1*1024kB 1*2048kB 0*4096kB = 32080kB Nov 10 00:00:08 owl kernel: HighMem: 0*4kB 31751*8kB 33178*16kB 50726*32kB 22935*64kB 9125*128kB 9573*256kB 4843*512kB 1290*1024kB 258*2048kB 39*4096kB = 11983320kB Nov 10 00:00:08 owl kernel: 520978 pagecache pages Nov 10 00:00:08 owl kernel: Swap cache: add 51, delete 51, find 0/0, race 0+0 Nov 10 00:00:08 owl kernel: Free swap = 61697492kB Nov 10 00:00:08 owl kernel: Total swap = 61697624kB Nov 10 00:00:08 owl kernel: Free swap: 61697492kB Nov 10 00:00:08 owl kernel: 4358144 pages of RAM Nov 10 00:00:08 owl kernel: 4128768 pages of HIGHMEM Nov 10 00:00:08 owl kernel: 199268 reserved pages Nov 10 00:00:08 owl kernel: 559738 pages shared Nov 10 00:00:08 owl kernel: 0 pages swap cached Nov 10 00:00:08 owl kernel: 57 pages dirty Nov 10 00:00:08 owl kernel: 27 pages writeback Nov 10 00:00:08 owl kernel: 21378 pages mapped Nov 10 00:00:08 owl kernel: 180927 pages slab Nov 10 00:00:08 owl kernel: 3153 pages pagetables Nov 10 00:00:08 owl kernel: Out of memory: Killed process 9970 (neutron_counter). I'll try increasing vm.min_free_kbytes . Anything esle I should try ? ( Apart from switching to a 64 bit kernel ) |
|
(0010353) ps7776 (reporter) 2009-11-12 16:02 |
Have tried various values of vm.min_free_kbytes but all to no avail. oom-killer eventually goes on a rampage no matter what the value is. However, monitoring /proc/slabinfo makes me suspect nfs4d : the number of state_owners and state_ids is huge when oom-killer kicks in ( around 1.5M ) . It just keeps going up and up as time goes by. This problem has actually been reported on the NFS4 mailing list ( http://linux-nfs.org/pipermail/nfsv4/2009-August/011050.html [^] ) |
|
(0010378) Zool (reporter) 2009-11-18 09:40 |
i have seen this error, too. Nov 18 00:45:59 ws-web1 kernel: httpd invoked oom-killer: gfp_mask=0x201d2, order=0, oomkilladj=0 Nov 18 00:45:59 ws-web1 kernel: Nov 18 00:45:59 ws-web1 kernel: Call Trace: Nov 18 00:45:59 ws-web1 kernel: [<ffffffff802beb38>] out_of_memory+0x8b/0x203 Nov 18 00:45:59 ws-web1 kernel: [<ffffffff8020f825>] __alloc_pages+0x245/0x2ce Nov 18 00:45:59 ws-web1 kernel: [<ffffffff8021377f>] __do_page_cache_readahead+0xc6/0x1ab Nov 18 00:45:59 ws-web1 kernel: [<ffffffff80214015>] filemap_nopage+0x14c/0x360 Nov 18 00:45:59 ws-web1 kernel: [<ffffffff80208ebc>] __handle_mm_fault+0x443/0x1337 Nov 18 00:45:59 ws-web1 kernel: [<ffffffff8020cb42>] _atomic_dec_and_lock+0x39/0x57 Nov 18 00:45:59 ws-web1 kernel: [<ffffffff8026766a>] do_page_fault+0xf7b/0x12e0 Nov 18 00:45:59 ws-web1 kernel: [<ffffffff8026ef1b>] monotonic_clock+0x35/0x7b Nov 18 00:45:59 ws-web1 kernel: [<ffffffff80234dc4>] do_setitimer+0x45f/0x4c7 Nov 18 00:45:59 ws-web1 kernel: [<ffffffff80264915>] _spin_lock_irq+0x9/0x14 Nov 18 00:45:59 ws-web1 kernel: [<ffffffff80229332>] do_sigaction+0x18c/0x1a0 Nov 18 00:45:59 ws-web1 kernel: [<ffffffff8026082b>] error_exit+0x0/0x6e Nov 18 00:45:59 ws-web1 kernel: Nov 18 00:45:59 ws-web1 kernel: Mem-info: Nov 18 00:45:59 ws-web1 kernel: DMA per-cpu: Nov 18 00:45:59 ws-web1 kernel: cpu 0 hot: high 186, batch 31 used:26 Nov 18 00:45:59 ws-web1 kernel: cpu 0 cold: high 62, batch 15 used:53 Nov 18 00:45:59 ws-web1 kernel: cpu 1 hot: high 186, batch 31 used:22 Nov 18 00:45:59 ws-web1 kernel: cpu 1 cold: high 62, batch 15 used:58 Nov 18 00:45:59 ws-web1 kernel: DMA32 per-cpu: empty Nov 18 00:45:59 ws-web1 kernel: Normal per-cpu: empty Nov 18 00:45:59 ws-web1 kernel: HighMem per-cpu: empty Nov 18 00:45:59 ws-web1 kernel: Free pages: 9024kB (0kB HighMem) Nov 18 00:45:59 ws-web1 kernel: Active:650516 inactive:598639 dirty:0 writeback:0 unstable:0 free:2256 slab:8569 mapped-file:1 mapped-anon:485589 pagetables:13805 Nov 18 00:45:59 ws-web1 kernel: DMA free:9024kB min:9156kB low:11444kB high:13732kB active:2601940kB inactive:2394556kB present:5242880kB pages_scanned:20462072 all_unreclaimable? yes Nov 18 00:45:59 ws-web1 kernel: lowmem_reserve[]: 0 0 0 0 Nov 18 00:45:59 ws-web1 kernel: DMA32 free:0kB min:0kB low:0kB high:0kB active:0kB inactive:0kB present:0kB pages_scanned:0 all_unreclaimable? no Nov 18 00:45:59 ws-web1 kernel: lowmem_reserve[]: 0 0 0 0 Nov 18 00:45:59 ws-web1 kernel: Normal free:0kB min:0kB low:0kB high:0kB active:0kB inactive:0kB present:0kB pages_scanned:0 all_unreclaimable? no Nov 18 00:45:59 ws-web1 kernel: lowmem_reserve[]: 0 0 0 0 Nov 18 00:45:59 ws-web1 kernel: HighMem free:0kB min:128kB low:128kB high:128kB active:0kB inactive:0kB present:0kB pages_scanned:0 all_unreclaimable? no Nov 18 00:45:59 ws-web1 kernel: lowmem_reserve[]: 0 0 0 0 Nov 18 00:45:59 ws-web1 kernel: DMA: 4*4kB 0*8kB 3*16kB 6*32kB 11*64kB 3*128kB 2*256kB 2*512kB 2*1024kB 0*2048kB 1*4096kB = 9024kB Nov 18 00:45:59 ws-web1 kernel: DMA32: empty Nov 18 00:45:59 ws-web1 kernel: Normal: empty Nov 18 00:45:59 ws-web1 kernel: HighMem: empty Nov 18 00:45:59 ws-web1 kernel: 764206 pagecache pages Nov 18 00:45:59 ws-web1 kernel: Swap cache: add 7352701, delete 7352708, find 2397646/3001624, race 0+53 Nov 18 00:45:59 ws-web1 kernel: Free swap = 0kB Nov 18 00:45:59 ws-web1 kernel: Total swap = 3149392kB Nov 18 00:45:59 ws-web1 kernel: Free swap: 0kB Nov 18 00:45:59 ws-web1 kernel: 1310720 pages of RAM Nov 18 00:45:59 ws-web1 kernel: 29034 reserved pages Nov 18 00:45:59 ws-web1 kernel: 104002 pages shared Nov 18 00:45:59 ws-web1 kernel: 1 pages swap cached Nov 18 02:41:41 ws-web1 kernel: httpd[20949]: segfault at 00007fffb8f8bf80 rip 00002abf1e781b0d rsp 00007fffb8f8bf60 error 6 Linux ws-web1 2.6.18-164.el5xen 0000001 SMP Thu Sep 3 04:03:03 EDT 2009 x86_64 x86_64 x86_64 GNU/Linux [root@ws-web1 httpd]# free -m total used free shared buffers cached Mem: 5120 5106 13 0 64 2173 -/+ buffers/cache: 2869 2250 Swap: 3075 451 2623 |
|
(0010380) ps7776 (reporter) 2009-11-18 14:52 |
I believe this issue has been fixed in kernel 2.6.18-164.6.1 . At least there are comments in the release notes to that effect . So I installed this kernel even though it is not part of 5.3 ( it is in 5.4 updates only ) . I didn't install any other packages from 5.4 . Has been running without any problems for a week now. The number of NFS stateowners is reasonable and the memory used is believable. Whether or not there still is a very small memory leak is hard to tell. I think the number more or less scales with the number of cached inodes which sort of sounds reasonable. |
Issue History |
|||
| Date Modified | Username | Field | Change |
| 2009-11-02 23:14 | ps7776 | New Issue | |
| 2009-11-02 23:14 | ps7776 | Assigned To | => kbsingh@karan.org |
| 2009-11-02 23:25 | jhughes@hughesjr.com | Note Added: 0010239 | |
| 2009-11-04 15:37 | ps7776 | Note Added: 0010279 | |
| 2009-11-04 15:56 | smooge | Note Added: 0010280 | |
| 2009-11-04 21:44 | ps7776 | Note Added: 0010286 | |
| 2009-11-04 22:17 | smooge | Note Added: 0010287 | |
| 2009-11-04 22:41 | ps7776 | Note Added: 0010289 | |
| 2009-11-10 18:01 | ps7776 | Note Added: 0010330 | |
| 2009-11-12 16:02 | ps7776 | Note Added: 0010353 | |
| 2009-11-18 09:40 | Zool | Note Added: 0010378 | |
| 2009-11-18 09:50 | Zool | Issue Monitored: Zool | |
| 2009-11-18 14:52 | ps7776 | Note Added: 0010380 | |
| Copyright © 2000 - 2009 Mantis Group |