Hallo,
jetzt ist wieder mal der Server total lahm, wie es mehrmals in der Woche passiert. Seiten bauen sich nur noch sehr sehr langsam auf.
Das MaxClients Limit wurde erreicht (eingestellt 512):
In der apache2 error_log gehts mit dem Performanceeinfall wieder los mit:
... und endet mit:
Dann noch diese Statistik:
dann noch:
Diese Performanceeinbrüche passieren oft und nerven. Was kann die Ursache dafür sein? Wie kann man herausbekommen, wer oder was die child process Fehlermeldungen erzeugt?
Schon seit vielen Tagen bin ich ratlos und ich würde mich riesig freuen, wenn dieses Problem irgendwann gelöst werden könnte.
Gruß,
jetzt ist wieder mal der Server total lahm, wie es mehrmals in der Woche passiert. Seiten bauen sich nur noch sehr sehr langsam auf.
Das MaxClients Limit wurde erreicht (eingestellt 512):
Code:
ps ax | grep apache2 | wc -l
514
--------------------------------------
Code:
[Sun Aug 27 15:53:47 2006] [warn] child process 31054 still did not exit, sending a SIGTERM
[Sun Aug 27 15:53:47 2006] [warn] child process 12587 still did not exit, sending a SIGTERM
[Sun Aug 27 15:53:47 2006] [warn] child process 7875 still did not exit, sending a SIGTERM
[Sun Aug 27 15:53:47 2006] [warn] child process 30734 still did not exit, sending a SIGTERM
[Sun Aug 27 15:53:47 2006] [warn] child process 7578 still did not exit, sending a SIGTERM
[Sun Aug 27 15:53:47 2006] [warn] child process 6815 still did not exit, sending a SIGTERM
[Sun Aug 27 15:53:47 2006] [warn] child process 2604 still did not exit, sending a SIGTERM
Code:
[Sun Aug 27 15:54:02 2006] [error] child process 9590 still did not exit, sending a SIGKILL
[Sun Aug 27 15:54:02 2006] [error] child process 12721 still did not exit, sending a SIGKILL
[Sun Aug 27 15:54:02 2006] [error] child process 9897 still did not exit, sending a SIGKILL
[Sun Aug 27 15:54:02 2006] [error] child process 9951 still did not exit, sending a SIGKILL
[Sun Aug 27 15:54:02 2006] [error] child process 11906 still did not exit, sending a SIGKILL
[Sun Aug 27 15:54:02 2006] [error] child process 10234 still did not exit, sending a SIGKILL
[Sun Aug 27 15:54:02 2006] [error] child process 10237 still did not exit, sending a SIGKILL
[Sun Aug 27 15:54:02 2006] [error] child process 10239 still did not exit, sending a SIGKILL
[Sun Aug 27 15:54:02 2006] [error] child process 10682 still did not exit, sending a SIGKILL
[Sun Aug 27 15:54:02 2006] [error] child process 11909 still did not exit, sending a SIGKILL
[Sun Aug 27 15:54:02 2006] [error] child process 11956 still did not exit, sending a SIGKILL
[Sun Aug 27 15:54:03 2006] [notice] caught SIGTERM, shutting down
[Sun Aug 27 15:54:05 2006] [warn] Init: Session Cache is not configured [hint: SSLSessionCache]
[Sun Aug 27 15:54:05 2006] [warn] RSA server certificate CommonName (CN) `hXXXXXX.serverkompetenz.net' does NOT match server name!?
[Sun Aug 27 15:54:05 2006] [warn] RSA server certificate CommonName (CN) `hXXXXXX.serverkompetenz.net' does NOT match server name!?
No worker file and no worker options in httpd.conf \nuse JkWorkerFile to set workers\n
[Sun Aug 27 15:54:05 2006] [notice] suEXEC mechanism enabled (wrapper: /usr/sbin/suexec2)
[Sun Aug 27 15:54:05 2006] [warn] RSA server certificate CommonName (CN) `hXXXXXX.serverkompetenz.net' does NOT match server name!?
[Sun Aug 27 15:54:05 2006] [warn] RSA server certificate CommonName (CN) `hXXXXXX.serverkompetenz.net' does NOT match server name!?
No worker file and no worker options in httpd.conf \nuse JkWorkerFile to set workers\n
[Sun Aug 27 15:54:05 2006] [notice] mod_python: Creating 32 session mutexes based on 512 max processes and 0 max threads.
[Sun Aug 27 15:54:05 2006] [notice] Apache/2.0.54 (Linux/SUSE) configured -- resuming normal operations
[Sun Aug 27 15:57:16 2006] [error] server reached MaxClients setting, consider raising the MaxClients setting
--------------------------------------
Code:
netstat -nlp
Active Internet connections (only servers)
Proto Recv-Q Send-Q Local Address Foreign Address State PID/Program name
tcp 0 0 0.0.0.0:993 0.0.0.0:* LISTEN 4490/couriertcpd
tcp 0 0 0.0.0.0:995 0.0.0.0:* LISTEN 4509/couriertcpd
tcp 0 0 0.0.0.0:3306 0.0.0.0:* LISTEN 4896/mysqld
tcp 0 0 0.0.0.0:106 0.0.0.0:* LISTEN 4686/xinetd
tcp 0 0 0.0.0.0:110 0.0.0.0:* LISTEN 4499/couriertcpd
tcp 0 0 0.0.0.0:143 0.0.0.0:* LISTEN 4479/couriertcpd
tcp 0 0 0.0.0.0:80 0.0.0.0:* LISTEN 21907/httpd2-prefor
tcp 0 0 0.0.0.0:8880 0.0.0.0:* LISTEN 5212/httpsd
tcp 0 0 0.0.0.0:465 0.0.0.0:* LISTEN 4686/xinetd
tcp 0 0 0.0.0.0:21 0.0.0.0:* LISTEN 4686/xinetd
tcp 0 0 81.169.185.127:53 0.0.0.0:* LISTEN 4841/named
tcp 0 0 127.0.0.1:53 0.0.0.0:* LISTEN 4841/named
tcp 0 0 0.0.0.0:22 0.0.0.0:* LISTEN 4682/sshd
tcp 0 0 127.0.0.1:3000 0.0.0.0:* LISTEN 22553/drwebd
tcp 0 0 127.0.0.1:5432 0.0.0.0:* LISTEN 5004/postmaster
tcp 0 0 127.0.0.1:953 0.0.0.0:* LISTEN 4841/named
tcp 0 0 0.0.0.0:25 0.0.0.0:* LISTEN 4686/xinetd
tcp 0 0 0.0.0.0:443 0.0.0.0:* LISTEN 21907/httpd2-prefor
tcp 0 0 0.0.0.0:8443 0.0.0.0:* LISTEN 5212/httpsd
udp 0 0 0.0.0.0:32768 0.0.0.0:* 4841/named
udp 0 0 81.169.185.127:53 0.0.0.0:* 4841/named
udp 0 0 127.0.0.1:53 0.0.0.0:* 4841/named
udp 0 0 0.0.0.0:68 0.0.0.0:* 4230/dhcpcd
udp 0 0 81.169.185.127:123 0.0.0.0:* 4718/ntpd
udp 0 0 127.0.0.1:123 0.0.0.0:* 4718/ntpd
udp 0 0 0.0.0.0:123 0.0.0.0:* 4718/ntpd
Active UNIX domain sockets (only servers)
Proto RefCnt Flags Type State I-Node PID/Program name Path
unix 2 [ ACC ] STREAM LISTENING 11054 5004/postmaster /tmp/.s.PGSQL.5432
unix 2 [ ACC ] STREAM LISTENING 11162 5062/null /tmp/spamd_full.sock
unix 2 [ ACC ] STREAM LISTENING 8370 3677/dbus-daemon
Code:
/var/run/dbus/system_bus_socket
unix 2 [ ACC ] STREAM LISTENING 9904 4463/acpid /var/run/acpid.socket
unix 2 [ ACC ] STREAM LISTENING 11173 5064/null /tmp/spamd_light.sock
unix 2 [ ACC ] STREAM LISTENING 10612 4767/nscd /var/run/nscd/socket
unix 2 [ ACC ] STREAM LISTENING 10798 4896/mysqld
Code:
/var/lib/mysql/mysql.sock
unix 2 [ ACC ] STREAM LISTENING 4183160 22553/drwebd
Code:
/var/drweb/run/.daemon
unix 2 [ ACC ] STREAM LISTENING 11846 5338/hald
Code:
@/tmp/hald-local/dbus-4QGdBhexc8
--------------------------------------
Code:
top - 16:16:10 up 12 days, 17:53, 1 user, load average: 0.24, 0.53, 2.57
Tasks: 610 total, 1 running, 609 sleeping, 0 stopped, 0 zombie
Cpu(s): 0.0% us, 0.3% sy, 0.0% ni, 97.2% id, 0.0% wa, 0.3% hi, 2.2% si
Mem: 2073848k total, 1860816k used, 213032k free, 15564k buffers
Swap: 1052248k total, 86632k used, 965616k free, 642588k cached
PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND
1076 root 16 0 2376 1324 772 R 0.7 0.1 0:00.12 top
1 root 16 0 692 68 40 S 0.0 0.0 0:02.06 init
2 root RT 0 0 0 0 S 0.0 0.0 0:00.36 migration/0
3 root 34 19 0 0 0 S 0.0 0.0 0:00.00 ksoftirqd/0
4 root RT 0 0 0 0 S 0.0 0.0 0:00.34 migration/1
5 root 34 19 0 0 0 S 0.0 0.0 0:00.00 ksoftirqd/1
6 root 10 -5 0 0 0 S 0.0 0.0 0:00.00 events/0
7 root 10 -5 0 0 0 S 0.0 0.0 0:00.00 events/1
8 root 10 -5 0 0 0 S 0.0 0.0 0:00.34 khelper
9 root 10 -5 0 0 0 S 0.0 0.0 0:00.00 kthread
18 root 10 -5 0 0 0 S 0.0 0.0 0:00.00 kacpid
327 root 10 -5 0 0 0 S 0.0 0.0 0:13.55 kblockd/0
331 root 10 -5 0 0 0 S 0.0 0.0 0:00.37 kblockd/1
386 root 16 0 0 0 0 S 0.0 0.0 4:28.03 kswapd0
387 root 11 -5 0 0 0 S 0.0 0.0 0:00.00 aio/0
388 root 11 -5 0 0 0 S 0.0 0.0 0:00.00 aio/1
977 root 10 -5 0 0 0 S 0.0 0.0 0:00.00 kseriod
1030 root 15 0 0 0 0 S 0.0 0.0 0:00.00 kirqd
1107 root 12 -5 0 0 0 S 0.0 0.0 0:00.00 ata/0
1108 root 10 -5 0 0 0 S 0.0 0.0 0:00.00 ata/1
1230 root 15 0 0 0 0 S 0.0 0.0 1:59.66 kjournald
2076 root 12 -4 2040 640 580 S 0.0 0.0 0:00.74 udevd
2793 root 20 0 0 0 0 S 0.0 0.0 0:00.00 shpchpd_event
3677 messageb 17 0 3520 996 880 S 0.0 0.0 0:02.36 dbus-daemon
4230 root 16 0 1524 460 416 S 0.0 0.0 0:01.47 dhcpcd
Diese Performanceeinbrüche passieren oft und nerven. Was kann die Ursache dafür sein? Wie kann man herausbekommen, wer oder was die child process Fehlermeldungen erzeugt?
Schon seit vielen Tagen bin ich ratlos und ich würde mich riesig freuen, wenn dieses Problem irgendwann gelöst werden könnte.
Gruß,
Last edited by a moderator: