I have a very strange problem. Yesterday morning I got a "server down" alert. Restarted httpd and everything run ok ... until today morning, the same problem again.
Symptoms:
1. The webserver did not stop working, it just took too much time to respond.
2. I cannot find anything suspicious in the logs.
3. I started to log the number of apache processes in 4 minutes interval, it did not increase during the failure but remained at a very reasonable number.
4. Now, almost 3 hrs from the last failure, there are 36 apache processes, each eating 14M RAM, server has 4GB ram total, no swapping, almost 3GB are free (cached).
The question is. How should I prepare for the expected tommorow failure, to be able finaly localise the problem?
I have 3 servers ServerA(Web, mail), ServerB(MySQL+Master replication), ServerC(Mysqlslave+web)
It happens that my website stops responding for few mins and then it comes back again automatically. I checked the server logs but I couldn't find any suspicious.
Also, while the website is not accessible when I try to connect internally from ServerC to ServerA or ServerB using SSH. It takes lots of time (approx more than 60 seconds) to connect. When website starts working SSH is also working fine.
This is very complicated for me. Can anyone let me know what should be the problem or how can I find root cause of this problem?
I have submitted a ticket 15 days ago and still no response from Talanovbackup.com. I mentioned an error on their site which seems to be fixed now but I am still unable to login to my data by SSH.
We got a trouble with their service and they don't reply our complain message looking for a good solution. Up till now we still waiting for their respond..Asish...WHERE ARE YOU...!
our site is having problems since a week or so our forum is down, our emails are all down we have 10 staff using different aliases on the same email domain our hosting provider is bluefishhosting.com we have been unable to reach them since the last week we have contacted the company that bluefish buys their reselling account from, but they also tell us to wait another 7 days our office cannot bear this down time of 14 days
recently I have update our Apache 2.2 instance to 2.4.10, and started using Apache in Windows 2008 R2 64 bit (before we were using it in Win 2003). Our Apache is used mainly as reverse proxy to 3 apps. Well, it seems that, even if I have scheduled a nightly reboot of its service, every day it hangs, becoming unresponsive and forcing us to restart it to make it work again.
ThePlanet not responding to a ticket for nearly a week?
I've used theplanet for years, but haven't used their support much. But after getting a new server, I discovered the configuration wasn't as promised (it was missing 5 IPs).
I left a ticket last Friday afternoon (12/4), but no response so I called support on the weekend. Although he promised they'd expedite it, the only result was a slightly sarcastic post to my ticket implying my expecting an acknowledgement within 24 hours was unreasonable.
I called on Monday, and the salesperson promised a manager would respond by Tue. Nothing by Wed, so I called again, and was promised that someone would contact me shortly but still nothing. So now it's just a day short of a week after filing my ticket, and nothing but a runaround. Has their service gone downhill that much recently, or is it just me?
I got my hands on a used 2161DS (unit only, no cables etc), but somehow it doesnt seem to respond on the hyperterminal to any keyboard input.
As its used it doesnt have any support directly from Dell and Dell is unable even to give us a cost estimate as the service tag number is not in their system. Anyway...
When I power up the 2161DS KVM-Over-IP switch all I get in the Hyperterminal is
From there on its not working. The keyboard and mouse directly connected to it are working. Also all the menus, but we can set the network address, do firmware updates etc, only from the serial port connection (once the serial port is working).
I checked the hyperterminal settings (9600 baud, 8 bit, 1 stop bit, no parity, no flow control).
Is there anything that I might be missing?
Perhaps I have to use something like a crossed serial cable instead of a straight one ? Character set issues? Terminal type issues?
We have spent days in front of the machine and on the phone with Dell without really getting anywhere. Its our first 2161DS so I still want to believe that we must be missing something.
I have a problem with slow Apache 2.4.4. It's only related with https which is in general 5 time slower than the same site via http. I see this difference on the monitoring software which is measuring response time to http and https every 5 seconds. In some cases i got even timeouts in the browser on https while in the same time site is opening over http - slowly but opens always.
Apache runs on Win2k8 Enterprise, Version 2.4.4 x64 - VC10. Server is connected with quite poor internet connection as it's located in Africa. Despite of connection quality http is working properly all the time.
I use apache 2.4.1 and mod_fcgid (same config form apache 2.2.22+mod_fcgid 2.3.6) and without any error message, apache stop responding randomly.There is no problem with apache 2.2.22+mod_fcgid 2.3.6 and with apache 2.4.1 + php5_module i have
I have two whm cpanel servers on one provider and they both reporting same error when i click on phpmyadmin on whm: #2002 - The server is not responding (or the local MySQL server's socket is not correctly configured)
I'm having a very odd problem with one of my Linux (CentOS) cpanel server, all the server's services (http, ssh, mail, dns, etc) stop responding but the server still responds to ping.
I can't find anything wrong at all on the log files either, and the technicians that manually restart the server have told me that there is no indication of a problem on the screen.
I suspected a hardware issue and had the data center techs run a hardware test on the server but everything cleared ok.
This issue started a couple of weeks ago, no major upgrade or install took place when it started happening. From what i can see the halts are completely random, some times it goes for days without it happening and some times it happens just hours after the reboots.
Just got a strange problem on my plesk server. (11.0.9)
Qmail isn't working...
In Home>Tools & Settings>Services management SMTP "Server (QMail)" is stoppped.
When I try to start it, it say "Information: Please allow for some time for the service to start." but never starts...
In command line i try to restart it with "service qmail restart" and it says "OK" also if i run "service qmail status" it says "qmail-send (pid 2880) is running..."
but, it really doesn't work... queue is getting bigger and smtp isnt responding...!
Smtp service (qmail) stops responding on port 25: # time telnet localhost 25 Trying 127.0.0.1... quit quit Connected to localhost.localdomain (127.0.0.1). Escape character is '^]'. quit quit Connection closed by foreign host.
real 4m10.629s user 0m0.000s sys 0m0.002s
After server restart or sometimes apache stop or ixnetd restart its responding for a some random time, and then again it stops to respond. Plesk panel show it as stopped but qmail itself running in memory, and does other its work, it just stops responding at port 25, or responds with a huge delay.
I've tried change it to postfix, reconfigured with mchk, repaired with repair.sh -r, disabled and uninstalled parallels antivirus, antispam, dnsbl, disabled firewall, disabled smtp lock. Checked apache, dns. Enabled submission port which works when 25 port doesnt, but i need working 25 port.
Nothing solves problem, its just stops responding after some random time. There is no errors on maillog.
I think this problem occured after recent plesk microupdate, because i didn't do anything to server configuration in last months.
This article says it might be dnsbl [URL] .... but it disabled(from plesk panel) on my server, maybe there is way to focefully kill any relation to dnsbl?
Plesk info: OS Red Hat Enterprise Linux Server 5.9 (Tikanga) Panel version 11.5.30 Update #50, last updated at May 18, 2015 05:21 PM The system is up-to-date; last checked at May 17, 2015 10:56 PM
update: xinetd restart is definitely brings smtp alive, but it goes off after random period of time (5min ~ couple hours)
I am having issues in receieving emails. For some reason, the rbl lists I had setup are causing the server to reject emails (retry - timeout). So, I need to take this rbl list completely. How can I do that? exim.conf is locked and using the advanced editor is no fun even though I tried it putting the dnslists without the rbl causing the problem.
this is often happening on my new servers, with FreeBSD and exim 4.69 2 exim process start using a lot of CPU (that's not 100%, but it's like 40% for one process and 35% for other) for hours...
but, as soon as I restart exim, that stops so it's not a high mail load on server, nor anything like that
I even checked logs to see if it was on some kind of infinite loop (auto-auto-auto-auto-reply), etc, but can't find anything out of ordinary
I have a dedicated server with WHM installed on it, but recently I've been having problems with emails, specifically exim.
The main issue appears to be a huge number of exim processes all running at the same time. It pushes the server load higher and higher (and when I say high I mean over 100), and basically locks everything else up until I can get a command through to kill exim.
After a bit more investigation I found that the mail queue in WHM appears to be seperate to the one I can find with the exom -bpc command, and gets full of email sent to non existant domains or accounts. So my first theory is that at some point exim tries to deliver all of these at once and that causes the massive load spikes. I don't know if that's possible, or probable, but there isn't enough legitimate email coming into the server that there ought to be any issues.
i've read about how to control the mail q from exim, but that doesn't appear to make a different to the q shown in whm. Currently the server is being held up by a cron running every half hour to restart exim automatically, but at peak times this doesn't appear to be doing enough, and at one point yesterday exim had 400 running processes.
Obviously this is causing a few problems. I don't have the technical knowledge to diagnose or fix the problem past the guesswork i've already done, so i'd appreciate any suggestions
I have some clients who own large forums, and during usage Mass Mail CPU goes up to 100%. Is there any way to re-configure the exim so not to distrupt the CPU that much?
I got a mail "spamd failed @ Fri Jan 11 04:34:53 2008. A restart was attempted automatically".And I checked the server.Then I found that spamd is not working.Its a cpanel server.I've tried to restart exim but spamd is not starting.
I'm trying to diagnose some server load spikes, and I've noticed that my exim log files are getting huge (5 gigs, plus 4 gzips at 1.7gigs)...my server status shows the gzips and greps on these log files putting my cpu load at 99.9%...how do i keep these from getting so huge and/or keep them from maxing out my server?
I recently switched over from Virtuozzo to WHM (on a vps), and was going through some of the different pages there. I noticed one page that displays the exim stats, similar to running it through the command line. Anyway there is one section I'm not entirely sure what it's referring to.
Quote:
Top 50 mail rejection reasons by message count
Messages Mail rejection reason 311 Rejected RCPT: No such person at this address 75 Rejected RCPT: Sender verify failed 25"The mail server detected your message as spam and has prevented delivery (200)."
I'm not sure if this is referring to inbound addresses being blocked, or forged emails from my server being rejected by outside servers.