Author: Barry

  • Load Balancer Update

    A while back, I posted about some testing we were doing of various software load balancers for WordPress.com.  We chose to use Pound and have been using it past 2-ish years.  We started to run into some issues, however, so we starting looking elsewhere.  Some of these problems were:

    • Lack of true configuration reload support made managing our 20+ load balancers cumbersome.  We had a solution (hack) in place, but it was getting to be a pain.
    • When something would break on the backend and cause 20-50k connections to pile up, the thread creation would cause huge load spikes and sometimes render the servers useless.
    • As we started to push 700-1000 requests per second per load balancer, it seemed things started to slow down.  Hard to get quantitative data on this because page load times are dependent on so many things.

    So…  A couple weeks ago we finished converting all our load balancers to Nginx.  We have been using Nginx for Gravatar for a few months and have been impressed by its performance, so moving WordPress.com over was the obvious next step.  Here is a graph that shows CPU usage before and after the switch.  Pretty impressive!

    Before choosing nginx, we looked at HAProxy, Perlbal, and LVS. Here are some of the reasons we chose Nginx:

    • Easy and flexible configuration (true config “reload” support has made my life easier)
    • Can also be used as a web server, which allows us to simplify our software stack (we are not using nginx as a web server currently, but may switch at some point).
    • Only software we tested which could handle 8000 (live traffic, not benchmark) requests/second on a single server
    We are currently using Nginx 0.6.29 with the upstream hash module which gives us the static hashing we need to proxy to varnish.  We are regularly serving about 8-9k requests/second  and about 1.2Gbit/sec through a few Nginx instances and have plenty of room to grow!
  • Amazon AWS Outage

    Looks like quite a few (if not all) of the Amazon AWS services are down or performance is significantly degraded this morning. This is the first significant outage since we started using S3 to serve images for WordPress.com. Currently we serve about 1500 image requests per second across WordPress.com. About 80-100 per second are served through S3; the rest being served from our local caches. When the outage occurred, our systems detected the errors and automatically sent the requests normally bound for S3 to local image servers that we use for backup and failover purposes. The outage is currently going on 2+ hours. I wonder what impact, if any, this will have on AWS. It seems like quite a few folks are using S3 and EC2 as their sole source of computing power and storage. I wonder if these folks will move to more traditional hosting providers where there are formal SLAs, support, etc.

    UPDATE: Looks like after about 2.5 hours of downtime, things are starting to come back online over at Amazon.

    UPDATE: I guess there is a SLA for S3.

  • Empire State Building Pink and Red for Valentine’s Day

    Before I moved to New York, I never realized that the Empire State Building’s lighting scheme changed so frequently. I found the site where they detail the lighting schedule. The wikipedia page also has some good information. Since it’s Valentine’s Day, the lighting today was red, pink, and white. Here’s what it looks like from 33 floors up and across the East River.
    empire-vday.jpg

    Happy Valentine’s Day!

    On a side note, looks like it’s time to clean the sensor on the camera…

  • Manhattan Sunset

    Finally made it to NYC after getting stuck for 13 hours in the Tuscon airport.  There is nothing like the Manhattan skyline at sunset…

    manhattan-sunset.jpg

  • Congratulations Erin!

    Today, my sister, Erin, graduated from college. We thought the day would never come 😉 Congratulations, Erin, we are all proud of you.

    erin2.jpg