Figure 1: Hourly totals for server requests, unique visitors, and bandwidth during a trip to Reddit’s front page.

The peak traffic came at 4:00 PM; during that hour there were 565,310 server requests from 35,870 unique visitors. This works out to an average rate of 157 requests per second and 10 users per second. A request is a browser asking for a piece of data on the page (scripts, images, HTML), so one user makes many requests on arrival. To give you an idea of the scale, in 2008 Wikipedia handled 50,000 requests per second with 300 servers. The average rate, however, hides the fact that requests come in randomly, so the actual rate fluctuates from second to second. Although we don’t have the second-by-second information, we can estimate what it looks like by modeling the user arrivals as a random Poisson process.

A primer on Poisson processes: One of the canonical examples of a Poisson process is the distribution of calls to a call center, where the average rate is known, but the calls come in at random times. Each call arrives independently of every other call, so there might be stretches where no calls arrive or periods where many calls arrive. The call center would want to model this to determine the minimum staffing required to cover these high-call periods, balancing the risk of excessive hold times caused by understaffing against the financial cost of adding more customer service agents. By setting their allowable risk (here, we’ll say full coverage 99% of the time), they can look at the cumulative distribution function (CDF) of the Poisson distribution for their average call rate. The CDF gives the probability of receiving a certain number of events or fewer per time period. Here, it shows the maximum number of callers they can expect to receive 99% of the time.

![cdf files reddit cdf files reddit](https://archive.ph/9q56w/8ecefb9fac231bb100d23afe1cf45595b8866f85.png)

Figure 3: The cumulative distribution function for the number of server requests per second (from the maximum load of 10 users per second). This figure shows there is only a 0.01% chance of getting more than 360 requests (24 users) per second during peak traffic on a Reddit hug-of-death.

Any hosting solution that will potentially be exposed to the harsh digital environment of Reddit’s front page should thus, at a minimum, support this rate.

Nginx + PHP 7.1 + PHP-FPM = 40+ million visitors per day with zero failures

Now that we know the performance goal, we need to determine how badly the shared hosting performed, and whether there is any other hosting solution out there for a similar price that can handle this load. In terms of cheap hosting solutions, one that repeatedly came up was DigitalOcean. For $5 a month, you get a virtual private server with 1 TB of bandwidth per month, 1 CPU core, and 512 MB of memory. In particular, a series of posts by the blog DeliciousBrains (who do WordPress plug-in development) claimed amazing performance with the cheapest DigitalOcean server by installing nginx, PHP 7.1, and PHP-FPM (don’t worry about the acronyms) to augment the standard WordPress install with ultra-fast caching. They showed that a WordPress site run with this $5-a-month setup could support 40 million visitors per day with zero failures, which works out to about 31 users (470 requests) per second.
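As a rough check on the numbers in the traffic analysis, the sketch below recomputes the average peak rates and the Poisson quantile using only the Python standard library. It assumes, as the 24-user figure suggests, that user arrivals (a mean of about 10 per second) are the Poisson variable, and that requests scale at roughly 15 per user (inferred from 360 / 24). The function names and that per-user multiplier are illustrative assumptions, not details confirmed by the original analysis.

```python
import math

PEAK_REQUESTS = 565_310   # server requests in the peak hour (from the text)
PEAK_VISITORS = 35_870    # unique visitors in the peak hour (from the text)
SECONDS_PER_HOUR = 3600


def poisson_cdf(k: int, mean: float) -> float:
    """P(X <= k) for a Poisson variable with the given mean."""
    term = math.exp(-mean)  # P(X = 0)
    total = term
    for i in range(1, k + 1):
        term *= mean / i    # P(X = i) derived from P(X = i - 1)
        total += term
    return total


def poisson_quantile(mean: float, coverage: float) -> int:
    """Smallest k such that P(X <= k) >= coverage."""
    k = 0
    while poisson_cdf(k, mean) < coverage:
        k += 1
    return k


requests_per_second = PEAK_REQUESTS / SECONDS_PER_HOUR  # about 157
users_per_second = PEAK_VISITORS / SECONDS_PER_HOUR     # about 10

# Assumption: users (not individual requests) arrive as a Poisson process
# with a mean of 10 per second, the rounded figure used in the text.
peak_users = poisson_quantile(10.0, 0.9999)

# Assumption: roughly 15 requests per user, inferred from 360 / 24.
peak_requests = peak_users * 15

print(round(requests_per_second), round(users_per_second))  # 157 10
print(peak_users, peak_requests)                            # 24 360
```

With the same mean, `poisson_quantile(10.0, 0.99)` returns 18, which is the kind of 99% coverage threshold the call-center example describes.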