Re: memory use on long persistent connection (eg for e-commerce sites, squids)

To:	lvs-users@xxxxxxxxxxxxxxxxxxxxxx
Subject:	Re: memory use on long persistent connection (eg for e-commerce sites, squids)
Cc:	Roberto Nibali <ratz@xxxxxx>, Horms <horms@xxxxxxxxxxxx>, Julian Anastasov <ja@xxxxxx>, joe@xxxxxxxxxxxxx
From:	Roberto Nibali <ratz@xxxxxxxxxxxx>
Date:	Fri, 20 Sep 2002 01:18:29 +0200

Hi Joe,

The conventional LVS wisdom is that it's not a good
idea to build an LVS e-commerce website in which https is persistent for long periods.

This is also a general wisdom of practical software engineering. Maybe some AC or PWC guys are subscribed to this list? Listen carefully then!

The initial idea was that a long timeout allows the customer to have a cup of coffee or surf to other websites while thinking about their on-line purchase.


Unless he has to buy the cup of coffee first at thinkgeek.

The problem with this approach is that the amount of memory use is expected to be large and the director will run out of memory. We've been telling people

Yes, if the timeout is very high, and people keep on clicking on the site (thus the template never expires), and you've got low memory and a lot of people are interested in your site.

to rewrite their application so that state is maintained on the realservers allowing the customer to take an indefinite time to complete their purchase.

Well, it depends what you want to offer. If it's an online shop like amazon.com you certainly want to store the generated cookie or whatever it is on a central DB cluster that every RS can connect to and query for the ID if it doesn't already have one.

Currently 1G of memory costs about an hour of programmer's time
(+ benefits, + office rental/heating/airconditioning/equipment

:) I don't know about the expenses in the States but you can certainly buy a lot of RAM over here for an hour of a programmer's time.

+ support staff). Since memory is cheap compared to the cost of rewriting your application, I was wondering if brute force might just be acceptable.

It's a completely different layer. It's about software engineering and not about saving money. Yes, you can probably kill the problem temporarily by adding more memory but a broken application framework remains a broken application framework.

Plus, normally when you do build an e-commerce site, you have a customer that has outsourced this task to your company. So you do a C-requirement and a feasibility study to provide the customer with a proper cost estimation. Now you build the application and it is built in a broken way so that you need to either fix it or, in our case, add more RAM. The big problem here is:


  o you might have a strict SLA that doesn't permit this
  o you change the C-requirements and thus you need a new test phase
  o the customer gets upset because she spent big bucks on you

It's lack of engineering and a typical situation of plain incompetence: When you earnestly believe you can compensate for a lack of skill by doubling your efforts, there's no end to what you can't do.

But all this also depends on the situation. I don't think we can give people a generalised view of how things have to be done. One might argue that people come to this project because of monetary constraints and they sure do not care about the application if the problem is solved by putting more RAM into the director.

I, for example, would rather spend a few bucks on good hardware and a lot of RAM for the RS because they need to carry the execution weight of the application. The director is just a more or less intelligent router.

I can't find any estimates of the numbers involved in the HOWTO
although similar situations have been discussed on the mailing list, e.g.


We actually have but I can't remember where. It was back in 2000 or so ;).

http://marc.theaimsgroup.com/?l=linux-virtual-server&m=99200010425473&w=2

there the calculation was done to see how long a director would
hold up under a DoS. The answer was about 100secs for 128M memory
and 100Mbps link to the attacker doing a SYN flood.


Yes.

I'm not running one of these web
sites and I don't know the real numbers here. Is amazon.com
or ebay connected by 100Mbps to the outside world?

They might be. AFAICR google.com is/was running LVS and they certainly have this connection. Some of our customers do have such fat pipes too :).

What can you do with 1G of memory on the director?


It depends.

Each connection requires 128 bytes. 1G/128 is 8M customers online at any one time. Assuming everyone buys something, this is 1500 purchases/sec. You'd need the population of
a large town just to handle shipping stuff at this rate.

First of all, you can't use 1G out of 1G for the LVS. Maybe 750MB or 800MB but not 1GB. And then you have a normal TCP timeout of 2 minutes per template. Now yes, you could actually have 6-8M potential customers but the problem with your thinking is that you assume that as soon as the template is created it is destroyed again within a second. But this isn't the case. It will remain for at least 2 minutes (Julian, correct me if this value is wrong, because I have neither the code nor a box to check).

So you get 6M customers and the LVS is dead until the first one decides to move away from the site, and then it still needs 2 minutes to free that RAM (a bit untechnically spoken). Now if the guy decides to come back within those 2 minutes the template timer will simply be updated and still no one can reach the site.

I doubt if any website at peak load has 8M simultaneous customers.

It's not about 6M for a peak (6M is only for the first 6M) but about 6M/120s for best effort, which is 50000, and this is for a low timeout of 2 minutes. That means after the initial fill of the template space in RAM you max out at 50000 conns/s with 1GB, because the old templates do not get released while the timer is still active: we assume that the customer wants to come back during a certain amount of time (persistency).

50000 conns/s seems like a high number, and in fact it is, but now consider someone putting the timeout to something insane like 15 minutes and you get 6M/900 = 6666 conns/s, which is not a lot. Think of a connection request of, let's say, about 200 bytes, and you get:


ratz@laphish:~ > echo "6666*200/1024/1024*8" | bc -l
10.17150878906250000000
ratz@laphish:~ >

That's only 10Mbit/s of requests. I'm pretty sure that amazon has a higher request rate.
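As a rough sketch of the arithmetic in this reply (not a measurement of any real director): the sustainable rate of new clients is the number of templates that fit in memory divided by the persistence timeout. The function names are made up for illustration; the 128 bytes per entry, the 6M capacity, the 200 byte request and the 1024*1024 divisor are taken from the numbers used in this thread.

```python
# Rough model: persistence templates stay in director memory until their
# timeout expires, so the sustainable rate of new clients is capacity / timeout.

ENTRY_BYTES = 128  # per-connection cost quoted in this thread


def template_capacity(usable_bytes: int, entry_bytes: int = ENTRY_BYTES) -> int:
    """Number of templates that fit in the memory left over for LVS."""
    return usable_bytes // entry_bytes


def sustained_rate(capacity: int, timeout_s: int) -> float:
    """New clients per second once the template space has filled up."""
    return capacity / timeout_s


def request_mbit_per_s(rate: float, request_bytes: int = 200) -> float:
    """Request bandwidth in Mbit/s, using 1 Mbit = 1024 * 1024 bits as in the bc line."""
    return rate * request_bytes * 8 / (1024 * 1024)


if __name__ == "__main__":
    capacity = 6_000_000  # the round "6M" figure used in this reply
    for timeout_s in (120, 900):
        rate = sustained_rate(capacity, timeout_s)
        print(f"timeout {timeout_s:4d}s: {rate:8.0f} conns/s, "
              f"{request_mbit_per_s(rate):6.2f} Mbit/s of requests")
```

With 750MB to 800MB usable, template_capacity() lands between roughly 6.1M and 6.6M entries, which is where the round 6M figure comes from.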

However you only have 64k ports on each realserver to connect with customers, allowing only 64k customers/realserver. How much memory do you need on the director to handle a fully connected realserver?
64k x 128 = 8M


Julian already gave the answer.

Let's say there are 8 realservers. How much memory is needed on the director?


8 x 8M = 64M

this is not a lot of memory. So the problem isn't
memory but realserver ports AFAIK
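The quoted memory estimate can be reproduced with a few lines. This is only a sketch of that estimate: it takes the 64k ports per realserver limit as given, which is the premise this reply disputes, and the function names are hypothetical.

```python
# Sketch of the quoted estimate: director memory if every realserver port
# holds one tracked connection. The 64k limit is the quoted assumption.

ENTRY_BYTES = 128                 # per-connection cost quoted in this thread
PORTS_PER_REALSERVER = 64 * 1024  # "64k ports" from the quoted text


def director_bytes(realservers: int,
                   conns_per_realserver: int = PORTS_PER_REALSERVER,
                   entry_bytes: int = ENTRY_BYTES) -> int:
    """Director memory needed to track every connection, in bytes."""
    return realservers * conns_per_realserver * entry_bytes


def to_mib(n_bytes: int) -> float:
    """Convert bytes to MiB (1 MiB = 1024 * 1024 bytes)."""
    return n_bytes / (1024 * 1024)


if __name__ == "__main__":
    for n in (1, 8):
        print(f"{n} realserver(s): {to_mib(director_bytes(n)):.0f} MiB")
```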

What is the minimum throughput of customers assuming
they all take 4000 sec (66 mins) to make their purchase?


8 x 64k/4000 = 64 purchases/sec

You're still going to need to hire a few people to pack and
ship all this stuff. If people only take 6 mins
for their purchase, you'll be shipping 640 packages/sec.

This is all wrong because it's based on the assumption that the ports are the restriction.

Assuming you make $10/purchase at 64 purchases/sec, that's
$2.5G/yr.


Please use Gauss for such an estimation ;)

So with 64M of memory, 8 realservers, 4000sec persistence timeout, and a margin of $10/purchase I can make a profit of $2.5G/yr.

No offense to you Joe (since you're not an American anyway), but I think if business income was to be calculated in that manner I would start to understand the economical problems of the USA. :)

It seems memory is not the problem here, but realserver
ports (or being able to ship all the items you sell).

No.

Let's look at another use of persistence - for squids
(despite the arrival of the -DH scheduler, some people
prefer persistence for squids).

Ok.

Here you aren't limited by shipping and handling of purchases.
Instead you are just shipping packets to the various target
httpd servers on the internet. You are still limited to
64k clients/realserver. Assume you make persistence = 256secs
(any client who is idle for that time is not interested
in performance). This means that the throughput/realserver is
256 hits/sec. This isn't great. I don't know what throughput to expect out of a squid, but I suspect it's a lot more.
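The 256 hits/sec figure follows from dividing the assumed client limit by the persistence timeout. A minimal sketch, again taking the 64k clients per realserver limit from the quoted text as an assumption rather than a fact about LVS; the function name is made up for illustration.

```python
# Sketch: if a realserver can hold at most max_clients persistent clients and
# each slot is held for persistence_s seconds, then slots free up at
# max_clients / persistence_s per second.

def throughput_per_realserver(max_clients: int, persistence_s: int) -> float:
    """New clients per second one realserver can accept under this model."""
    return max_clients / persistence_s


if __name__ == "__main__":
    max_clients = 64 * 1024  # "64k clients/realserver" from the quoted text
    for persistence_s in (64, 256, 1024):
        rate = throughput_per_realserver(max_clients, persistence_s)
        print(f"persistence {persistence_s:5d}s: {rate:7.1f} clients/s")
```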


This is not what we've been saying on the mailing list.
Have I missed something?


Yes, but Julian told you already.

Hope this helps and best regards,
Roberto Nibali, ratz
--
echo '[q]sa[ln0=aln256%Pln256/snlbx]sb3135071790101768542287578439snlbxq'|dc
