Depends. "free" reports the area used for disk buffers and programs, hence "avai...

hinkley · 2024-08-17T22:26:52 1723933612

One of the older arguments I get to keep having over and over is No, You May Not Put Another Service on These Servers. We are using those disk caches thank you very much.

I do not enjoy showing up to yet another discussion of why our response times just went up “for no reason”. Learn your latency tables people.

bayindirh · 2024-08-17T22:31:23 1723933883

Yeah, people tend to think server utilization as black and white.

Look, we're using just 50% of that RAM. Look, there're two cores that are almost idle.

No & No. Rest of the RAM is your secret for instant responses, and that spare CPU resource is for me to do system management without you notice or to front the odd torrent of requests we have semi regularly (e.g.: /. hug of death. Remember?).

hinkley · 2024-08-17T22:46:09 1723934769

I need to find a really good intro to queuing theory to send people to. A full queue is a slow queue. You actually want to aim for about 65% utilization.

Version467 · 2024-08-18T04:40:36 1723956036

This might be too basic, but I found this blog post to be an incredible introduction to queues: https://encore.dev/blog/queueing

bayindirh · 2024-08-17T22:50:22 1723935022

Also, there was a formula for determining the optimal cache size. I forget the name all the time. IIRC, in the end, caching most popular 10 items was enough to respond to 95% of your queries without hitting the disk.

redxtech · 2024-08-18T03:48:46 1723952926

If the numbers from the phoenix project are to be trusted, a loose estimate is the time spent in queue is proportional to the ratio of utilized to unutilized resources. For example, 50% used & 50% unused is 50:50 = 1 unit of time. 99% used is 99:1 = 99 units of time.