Need an equation to work out file caching

From: Gareth Williams (gareth_at_nospam.com)
Date: 08/01/04


Date: Sun, 1 Aug 2004 19:06:28 +0000 (UTC)

Scenario:

1. A firm receives 100,000 orders per week as electronic documents.

2. All documents are archived on CDROM.

3. A percentage of orders (average around 10%) will go wrong at some
stage and access to the original order will be required by Customer
Services to resolve the problem.

4. Retrieval from CDROM is time-consuming but a limited amount of space is
available to cache some of the documents on networked storage for more
immediate retrieval. There is not enough space to cache all the documents
so the older cached documents are deleted regularly.

5. It is not possible to determine "up front" which orders will fail -
whilst the failure rate is fairly constant over time, just about any
document could end up being requested by Customer Services.

Problem:

The firm would like to cache "N" documents (from the weekly pool of
100,000 orders) such that "X" percent of Customer Services requests can
be satisfied from the networked storage. Customer Services are prepared
to accept that the remaining [100-X]% of requests will still need to be
retrieved from the CDROM store.

In short, how do we calculate "N"?
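
A first-cut sketch of the arithmetic, in Python, under one strong assumption taken from point 5: every document in the weekly pool is equally likely to be requested. On that assumption the expected share of requests served from the cache is simply N / 100,000, so N = X% of the pool. The function and constant names below are made up for illustration. The sketch also ignores how long after an order the request arrives; if requests cluster around recent orders, keeping documents by age should do better than this, and a proper estimate would need the observed delay between order and request.

```python
import math

WEEKLY_ORDERS = 100_000  # from the scenario
FAILURE_RATE = 0.10      # average share of orders that go wrong (scenario point 3)


def cache_size_for_hit_rate(x_percent, pool=WEEKLY_ORDERS):
    """Smallest N whose expected hit rate is at least x_percent.

    Assumption: every document in the pool is equally likely to be
    requested, so expected hit rate = N / pool.
    """
    if not 0 <= x_percent <= 100:
        raise ValueError("x_percent must be between 0 and 100")
    # round() guards against floating-point noise before taking the ceiling
    return math.ceil(round(pool * x_percent / 100, 9))


def expected_hits_and_misses(n_cached, pool=WEEKLY_ORDERS,
                             failure_rate=FAILURE_RATE):
    """Expected weekly requests served from cache and from CDROM.

    Same uniform-request assumption as above.
    """
    requests = pool * failure_rate
    hits = requests * n_cached / pool
    return hits, requests - hits


if __name__ == "__main__":
    n = cache_size_for_hit_rate(80)
    hits, misses = expected_hits_and_misses(n)
    print(f"Cache {n} documents: about {hits:.0f} requests from cache, "
          f"{misses:.0f} from CDROM per week")
```

Note that under the uniform assumption the failure rate drops out of the answer for N: it changes how many requests arrive each week, not what fraction of them the cache can satisfy.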

This reminds me a little of the "Cookie Jar" or "Sock Drawer" problem,
but it bugs me that I can't puzzle it out. I'm not a hard-core
statistician and would be grateful for any help. This is a genuine (i.e.
non-homework) request, by the way.

-- 
Regards, Gareth Williams

