Re: $1000 to the first person who can solve this problem
- From: crimefreesoftware@xxxxxxxxx
- Date: Tue, 17 Feb 2009 15:34:24 -0800 (PST)
Hi Stan,
Yes, that set of 8158 numbers is a little more cyclical/trendy than
most of the sets we have. I will respond to your direct email with 6
other sets of data that aren't so trendy.
However, the method you are using is perfectly acceptable. In fact,
the "Secretary Problem" waits the first 4 numbers, and then picks the
next number (5th, 6th, 7th, etc) that is LOWER than all of the 1st
four numbers, or else picks the 10th number). So it is perfectly
acceptable to "wait for" the first few numbers to go by before making
a decision on the "current" number (obviously you cant "go back in
time" and pick earlier numbers/after the fact).
But your method of evaluating and then picking the "best" number later
in the sequence is fine.
I'm not sure if this is exactly what you're after since my
"algorithm" (crude as it is) needs to wait for 5 numbers in a set to
appear before it can make a "decision" about the trend. I think what
you're after is an algorithm that can make a decision on EVERY number?
The algorith can make the decision anytime... but it's logical to
assume it MAY wait a few numbers (into the Set of 10) before deciding.
It's all up to you. Just as long as you make the decision "on the fly"/
at the "current" number that is given to you/at that time... then hope
that the Selected Number ends up being lower than the average of the
Original Numbers in the long run.
Just for clarification, we're using this terminology:
Original Numbers = "the 10 numbers you are provided in each Set, one
at a time"
Selected Number = "the single number you choose out of the 10 provided
in each Set (the Original Numbers)"
That's the bottom line goal: if you add up all the Selected Numbers
and get their average... that average needs to be lower than the
average of ALL of the Original Numbers.
I hope that helps clarify things. I will send you an email directly
that will have the 6 other sets of data with over 200,000 points
total. You can run each of those sets thru your algorithm and notice
that in the "not so trendy" sets it may not have as much success / be
consistently lower average. But it sounds like you are on the right
track and that you understand the problem at hand.
Let me know how it went, please.
Thanks,
Dan
On Feb 17, 11:36 am, crimefreesoftw...@xxxxxxxxx wrote:
In a previous post many people helped us in articulating our math
problem and we were given some possible solutions. One solution that
is VERY close to what we need was the Secretary Problem. Thanks again
to those who helped us clarify/articulate our math problem and point
us in the right direction.
Our problem is almost perfectly described in what is known as the
"Secretary Problem."
Please refer to this link for a detailed explanation:http://en.wikipedia.org/wiki/Secretary_problem
The following is an outline of our specific problem:
Sequence of Events:
1) You receive Original Numbers one at a time.
2) You will eventually receive a total of 10 Original Numbers
(there are 10 Original Numbers to a Set)
3) Each time you receive an Original Number, you must decide
immediately (“on the fly”) whether THIS Original Number will be the
Lowest of all 10 Original Numbers in this Set, even though you haven’t
seen all 10 Original Numbers (yet).
4) Once you Select an Original Number, it becomes your Selected
Number for This Set, and you skip over/ignore the remaining Original
Numbers in the Set of 10 and you prepare to look at the next Set.
What We Need:
A formula/method/algorithm that will determine the Selected
Number (on the fly) that it thinks will be the lowest of the Set of 10
Original Numbers. Repeat this process of Selecting one number for
each subsequent Set of 10 Original Numbers.
Calculations to be kept:
1) Average of all of the Original Numbers (every Original Number
in every Set).
2) Average of all of the Selected Numbers (each Selected Number
in every Set).
Goal Desired:
To have the Average of all of the Selected Numbers be lower
than the Average of all of the Original Numbers by 1 point or more.
Purpose:
To use the (lower) Selected Average as a preferred subset of
numbers as well as to obtain better/lower Average results with much
less workload required.
Current Method We Are Trying:
“Secretary Problem” formula/method seems to do exactly what we
need, but doesn’t perform as well with our set of Original Numbers.
Since the Secretary Problem assumes equally distributed data, we are
speculating that our Original Numbers must not be equally distributed.
Conclusion:
Perhaps some alteration of the Secretary Problem can be made to
achieve the Goal Desired.
Sample Data to test with:
A large sample of Original Numbers can be seen here:
http://www.crimefreesoftware.com/DataSet1c_8600datapoints.csv
This is a collection of 860 sets of 10 Original Numbers, shown in one
large series of 8,600 Original Numbers.
We have a number of large collections of data sets that can be used to
back-test your formulas. If you want more data, just let us know and
we will provide links to it.
This is a very important problem that we need help solving. If you
can help provide the solution that achieves the goals outlined above
using the datasets we provide, we are happy to pay you $1000 for your
successful solution. Any help is appreciated.
Thank you,
Dan
crimefreesoftw...@xxxxxxxxx
Looking at your data it's very cyclical in nature. So while the trend
is ascending then the minimum of the set will most likely be first
number. If the trend is descending then the minimum will most likely
be the last number of the set. Using your dataset I knocked up a very
crude algorithm in a spreadsheet that waits until the 5th number of a
set appears and compares it to the average of the first four numbers.
If n5>avg(n1:n4) then trend=up so choose n1. If n5<avg(n1:n4) then
trend=down so choose n10. Using this method my average of the selected
numbers was 13467 compared to the average of all original numbers
which was 13470.
I'm not sure if this is exactly what you're after since my
"algorithm" (crude as it is) needs to wait for 5 numbers in a set to
appear before it can make a "decision" about the trend. I think what
you're after is an algorithm that can make a decision on EVERY number?
Anyway, hope that helps.- Hide quoted text -
- Show quoted text -
.
- References:
- Re: $1000 to the first person who can solve this problem
- From: Stan Devia
- Re: $1000 to the first person who can solve this problem
- Prev by Date: Re: $1000 to the first person who can solve this problem
- Next by Date: Re: $1000 to the first person who can solve this problem
- Previous by thread: Re: $1000 to the first person who can solve this problem
- Next by thread: Re: $1000 to the first person who can solve this problem
- Index(es):
Relevant Pages
|