Download software Tutorial videos
Subscription & data-feed pricing Class schedule


New account application Trading resources
Margin rates Stock & option commissions

Attention: Discussion forums are read-only for extended maintenance until further notice.
Welcome Guest, please sign in to participate in a discussion. Search | Active Topics |

Calling all higher-math inclined traders to help with the dreaded gap! Rate this Topic:
Previous Topic · Next Topic Watch this topic · Print this topic ·
davidjohnhall
Posted : Wednesday, April 9, 2008 1:03:32 AM

Registered User
Joined: 6/6/2005
Posts: 1,157



I was hoping someone could help me arrive at a statistical probability.  I'm just not sure how to go about it.

I ran a backtest of the stocks in the Russell 1000 today to determine the frequency of gaps going back over the last 10 years.  I looked at gaps of 5%, 5-10%, 10-15%, 15-20% and finally those greater than 20%.  I wanted to see how many times gaps of these sizes had taken place with the hope of discerning a statistical probability of these events reoccurring going forward. 

5% Gaps
1914 times

5-10% Gaps
3077 times

10-15% gaps
692 times

15-20% gaps
219 times

20% or greater
230 times

Now, I know we have 2500 days, and I also know there are 1000 stocks, and for each percentage group we have a number of occurrences.  Could anyone give me the probability of reoccurrence?

I initially began this exercise because downside gaps of magnitude seem to be a legitimate concern of many traders, myself included.  It would be nice to know exactly how rare or frequent they may be.  I realize I may need to give more information, but at this time I don't know what that would be.

Thanks in advance,

David John Hall

diceman
Posted : Wednesday, April 9, 2008 2:00:05 AM
Registered User
Joined: 1/28/2005
Posts: 6,049
I believe if you had 1000 stocks and 2500 days.
 
Then you had (1000*2500) 2500000 opportunities
for a gap.  Taking all gaps:
 
1914+3077+692+219+230=6132
 
So you had 6132 gaps out of 2500000 try's.
 
Approx .245 percent of the time.
------------------------------------------------------------
 
So if you watched the SP500 for 1 year.
 
(500*250)=125000
 
125000*.00245=306.25
 
you would expect to see 306 gaps.
 
At least that the way I think it goes.
 
 
Thanks
diceman
 
scottnlena
Posted : Wednesday, April 9, 2008 2:06:21 AM

Registered User
Joined: 4/18/2005
Posts: 4,090
QUOTE (diceman)
I believe if you had 1000 stocks and 2500 days.

 

Then you had (1000*2500) 2500000 opportunities

for a gap.  Taking all gaps:

 

1914+3077+692+219+230=6132

 

So you had 6132 gaps out of 2500000 try's.

 

Approx .245 percent of the time.

------------------------------------------------------------

 

So if you watched the SP500 for 1 year.

 

(500*250)=125000

 

125000*.00245=306.25

 

you would expect to see 306 gaps.

 

At least that the way I think it goes.

 

 

Thanks

diceman

 


Where does the red above figure come from? I'm not a math inclined person so wont' be contributing but I liek the "puzzles" and to think about it and learn etc etc.  i did love a statistical math class I took several years ago.. but forget most of it by now.
scottnlena
Posted : Wednesday, April 9, 2008 2:14:48 AM

Registered User
Joined: 4/18/2005
Posts: 4,090
what about positive gaps?  They are out there as well. There is the one CowGirl caught last year.  I recently rode the ILMN gap up.  I think it safe to say they aren't the statistical norm.. but they happen enough that it's a matter of time before you get one or the other.

conversely I was also a holder for TZOO's 4/25 gap down.
diceman
Posted : Wednesday, April 9, 2008 7:20:51 AM
Registered User
Joined: 1/28/2005
Posts: 6,049
 "Where does the red above figure come from?"
------------------------------------------------------------
 
The red comes from:
 
6132/2500000=.245%
 
It should also be noted no adjustment was made
for 2 gaps in a row. If you had a rule of no gap
after a gap you would have to reduce the amount
of attempts. (the number is probably too small
to matter)
 
by the way. This works out to 1 gap every approx.
408 try's.
(2500000/6132)=407.69
 
How you see the gaps would depend on what you
were following.
 
1 Stock for 408 days you should see 1 gap.
 
25 stocks (408/25)=16.32 you should see a
gap every approx 16 days.
 
100 stocks a gap every approx 4 days.
-----------------------------------------------------------------------
 
One thing to note. This assumes a linear relationship.
I'm guessing in the real world the gaps happened in
clusters. (there were dry periods, then periods of
more gaps)
 
Its possible "length of days without a gap" could be
important for trading but that would depend on how
evenly dispersed they are.
 
 
Thanks
diceman
 
davidjohnhall
Posted : Wednesday, April 9, 2008 7:45:40 AM

Registered User
Joined: 6/6/2005
Posts: 1,157
Thanks Diceman for your help.  That is exactly the information I was looking for.

David John Hall
diceman
Posted : Wednesday, April 9, 2008 8:24:30 AM
Registered User
Joined: 1/28/2005
Posts: 6,049
One thing to note. The 1000 as listed in telechart
is actually 982 stocks.
 
 
 
Thanks
diceman
 
scottnlena
Posted : Wednesday, April 9, 2008 9:49:19 AM

Registered User
Joined: 4/18/2005
Posts: 4,090
Thanks.. I see it now.
akqj10
Posted : Thursday, April 10, 2008 3:51:04 AM
Registered User
Joined: 2/7/2006
Posts: 218

I will assume that it is only useful to pick a stock that gaps since we want to profit from them...lets take the 5% scenario...so we have 6132 gaps.  Now that is out of 1000 stocks and thats over 10 years which is 364250 or so days assuming average years.  Im assuming we want the probability of choosing a gap

so we want to choose a single stock on a single day that it gaps that gap has to be a minimum of 5 %.

that is 3 seperate things happening at the same time

so (A)=gap (B)=day (C)=stock

so if (A), (B), and (C) are mutually independant events (all pairs of events are independant), then the probability that (A), (B), and (C) all occur is

P((A)intersect (B) intersect (C)) = PPP

so [(1/6132) * (1/1000) *(1/364250) ] = 4.5e -13 or 1/2233581000000

the previous situation was (A) U (B) U (C)

diceman
Posted : Thursday, April 10, 2008 7:37:38 AM
Registered User
Joined: 1/28/2005
Posts: 6,049
"lets take the 5% scenario...so we have 6132 gaps.  Now that is out of 1000 stocks and thats over 10 years which is 364250 or so days assuming average years."
---------------------------------------------------------------------------------------------
 
The 5% scenario was 1914 Gaps.
(6132 is all gaps)
 
 
There are approx. 250 trading days in a year.
(even though the calendar is 365 we cant trade
on weekends and holidays) 
 
 
Thanks
diceman
 
 
bustermu
Posted : Thursday, April 10, 2008 8:46:07 AM
Registered User
Joined: 1/1/2005
Posts: 2,645

davidjohnhall,

I find your results most interesting and a bit surprising (to me).  I have two questions and some observations.

1)  How did you measure the size of a downside gap?  I assume it is:

-100*(H0-L1)/L1

2)  Suppose we divide the magnitude of downside gaps into equal intervals on the Logarithmic Scale.  Then, starting with 0 to 5%, we have:

0.00 to 5.00
5.00 to 9.75
9.75 to 14.26
14.26 to 18.55
18.55 to 22.62
etc. 

What I would expect is that the number of gaps falling in each these intervals would be decreasing.  Your results indicate that this is not the case.  In your 5% category, did you include all gaps in the interval 0 to 5%?

3)  Your results indicate that when downside gaps occur they tend to be significant in magnitude, i.e., the probability density function of their magnitude is not monotonic decreasing.
 
Thanks,
Jim Murphy

davidjohnhall
Posted : Thursday, April 10, 2008 11:05:54 AM

Registered User
Joined: 6/6/2005
Posts: 1,157
Hi Jim,

For the first category of gaps I looked at gaps of only 5%.  I began by looking at gaps of 0-5% but there were over 65,000 of them.  I then decided to start with gaps of 2-5%.  Once again there were over 65,000 of them (my backtesting software creates an excel spreadsheet for each backtest and it maxes out at 65,000 lines of data).  To remedy this, I decided that, for my little experiment, I would be happy starting with gaps of 5% where 0<l1*.955 and 0>l1*.95.  So technically the first set is for gaps between 4.5% and 5%.

What I really was asking is, which mathematical factors would I have to include to get a model (even a crude one) for determining, based on historical occurences, the probability that a certain event would repeat in the future. 

As far as I'm concerned, trading has a lot to do with psychology, and (for me and maybe others) the dreaded GAP down is like the proverbial monster under the bed.  What I wanted to do was create a mathematical picture of what these GAPS really are and how often they might show up so that information can be digested by the psyche and understood on a more rational basis.

Any help you could offer would be greatly appreciated.  Diceman offered a number in an earlier thread.  Have you read that number?  Would you agree with it?  Or do you have an alternative?

David John Hall
bustermu
Posted : Thursday, April 10, 2008 2:43:22 PM
Registered User
Joined: 1/1/2005
Posts: 2,645
davidjohnhall,

Thanks for the response.

I could hardly have been more wrong in my interpretation of your tests.

1)  For your tests, a downside gap is a gap down from the previous price bar low to the present price bar opening.  It is not a gap down from the previous price bar low to the present price bar high.

2)  The 5% category is the interval 4.5 to 5.0%.  It is not the interval 0 to 5%.

3)  In light of my now corrected perception of your tests, I am no longer surprised by the results.

I do not disagree with the numbers diceman gave; although, I doubt he knew their meaning at the time he gave them.  But, he does now. 

Thanks,
Jim Murphy 
akqj10
Posted : Thursday, April 10, 2008 7:47:22 PM
Registered User
Joined: 2/7/2006
Posts: 218
please tell me if im thinking backwards, i got the days wrong but if we are looking for a 5% gap up then a 20 % gap up would work also, because 20 includes 5,  that is why i took all gaps.  so replace 365250 with 2500 and run it again.  we have to change the first number also because we are choosing a gap out of a number of choices.

so probability of choosing any gap in any stock on a any day

6132/2500000 x 1/1000 x 1/2500 

so i pick a day, pick a stock, and look for a gap, that is different than watching a specific stock every day, but you still could not see a gap, and different then just counting the number of gaps there was.

the difference is in rolling 3 white dice as opposed to 3 dice, one red, one green, and one blue and wanting specific numbers on each color.

google search probability addition and multiplication

"AND" or Intersections

Independent Events

Two events are independent if the occurrence of one does not change the probability of the other occurring.

An example would be rolling a 2 on a die and flipping a head on a coin. Rolling the 2 does not affect the probability of flipping the head.

If events are independent, then the probability of them both occurring is the product of the probabilities of each occurring.

Specific Multiplication Rule

Only valid for independent events

P(A and B) = P(A) * P(B)




"OR" or Unions

Mutually Exclusive Events

Two events are mutually exclusive if they cannot occur at the same time. Another word that means mutually exclusive is disjoint.

If two events are disjoint, then the probability of them both occurring at the same time is 0.

Disjoint:  P(A and B) = 0

If two events are mutually exclusive, then the probability of either occurring is the sum of the probabilities of each occurring.

Specific Addition Rule

Only valid when the events are mutually exclusive.

P(A or B) = P(A) + P(B)
akqj10
Posted : Thursday, April 10, 2008 8:05:13 PM
Registered User
Joined: 2/7/2006
Posts: 218
I was thinking that next question was coming

conditional probability is the P(A given B) = P(A and B) / P( B )  

Google bayes' theorm or conditional probability
davidjohnhall
Posted : Thursday, April 10, 2008 8:30:28 PM

Registered User
Joined: 6/6/2005
Posts: 1,157
ak -- thanks -- I had a feeling some crazy math that I don't understand would come into play.

A couple things...

We're talking gap downs not gap ups.
Also, I'm looking at gaps of different magnitudes.  So gaps in the 15-20% range would not include gaps in the 5-10% range.   So the numbers in my test by that range are for that range only, not including the smaller gaps.

Thanks for all of your input.  I am reading up on Bayes' theorm.  That should keep me busy for quite some time.
Users browsing this topic
Guest-1

Forum Jump
You cannot post new topics in this forum.
You cannot reply to topics in this forum.
You cannot delete your posts in this forum.
You cannot edit your posts in this forum.
You cannot create polls in this forum.
You cannot vote in polls in this forum.