Abstract

This is an academic research by apply R statistics analysis to an agency A of an existing betting consultancy firm A. According to the Dixon and Pope (2004)11 Kindly refer to 24th paper in Reference for industry knowdelege and academic research portion for the paper. in 7.4 References, due to business confidential and privacy I am also using agency A and firm A in this paper. The purpose of the anaysis is measure the staking model of the firm A. For more sample which using R for Soccer Betting see http://rpubs.com/englianhu. Here is the references of rmarkdown and An Introduction to R Markdown. You are welcome to read the Tony Hirst (2014)22 Kindly refer to 1st paper in Reference for technical research on programming and coding portion for the paper. in 7.4 References if you are getting interest to write a data analysis on Sports-book.

1. Introduction to the Betting Stategics

Section 1.1 Introducing Betting Strategies - Introduce Betting Strategies
Section 1.2 Value Betting - Odds Price and Overrounds Changared by Bookmakers
Section 1.3 Professional Gambler - Punters’ life and How Hedge Fund Works

2. Data

Section 2.1 Collect and Reprocess the Data - Data from Firm A
Section 2.2 Overrounds / Vigorish - Odds Price and Overrounds Changared by Bookmakers

3. Summarise the Staking Model

Section 3.1 Summarise Diversified Periodic Stakes - Summarise the Stakes and Return
Section 3.2 Summarise the Staking Handicap - Summarise the Staking Handicap Breakdown
Section 3.3 Summarise the Staking Prices - Summarise the Staking Price Range Breakdown
Section 3.4 Summarise the In-Play Staking Timing - Summarise the In-Play Staking Breakdown by Time Range
Section 3.5 Summarise the In-Play Staking Based on Current Score - Summarise the In-Play Staking Breakdown by Current Score

4. Staking Ⓜodel

Section 4.1 Basic Equation - Analyse the Odds Price and Probabilities
Section 4.2 Linear Ⓜodel - Reversed Engineer to get the EMOdds derived from Stakes
Section 4.3 Kelly Ⓜodel - Test the Kelly Model.
- Section 4.3.1 Basic Kelly Ⓜodel - Basic Kelly model.
- Section [4.3.2 Weight Function] - Fractional Kelly model.
- Section [4.3.3 weighted Fractional Kelly Ⓜodel] - Weighted Fractional Kelly model.
- Section 4.3.4 Dynamic Fractional Kelly Ⓜodel - Dynamic Weighted Fractional Kelly model.
- Section 4.3.5 Bank Roll - Bank roll listing across the Kelly models.
Section 4.4 Poisson Ⓜodel - Soccer Scores Odds Modelling.
- Section 4.4.1 Niko Marttinen (2001) - Creating a Profitable Sportsbook Betting Modelling.
- Section 4.4.2 Dixon and Coles (1996) - Odds Modelling and Betting Stretagy.
- Section 4.4.3 ®γσ, Eng Lian Hu (2016) - Odds Modelling and Testing Efficiency in Sportsbook Betting.
- Section 4.4.4 Combination Handicap - Descrete result of handicap and P&L.
Section 4.5 Staking Ⓜodel and Ⓜoney Ⓜanagement - Simulate the staking model. and fund portfolio management
- Section 4.5.1 Risk Management - Introduce to Investment Fund Management
- Section 4.5.2 Markowitz Portfolio - Optimal Investment Fund Portfolio
- Section 4.5.3 Optimal Kelly Portfolio
- Section 4.5.4 Investors’ Fund Refill
Section 4.6 Expectation Ⓜaximization and Staking Simulation - Enhance by weighted function on Staking model and Simulation.
- Section 4.6.1 Truncated Bivariate Normal Distribution - Test the best fit random scoring models.
- Section 4.6.2 Resampling Scores and Stakes - Application of resampling method on Kelly portfolio to get the optimal return from investment.

4.1 Basic Equation

Before we start modelling, we look at the summary of investment return rates.

Profit and Loss of Investment

Annual Stakes and Profit and Loss of Firm A at Agency A (2011-2015) ($0,000)

table 4.1.1 : 5 x 5 : Return of annually investment summary table.33 Kindly refer to the list of colors via Dark yellow with hexadecimal color code #9B870C for plot the stylist table.

\[\Re = \sum_{i=1}^{n}\rho_{i}^{EM}/\sum_{i=1}^{n}\rho_{i}^{BK} \cdots equation\ 4.1.1\]

$\Re$ is the edge or so call advantage for an investment. The $\rho_i^{EM}$ is the estimated probabilities which is the calculated by firm A from match 1,2… until $n$ matches while $\rho_{i}^{BK}$ is the net/pure probability (real odds) offer by bookmakers after we fit the equation 4.1.2 into equation 4.1.1.

\[\rho_i = P_i^{Lay} / (P_i^{Back} + P_i^{Lay}) \cdots equation\ 4.1.2\]

$P_i^{Back}$ and $P_i^{Lay}$ is the backed and layed fair price offer by bookmakers.

We can simply apply equation above to get the value $\Re$. From the table above we know that the EMPrice calculated by firm A invested at a threshold edge (price greater) 2.20%, 3.19%, 0.64%, 3.87%, 0.54% than the prices offer by bookmakers. There are some description about $\Re$ on Dixon and Coles (1996)44 Kindly refer to 25th paper in Reference for industry knowdelege and academic research portion for the paper. under 7.4 References. The optimal value of $\rho_{i}$ (rEMProbB) will be calculated based on bootstrapping/resampling method in section 4.3 Kelly Ⓜodel.

Now we look at the result of the soccer matches prior to filter out for further modelling from this section.

Profit and Loss of Investment

Stakes and Profit and Loss of Firm A at Agency A (2011~2015) ($0,000)

table 4.1.2 : 7 x 8 : Summary of betting results.

The table above summarize the stakes and return on soccer matches result. Well, below table list the handicaps placed by firm A on agency A. Due to the Cancelled result is null observation in descrete data modelling and we cannot be count into our model. Here Ifilter out the particular observation from the data from here and now the total observation of the dataset became 41055.

CORRECTION : need to keep the cancelled matches as the “push” to count in the probability of the cancellation betslip as well which is occurred in real life.

table 4.1.3 : 41055 x 66 : Odds price and probabilities sample table.

Above table list a part of sample odds prices and probabilities of soccer match $i$ while $n$ indicates the number of soccer matches. We can know the values rEMProbB, netProbB and so forth.

graph 4.1.1 : A sample graph about the relationship between the investmental probabilities -vs- bookmakers’ probabilities.

Graph above shows the probabilities calculated by firm A to back against real probabilities offered by bookmakers over 41055 soccer matches.

I list the handicap below prior to test the coefficient according to the handicap in next section 4.2 Linear Ⓜodel.

table 4.1.4 : 8 x 6 : The handicap in sample data.

4.3 Kelly Ⓜodel

diagram 4.3.0 : The overview of Kelly models.

4.3.2 Fractional Kelly Ⓜodel

Now we try to fit a weight function into basic Kelly model to be fractional Kelly model. I try to use log to test the maximum value of weight parameter. You can just simply use $w = \frac{1}{4}$ or $log(w) = \frac{1}{2}$ while $w$ is a vector. Please be mind that the value greater than 1 will be risky since involve leverage and lesser will be more conservative.

\[log(w_{i}) + log(\rho_{i}) \cdots equation\ 4.3.2.1\]

From Niko Marttinen (2001), we can know the full-Kelly generates couple times profit compare to fractional Kelly-models. However there has two points need to be enhanced.

The high risk at the beginning period of investment.
Test on different level of edge and concludes that the 145% generates the highest return.

Below Fabián Enrique Moya (2012) also test the fractional Kelly models with diversify money management methods.

paper 4.3.2.1 : Fabián Enrique Moya (2012)

League Stakes Profiling

Stakes of Firm A at Agency A (2011~2015) ($0,000)

table 4.3.2.1 : 177 x 6 : League stakes profiling of firm A year 2011~2015.

Above league risk profile suppose to stores the maximum bet for every single league but I only randomly select 6 leagues as sample. However due to I’ve not yet write a function for real time API3030 There are a lot of real time XML odds price and staking softwares similar with 4lowin2 which was states at the begining section in Part I with operators and test the maximum stakes per bet therefore here I reverse the mean value as the baseline stakes for every single league with a certain range of standard deviation for resampling simulation in later section.

Stakes based reversed Kelly models

Basic Fractional Models

Stakes based reversed Kelly models are the application of the parameter from reversion of the stakes where add-on some modified version Kelly models. I tried to adjust the stakes to get the outcome of PL result.

Table 4.3.2.2A : Summary Table of Various Kelly Models (Stakes reversed based models)

TimeUS	DateUS	Sess	League	Stakes	HCap	HKPrice	EUPrice	Result	Return	PL	PL.R	Rebates	RebatesS	rRates	netEMEdge	netProbB	netProbL	rEMProbB	rEMProbL	weight.stakes	weight	PropHKPriceEdge	PropnetProbBEdge	KProbHKPrice	KProbnetProbB	KProbFixed	KProbFixednetProbB	KEMProb	KEMProbnetProbB	KProbHalf	KProbHalfnetProbB	KProbQuarter	KProbQuarternetProbB	KProbAdj	KProbAdjnetProbB	KHalfAdj	KHalfAdjnetProbB	KEMQuarterAdj	KEMQuarterAdjnetProbB	KStakesHKPriceEdge	KStakesnetProbBEdge	KStakesHKPrice	KStakesnetProbB	KStakesFixed	KStakesFixednetProbB	KStakesEMProb	KStakesEMProbnetProbB	KStakesHalf	KStakesHalfnetProbB	KStakesQuarter	KStakesQuarternetProbB	KStakesAdj	KStakesAdjnetProbB	KStakesHalfAdj	KStakesHalfAdjnetProbB	KStakesEMQuarterAdj	KStakesEMQuarterAdjnetProbB	KReturnHKPriceEdge	KReturnnetProbBEdge	KReturnHKPrice	KReturnnetProbB	KReturnFixed	KReturnFixednetProbB	KReturnEMProb	KReturnEMProbnetProbB	KReturnHalf	KReturnHalfnetProbB	KReturnQuarter	KReturnQuarternetProbB	KReturnAdj	KReturnAdjnetProbB	KReturnHalfAdj	KReturnHalfAdjnetProbB	KReturnEMQuarterAdj	KReturnEMQuarterAdjnetProbB	KPLHKPriceEdge	KPLnetProbBEdge	KPLHKPrice	KPLnetProbB	KPLFixed	KPLFixednetProbB	KPLEMProb	KPLEMProbnetProbB	KPLHalf	KPLHalfnetProbB	KPLQuarter	KPLQuarternetProbB	KPLAdj	KPLAdjnetProbB	KPLHalfAdj	KPLHalfAdjnetProbB	KPLEMQuarterAdj	KPLEMQuarterAdjnetProbB	KPLHKPriceEdge.R	KPLnetProbBEdge.R	KPLHKPrice.R	KPLnetProbB.R	KPLFixed.R	KPLFixednetProbB.R	KPLEMProb.R	KPLEMProbnetProbB.R	KPLHalf.R	KPLHalfnetProbB.R	KPLQuarter.R	KPLQuarternetProbB.R	KPLAdj.R	KPLAdjnetProbB.R	KPLHalfAdj.R	KPLHalfAdjnetProbB.R	KPLEMQuarterAdj.R	KPLEMQuarterAdjnetProbB.R
Min. :2011-01-07 14:45:00	Min. :2011-01-07	Min. :2011	ENG PR : 1930	Min. : 0.50	Min. :-3.500	Min. :0.1800	Min. :1.180	Cancelled: 28	Min. : 0.00	Min. :-1600.0000	Min. :-1.0000	Min. :-3.480000	Min. :-5568.000	Min. :2.002	Min. :2.002	Min. :0.0384	Min. :0.0947	Min. :0.07686	Min. :-0.81204	Min. :1	Min. :1	Min. : 1.004	Min. : 1.003	Min. : 0.7101	Min. : 0.656	Min. : 1.018	Min. : 0.8478	Min. :0.2658	Min. :0.1123	Min. : 0.5651	Min. : 0.4839	Min. : 0.4062	Min. : 0.2808	Min. : 1.018	Min. : 0.8478	Min. : 0.6814	Min. : 0.5262	Min. : 0.411	Min. : 0.2758	Min. : 2.80	Min. : 2.518	Min. : 0.50	Min. : 0.50	Min. : 0.50	Min. : 0.50	Min. :0.07686	Min. :0.07686	Min. : 0.000	Min. : 0.000	Min. : 0.0000	Min. : 0.0000	Min. : 0.50	Min. : 0.50	Min. : 0.04419	Min. : 0.0186	Min. : 0.001752	Min. : 0.00001	Min. : 0.0	Min. : 0.00	Min. : 0.00	Min. : 0.00	Min. : 0.00	Min. : 0.00	Min. :0.000	Min. :0.000	Min. : 0.000	Min. : 0.000	Min. : 0.000	Min. : 0.000	Min. : 0.00	Min. : 0.00	Min. : 0.000	Min. : 0.000	Min. : 0.0000	Min. : 0.0000	Min. :-6702.224	Min. :-6702.073	Min. :-1600.0000	Min. :-1600.0000	Min. :-1600.0000	Min. :-1600.0000	Min. :-1.62007	Min. :-1.62007	Min. :-399.5146	Min. :-399.5869	Min. :-99.2718	Min. :-99.38039	Min. :-1600.0000	Min. :-1600.0000	Min. :-204.07878	Min. :-225.6129	Min. :-26.03009	Min. :-31.81324	Min. :-1.00000	Min. :-1.00000	Min. :-1.00000	Min. :-1.00000	Min. :-1.00000	Min. :-1.00000	Min. :-1.00000	Min. :-1.00000	Min. :-1.0000	Min. :-1.0000	Min. :-1.00	Min. :-1.00	Min. :-1.00000	Min. :-1.00000	Min. :-1.00000	Min. :-1.00000	Min. :-1.00000	Min. :-1.00000
1st Qu.:2012-09-07 15:00:00	1st Qu.:2012-09-07	1st Qu.:2012	FRA D2 : 1526	1st Qu.: 12.50	1st Qu.: 0.000	1st Qu.:0.7800	1st Qu.:1.780	Half Loss: 2798	1st Qu.: 0.00	1st Qu.: -18.0000	1st Qu.:-1.0000	1st Qu.:-0.040000	1st Qu.: -0.780	1st Qu.:2.017	1st Qu.:2.017	1st Qu.:0.4324	1st Qu.:0.4149	1st Qu.:0.87514	1st Qu.:-0.18917	1st Qu.:1	1st Qu.:1	1st Qu.: 12.587	1st Qu.: 12.730	1st Qu.: 6.4545	1st Qu.: 6.513	1st Qu.: 5.562	1st Qu.: 5.5095	1st Qu.:0.9349	1st Qu.:0.9288	1st Qu.: 3.4912	1st Qu.: 3.5105	1st Qu.: 2.0090	1st Qu.: 2.0105	1st Qu.: 5.562	1st Qu.: 5.5095	1st Qu.: 4.0210	1st Qu.: 4.0020	1st Qu.: 2.900	1st Qu.: 2.8978	1st Qu.: 52.86	1st Qu.: 52.636	1st Qu.: 12.50	1st Qu.: 12.50	1st Qu.: 12.50	1st Qu.: 12.50	1st Qu.:0.87514	1st Qu.:0.87514	1st Qu.: 2.574	1st Qu.: 2.575	1st Qu.: 0.0000	1st Qu.: 0.0000	1st Qu.: 12.50	1st Qu.: 12.50	1st Qu.: 1.42401	1st Qu.: 1.3993	1st Qu.: 0.162111	1st Qu.: 0.15220	1st Qu.: 0.0	1st Qu.: 0.00	1st Qu.: 0.00	1st Qu.: 0.00	1st Qu.: 0.00	1st Qu.: 0.00	1st Qu.:0.000	1st Qu.:0.000	1st Qu.: 0.000	1st Qu.: 0.000	1st Qu.: 0.000	1st Qu.: 0.000	1st Qu.: 0.00	1st Qu.: 0.00	1st Qu.: 0.000	1st Qu.: 0.000	1st Qu.: 0.0000	1st Qu.: 0.0000	1st Qu.: -74.240	1st Qu.: -74.218	1st Qu.: -18.0000	1st Qu.: -18.0000	1st Qu.: -18.0000	1st Qu.: -18.0000	1st Qu.:-0.86976	1st Qu.:-0.86976	1st Qu.: -3.9845	1st Qu.: -3.9845	1st Qu.: -0.4068	1st Qu.: -0.44459	1st Qu.: -18.0000	1st Qu.: -18.0000	1st Qu.: -2.09361	1st Qu.: -2.1492	1st Qu.: -0.24835	1st Qu.: -0.26395	1st Qu.:-1.00000	1st Qu.:-1.00000	1st Qu.:-1.00000	1st Qu.:-1.00000	1st Qu.:-1.00000	1st Qu.:-1.00000	1st Qu.:-1.00000	1st Qu.:-1.00000	1st Qu.:-1.0000	1st Qu.:-1.0000	1st Qu.:-1.00	1st Qu.:-1.00	1st Qu.:-1.00000	1st Qu.:-1.00000	1st Qu.:-1.00000	1st Qu.:-1.00000	1st Qu.:-1.00000	1st Qu.:-1.00000
Median :2013-09-21 10:00:00	Median :2013-09-21	Median :2013	GER D1 : 1464	Median : 26.00	Median : 0.750	Median :0.9300	Median :1.930	Half Win : 3052	Median : 16.00	Median : 0.0000	Median : 0.0000	Median : 0.000000	Median : 0.000	Median :2.044	Median :2.044	Median :0.5105	Median :0.4895	Median :1.03411	Median :-0.03411	Median :1	Median :1	Median : 26.148	Median : 26.495	Median : 13.1134	Median : 13.274	Median : 8.005	Median : 8.1081	Median :1.0165	Median :1.0169	Median : 6.8155	Median : 6.8944	Median : 3.6711	Median : 3.6931	Median : 8.005	Median : 8.1081	Median : 5.7454	Median : 5.7943	Median : 4.139	Median : 4.1400	Median : 110.34	Median : 110.275	Median : 26.00	Median : 26.00	Median : 26.00	Median : 26.00	Median :1.03411	Median :1.03411	Median : 6.045	Median : 6.048	Median : 0.8272	Median : 0.8439	Median : 26.00	Median : 26.00	Median : 3.05504	Median : 3.0992	Median : 0.359275	Median : 0.37422	Median : 68.3	Median : 68.34	Median : 16.00	Median : 16.00	Median : 16.00	Median : 16.00	Median :1.253	Median :1.253	Median : 3.242	Median : 3.216	Median : 0.000	Median : 0.000	Median : 16.00	Median : 16.00	Median : 1.788	Median : 1.708	Median : 0.1975	Median : 0.1781	Median : 0.000	Median : 0.000	Median : 0.0000	Median : 0.0000	Median : 0.0000	Median : 0.0000	Median : 0.00000	Median : 0.00000	Median : 0.0000	Median : 0.0000	Median : 0.0000	Median : 0.00000	Median : 0.0000	Median : 0.0000	Median : 0.00000	Median : 0.0000	Median : 0.00000	Median : 0.00000	Median : 0.00000	Median : 0.00000	Median : 0.00000	Median : 0.00000	Median : 0.00000	Median : 0.00000	Median : 0.00000	Median : 0.00000	Median : 0.0000	Median : 0.0000	Median : 0.00	Median : 0.00	Median : 0.00000	Median : 0.00000	Median : 0.00000	Median : 0.00000	Median : 0.00000	Median : 0.00000
Mean :2013-08-06 03:16:01	Mean :2013-08-05	Mean :2013	ITA D1 : 1412	Mean : 40.53	Mean : 1.075	Mean :0.9443	Mean :1.944	Loss :14374	Mean : 41.40	Mean : 0.8713	Mean : 0.0332	Mean : 0.001888	Mean : -0.123	Mean :2.033	Mean :2.033	Mean :0.5056	Mean :0.4944	Mean :1.02799	Mean :-0.02799	Mean :1	Mean :1	Mean : 39.896	Mean : 40.749	Mean : 19.8809	Mean : 20.292	Mean : 8.766	Mean : 9.1770	Mean :1.0017	Mean :0.9922	Mean : 10.2006	Mean : 10.3990	Mean : 5.3605	Mean : 5.4523	Mean : 8.766	Mean : 9.1770	Mean : 6.2743	Mean : 6.4657	Mean : 4.496	Mean : 4.5732	Mean : 168.78	Mean : 168.787	Mean : 40.53	Mean : 40.53	Mean : 40.53	Mean : 40.53	Mean :1.02799	Mean :1.02799	Mean : 9.582	Mean : 9.583	Mean : 1.8106	Mean : 1.8265	Mean : 40.53	Mean : 40.53	Mean : 4.73391	Mean : 4.8490	Mean : 0.562677	Mean : 0.61187	Mean : 172.5	Mean : 172.47	Mean : 41.40	Mean : 41.40	Mean : 41.40	Mean : 41.40	Mean :1.063	Mean :1.063	Mean : 9.781	Mean : 9.783	Mean : 1.842	Mean : 1.858	Mean : 41.40	Mean : 41.40	Mean : 4.833	Mean : 4.949	Mean : 0.5742	Mean : 0.6242	Mean : 3.684	Mean : 3.684	Mean : 0.8717	Mean : 0.8717	Mean : 0.8717	Mean : 0.8717	Mean : 0.03474	Mean : 0.03474	Mean : 0.1994	Mean : 0.1992	Mean : 0.0310	Mean : 0.03177	Mean : 0.8717	Mean : 0.8717	Mean : 0.09928	Mean : 0.1003	Mean : 0.01157	Mean : 0.01232	Mean : 0.03322	Mean : 0.03322	Mean : 0.03322	Mean : 0.03322	Mean : 0.03322	Mean : 0.03322	Mean : 0.03322	Mean : 0.03322	Mean : 0.0328	Mean : 0.0329	Mean : 0.03	Mean : 0.03	Mean : 0.03322	Mean : 0.03322	Mean : 0.03322	Mean : 0.03322	Mean : 0.03322	Mean : 0.03322
3rd Qu.:2014-09-18 15:05:00	3rd Qu.:2014-09-18	3rd Qu.:2014	SPA D1 : 1331	3rd Qu.: 50.00	3rd Qu.: 2.250	3rd Qu.:1.0800	3rd Qu.:2.080	Push : 3778	3rd Qu.: 53.55	3rd Qu.: 20.3000	3rd Qu.: 0.8500	3rd Qu.: 0.040000	3rd Qu.: 1.080	3rd Qu.:2.052	3rd Qu.:2.052	3rd Qu.:0.5851	3rd Qu.:0.5676	3rd Qu.:1.18917	3rd Qu.: 0.12486	3rd Qu.:1	3rd Qu.:1	3rd Qu.: 47.893	3rd Qu.: 48.975	3rd Qu.: 23.8376	3rd Qu.: 24.332	3rd Qu.:10.885	3rd Qu.: 11.3964	3rd Qu.:1.0836	3rd Qu.:1.0782	3rd Qu.: 12.1869	3rd Qu.: 12.4157	3rd Qu.: 6.3529	3rd Qu.: 6.4489	3rd Qu.:10.885	3rd Qu.: 11.3964	3rd Qu.: 7.7735	3rd Qu.: 7.9939	3rd Qu.: 5.561	3rd Qu.: 5.6439	3rd Qu.: 204.54	3rd Qu.: 204.517	3rd Qu.: 50.00	3rd Qu.: 50.00	3rd Qu.: 50.00	3rd Qu.: 50.00	3rd Qu.:1.18917	3rd Qu.:1.18917	3rd Qu.: 11.806	3rd Qu.: 11.782	3rd Qu.: 2.2386	3rd Qu.: 2.2458	3rd Qu.: 50.00	3rd Qu.: 50.00	3rd Qu.: 5.69606	3rd Qu.: 5.8483	3rd Qu.: 0.681064	3rd Qu.: 0.74440	3rd Qu.: 223.1	3rd Qu.: 223.14	3rd Qu.: 53.55	3rd Qu.: 53.55	3rd Qu.: 53.55	3rd Qu.: 53.55	3rd Qu.:1.957	3rd Qu.:1.957	3rd Qu.: 12.458	3rd Qu.: 12.439	3rd Qu.: 1.980	3rd Qu.: 2.021	3rd Qu.: 53.55	3rd Qu.: 53.55	3rd Qu.: 6.160	3rd Qu.: 6.223	3rd Qu.: 0.7218	3rd Qu.: 0.7632	3rd Qu.: 85.152	3rd Qu.: 85.064	3rd Qu.: 20.2800	3rd Qu.: 20.2800	3rd Qu.: 20.2800	3rd Qu.: 20.2800	3rd Qu.: 0.92914	3rd Qu.: 0.92914	3rd Qu.: 4.6300	3rd Qu.: 4.6499	3rd Qu.: 0.5975	3rd Qu.: 0.62808	3rd Qu.: 20.2800	3rd Qu.: 20.2800	3rd Qu.: 2.36233	3rd Qu.: 2.3670	3rd Qu.: 0.27408	3rd Qu.: 0.28592	3rd Qu.: 0.85000	3rd Qu.: 0.85000	3rd Qu.: 0.85000	3rd Qu.: 0.85000	3rd Qu.: 0.85000	3rd Qu.: 0.85000	3rd Qu.: 0.85000	3rd Qu.: 0.85000	3rd Qu.: 0.8600	3rd Qu.: 0.8600	3rd Qu.: 0.88	3rd Qu.: 0.90	3rd Qu.: 0.85000	3rd Qu.: 0.85000	3rd Qu.: 0.85000	3rd Qu.: 0.85000	3rd Qu.: 0.85000	3rd Qu.: 0.85000
Max. :2015-07-19 19:45:00	Max. :2015-07-19	Max. :2015	FRA D1 : 1256	Max. :1600.00	Max. : 8.250	Max. :3.9000	Max. :4.890	Win :17025	Max. :2992.00	Max. : 1392.0000	Max. : 2.6550	Max. : 3.040000	Max. : 4864.000	Max. :2.052	Max. :2.052	Max. :0.9053	Max. :0.9616	Max. :1.81204	Max. : 0.92314	Max. :1	Max. :1	Max. :1661.906	Max. :1793.534	Max. :812.3153	Max. :876.612	Max. :69.868	Max. :111.6521	Max. :1.1772	Max. :1.1349	Max. :406.4039	Max. :438.5324	Max. :203.4483	Max. :219.4924	Max. :69.868	Max. :111.6521	Max. :48.6334	Max. :72.2019	Max. :34.213	Max. :46.6908	Max. :6702.41	Max. :6702.423	Max. :1600.00	Max. :1600.00	Max. :1600.00	Max. :1600.00	Max. :1.81204	Max. :1.81204	Max. :399.515	Max. :399.587	Max. :99.2718	Max. :99.3804	Max. :1600.00	Max. :1600.00	Max. :204.07878	Max. :225.6129	Max. :26.030094	Max. :31.81324	Max. :12533.5	Max. :12533.53	Max. :2992.00	Max. :2992.00	Max. :2992.00	Max. :2992.00	Max. :2.287	Max. :2.287	Max. :746.925	Max. :746.915	Max. :185.388	Max. :185.372	Max. :2992.00	Max. :2992.00	Max. :337.202	Max. :334.563	Max. :38.0031	Max. :40.2482	Max. : 5831.098	Max. : 5831.108	Max. : 1392.0000	Max. : 1392.0000	Max. : 1392.0000	Max. : 1392.0000	Max. : 0.98481	Max. : 0.98481	Max. : 347.5000	Max. : 347.4951	Max. : 86.2500	Max. : 86.24260	Max. : 1392.0000	Max. : 1392.0000	Max. : 156.88006	Max. : 155.6523	Max. : 17.68057	Max. : 17.40491	Max. : 2.65000	Max. : 2.65000	Max. : 2.65000	Max. : 2.65000	Max. : 2.65000	Max. : 2.65000	Max. : 2.65000	Max. : 2.65000	Max. : 2.6500	Max. : 2.6500	Max. : 2.65	Max. : 2.65	Max. : 2.65000	Max. : 2.65000	Max. : 2.65000	Max. : 2.65000	Max. : 2.65000	Max. : 2.65000
NA	NA	NA	(Other):32136	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA’s :1348	NA’s :1383	NA’s :10832	NA’s :10297	NA	NA	NA	NA	NA	NA

table 4.3.2.2A : 41055 x 112 : Summary of Stakes reversed Kelly models year 2011~2015.

From above table summary, we can know the range of risk management applicable to various adjusted Kelly models. Now we try to compare the Profit & Loss from below table.

PL of Stakes Reversed based Kelly Models

Stakes of Firm A at Agency A (2011~2015) ($0,000)

table 4.3.2.2B : 19 x 6 : PL of Stakes based reversed Kelly models year 2011~2015.

Mean with Min-Max Range Fractional Models

Due to there has no league risk management profile, here I try to use the mean value of stakes on every single league as the baseline and set the min and max value to simulate 100 times.

~~table 4.3.2.2C : 19 x 5 : Summary of Stakes reversed Kelly models (mean value of stakes with min-max range as staking adjuster) year 2011~2015.~~

~~From above table summary, we can know the range of risk management applicable to various adjusted Kelly models.~~ Now we try to compare the Profit & Loss from below table.

PL of Reversed rEMProbB Kelly Models (Mean with min-max Adjusted Stakes)

Stakes of Firm A at Agency A (2011~2015) ($0,000)

table 4.3.2.2D : 19 x 5 : PL of Stakes reversed Kelly models (mean value of stakes with min-max range as staking adjuster) year 2011~2015.

Mean with sd Range Fractional Models

Due to there has no league risk management profile, here I try to use the mean value of stakes on every single league as the baseline.

table 4.3.2.2E : 19 x 5 : Summary of Stakes reversed Kelly models (mean value of stakes with sd range as staking adjuster) year 2011~2015.

From above table summary, we can know the range of risk management applicable to various adjusted Kelly models. Now we try to compare the Profit & Loss from below table.

PL of Reversed Stakes based Kelly Models (Mean with sd Adjusted Stakes)

Stakes of Firm A at Agency A (2011~2015) ($0,000)

table 4.3.2.2F : 19 x 5 : PL of Stakes reversed Kelly models year (mean value of stakes with sd range as staking adjuster) 2011~2015.

Reversed rEMProbB based Kelly models

Basic Fractional Models

rEMProbB (real EM Probabilities Back) are the application of the parameter from reversion of the stakes where add-on some modified version Kelly models. For the EM probabilities based models, I had just simply adjusted for staking and get the different outcome of Profit & Loss.

Table 4.3.2.3 : Summary Table of Various Kelly Models (reversed rEMProbB based models)

TimeUS	DateUS	Sess	League	Stakes	HCap	HKPrice	EUPrice	Result	Return	PL	PL.R	Rebates	RebatesS	rRates	netEMEdge	netProbB	netProbL	rEMProbB	rEMProbL	weight.stakes	weight	KStakesHKPriceEdge	KStakesnetProbBEdge	KStakesHKPrice	KStakesnetProbB	KStakesFixed	KStakesFixednetProbB	KStakesEMProb	KStakesEMProbnetProbB	KStakesHalf	KStakesHalfnetProbB	KStakesQuarter	KStakesQuarternetProbB	KStakesAdj	KStakesAdjnetProbB	KStakesHalfAdj	KStakesHalfAdjnetProbB	KStakesEMQuarterAdj	KStakesEMQuarterAdjnetProbB	KReturnHKPriceEdge	KReturnnetProbBEdge	KReturnHKPrice	KReturnnetProbB	KReturnFixed	KReturnFixednetProbB	KReturnEMProb	KReturnEMProbnetProbB	KReturnHalf	KReturnHalfnetProbB	KReturnQuarter	KReturnQuarternetProbB	KReturnAdj	KReturnAdjnetProbB	KReturnHalfAdj	KReturnHalfAdjnetProbB	KReturnEMQuarterAdj	KReturnEMQuarterAdjnetProbB	KPLHKPriceEdge	KPLnetProbBEdge	KPLHKPrice	KPLnetProbB	KPLFixed	KPLFixednetProbB	KPLEMProb	KPLEMProbnetProbB	KPLHalf	KPLHalfnetProbB	KPLQuarter	KPLQuarternetProbB	KPLAdj	KPLAdjnetProbB	KPLHalfAdj	KPLHalfAdjnetProbB	KPLEMQuarterAdj	KPLEMQuarterAdjnetProbB	KPLHKPriceEdge.R	KPLnetProbBEdge.R	KPLHKPrice.R	KPLnetProbB.R	KPLFixed.R	KPLFixednetProbB.R	KPLEMProb.R	KPLEMProbnetProbB.R	KPLHalf.R	KPLHalfnetProbB.R	KPLQuarter.R	KPLQuarternetProbB.R	KPLAdj.R	KPLAdjnetProbB.R	KPLHalfAdj.R	KPLHalfAdjnetProbB.R	KPLEMQuarterAdj.R	KPLEMQuarterAdjnetProbB.R
Min. :2011-01-07 14:45:00	Min. :2011-01-07	Min. :2011	ENG PR : 1930	Min. : 0.50	Min. :-3.500	Min. :0.1800	Min. :1.180	Cancelled: 28	Min. : 0.00	Min. :-1600.0000	Min. :-1.0000	Min. :-3.480000	Min. :-5568.000	Min. :2.002	Min. :2.002	Min. :0.0384	Min. :0.0947	Min. :0.07686	Min. :-0.81204	Min. :1	Min. :1	Min. : 0.000	Min. : 0.1201	Min. :0.0000	Min. :0.0400	Min. :0.03101	Min. :0.03754	Min. :0.0000	Min. :0.0400	Min. :0.00000	Min. :3.182e-05	Min. :0	Min. :0	Min. :0.03101	Min. :0.03754	Min. :0.002024	Min. :2.487e-05	Min. :2.152e-05	Min. :2.000e-08	Min. : 0.000	Min. : 0.000	Min. :0.000	Min. :0.000	Min. :0.0000	Min. :0.0000	Min. :0.000	Min. :0.000	Min. :0.00000	Min. :0.000000	Min. :0	Min. :0	Min. :0.0000	Min. :0.0000	Min. :0.00000	Min. :0.00000	Min. :0.00000	Min. :0.00000	Min. :-9.1370	Min. :-12.039	Min. :-3.17026	Min. :-3.94433	Min. :-0.47018	Min. :-0.38763	Min. :-3.17026	Min. :-3.94433	Min. :-0.335131	Min. :-0.0979937	Min. :0	Min. :0	Min. :-0.47018	Min. :-0.38763	Min. :-0.089810	Min. :-0.121450	Min. :-0.0221153	Min. :-0.0463606	Min. :-1.00000	Min. :-1.00000	Min. :-1.00000	Min. :-1.00000	Min. :-1.00000	Min. :-1.00000	Min. :-1.00000	Min. :-1.00000	Min. :-1.000	Min. :-1.00000	Min. : NA	Min. : NA	Min. :-1.00000	Min. :-1.00000	Min. :-1.00000	Min. :-1.00000	Min. :-1.00000	Min. :-1.00000
1st Qu.:2012-09-07 15:00:00	1st Qu.:2012-09-07	1st Qu.:2012	FRA D2 : 1526	1st Qu.: 12.50	1st Qu.: 0.000	1st Qu.:0.7800	1st Qu.:1.780	Half Loss: 2798	1st Qu.: 0.00	1st Qu.: -18.0000	1st Qu.:-1.0000	1st Qu.:-0.040000	1st Qu.: -0.780	1st Qu.:2.017	1st Qu.:2.017	1st Qu.:0.4324	1st Qu.:0.4149	1st Qu.:0.87514	1st Qu.:-0.18917	1st Qu.:1	1st Qu.:1	1st Qu.: 2.508	1st Qu.: 2.3791	1st Qu.:0.7606	1st Qu.:0.7816	1st Qu.:0.30702	1st Qu.:0.35604	1st Qu.:0.7606	1st Qu.:0.7816	1st Qu.:0.00000	1st Qu.:7.755e-03	1st Qu.:0	1st Qu.:0	1st Qu.:0.30702	1st Qu.:0.35604	1st Qu.:0.077703	1st Qu.:6.961e-02	1st Qu.:1.768e-02	1st Qu.:1.298e-02	1st Qu.: 0.000	1st Qu.: 0.000	1st Qu.:0.000	1st Qu.:0.000	1st Qu.:0.0000	1st Qu.:0.0000	1st Qu.:0.000	1st Qu.:0.000	1st Qu.:0.00000	1st Qu.:0.000000	1st Qu.:0	1st Qu.:0	1st Qu.:0.0000	1st Qu.:0.0000	1st Qu.:0.00000	1st Qu.:0.00000	1st Qu.:0.00000	1st Qu.:0.00000	1st Qu.:-2.5133	1st Qu.: -2.430	1st Qu.:-0.78214	1st Qu.:-0.79968	1st Qu.:-0.30502	1st Qu.:-0.35943	1st Qu.:-0.78214	1st Qu.:-0.79968	1st Qu.: 0.000000	1st Qu.:-0.0106810	1st Qu.:0	1st Qu.:0	1st Qu.:-0.30502	1st Qu.:-0.35943	1st Qu.:-0.078520	1st Qu.:-0.078308	1st Qu.:-0.0188247	1st Qu.:-0.0178730	1st Qu.:-1.00000	1st Qu.:-1.00000	1st Qu.:-1.00000	1st Qu.:-1.00000	1st Qu.:-1.00000	1st Qu.:-1.00000	1st Qu.:-1.00000	1st Qu.:-1.00000	1st Qu.:-1.000	1st Qu.:-1.00000	1st Qu.: NA	1st Qu.: NA	1st Qu.:-1.00000	1st Qu.:-1.00000	1st Qu.:-1.00000	1st Qu.:-1.00000	1st Qu.:-1.00000	1st Qu.:-1.00000
Median :2013-09-21 10:00:00	Median :2013-09-21	Median :2013	GER D1 : 1464	Median : 26.00	Median : 0.750	Median :0.9300	Median :1.930	Half Win : 3052	Median : 16.00	Median : 0.0000	Median : 0.0000	Median : 0.000000	Median : 0.000	Median :2.044	Median :2.044	Median :0.5105	Median :0.4895	Median :1.03411	Median :-0.03411	Median :1	Median :1	Median : 3.293	Median : 3.2566	Median :1.0704	Median :1.0689	Median :0.36589	Median :0.36890	Median :1.0704	Median :1.0689	Median :0.00000	Median :1.609e-02	Median :0	Median :0	Median :0.36589	Median :0.36890	Median :0.083037	Median :9.262e-02	Median :2.016e-02	Median :2.268e-02	Median : 3.805	Median : 3.669	Median :1.115	Median :1.212	Median :0.4304	Median :0.3829	Median :1.115	Median :1.212	Median :0.00000	Median :0.009855	Median :0	Median :0	Median :0.4304	Median :0.3829	Median :0.08676	Median :0.07776	Median :0.02001	Median :0.01451	Median : 0.0000	Median : 0.000	Median : 0.00000	Median : 0.00000	Median : 0.00000	Median : 0.00000	Median : 0.00000	Median : 0.00000	Median : 0.000000	Median : 0.0000000	Median :0	Median :0	Median : 0.00000	Median : 0.00000	Median : 0.000000	Median : 0.000000	Median : 0.0000000	Median : 0.0000000	Median : 0.00000	Median : 0.00000	Median : 0.00000	Median : 0.00000	Median : 0.00000	Median : 0.00000	Median : 0.00000	Median : 0.00000	Median : 0.365	Median : 0.00000	Median : NA	Median : NA	Median : 0.00000	Median : 0.00000	Median : 0.00000	Median : 0.00000	Median : 0.00000	Median : 0.00000
Mean :2013-08-06 03:16:01	Mean :2013-08-05	Mean :2013	ITA D1 : 1412	Mean : 40.53	Mean : 1.075	Mean :0.9443	Mean :1.944	Loss :14374	Mean : 41.40	Mean : 0.8713	Mean : 0.0332	Mean : 0.001888	Mean : -0.123	Mean :2.033	Mean :2.033	Mean :0.5056	Mean :0.4944	Mean :1.02799	Mean :-0.02799	Mean :1	Mean :1	Mean : 3.413	Mean : 3.5059	Mean :1.1122	Mean :1.1558	Mean :0.35669	Mean :0.36396	Mean :1.1122	Mean :1.1558	Mean :0.04257	Mean :1.838e-02	Mean :0	Mean :0	Mean :0.35669	Mean :0.36396	Mean :0.081028	Mean :8.813e-02	Mean :1.894e-02	Mean :2.304e-02	Mean : 3.531	Mean : 3.628	Mean :1.151	Mean :1.196	Mean :0.3688	Mean :0.3761	Mean :1.151	Mean :1.196	Mean :0.04419	Mean :0.019185	Mean :0	Mean :0	Mean :0.3688	Mean :0.3761	Mean :0.08373	Mean :0.09105	Mean :0.01956	Mean :0.02380	Mean : 0.1178	Mean : 0.122	Mean : 0.03837	Mean : 0.04009	Mean : 0.01206	Mean : 0.01218	Mean : 0.03837	Mean : 0.04009	Mean : 0.001619	Mean : 0.0008099	Mean :0	Mean :0	Mean : 0.01206	Mean : 0.01218	Mean : 0.002703	Mean : 0.002913	Mean : 0.0006262	Mean : 0.0007598	Mean : 0.03327	Mean : 0.03322	Mean : 0.03342	Mean : 0.03322	Mean : 0.03322	Mean : 0.03322	Mean : 0.03342	Mean : 0.03322	Mean : 0.035	Mean : 0.03322	Mean :NaN	Mean :NaN	Mean : 0.03322	Mean : 0.03322	Mean : 0.03322	Mean : 0.03322	Mean : 0.03322	Mean : 0.03322
3rd Qu.:2014-09-18 15:05:00	3rd Qu.:2014-09-18	3rd Qu.:2014	SPA D1 : 1331	3rd Qu.: 50.00	3rd Qu.: 2.250	3rd Qu.:1.0800	3rd Qu.:2.080	Push : 3778	3rd Qu.: 53.55	3rd Qu.: 20.3000	3rd Qu.: 0.8500	3rd Qu.: 0.040000	3rd Qu.: 1.080	3rd Qu.:2.052	3rd Qu.:2.052	3rd Qu.:0.5851	3rd Qu.:0.5676	3rd Qu.:1.18917	3rd Qu.: 0.12486	3rd Qu.:1	3rd Qu.:1	3rd Qu.: 4.224	3rd Qu.: 4.4155	3rd Qu.:1.4315	3rd Qu.:1.4608	3rd Qu.:0.41479	3rd Qu.:0.37952	3rd Qu.:1.4315	3rd Qu.:1.4608	3rd Qu.:0.07768	3rd Qu.:2.670e-02	3rd Qu.:0	3rd Qu.:0	3rd Qu.:0.41479	3rd Qu.:0.37952	3rd Qu.:0.086236	3rd Qu.:1.099e-01	3rd Qu.:2.104e-02	3rd Qu.:3.260e-02	3rd Qu.: 6.271	3rd Qu.: 6.278	3rd Qu.:2.052	3rd Qu.:2.071	3rd Qu.:0.6886	3rd Qu.:0.7006	3rd Qu.:2.052	3rd Qu.:2.071	3rd Qu.:0.05820	3rd Qu.:0.034649	3rd Qu.:0	3rd Qu.:0	3rd Qu.:0.6886	3rd Qu.:0.7006	3rd Qu.:0.15889	3rd Qu.:0.16068	3rd Qu.:0.03590	3rd Qu.:0.03924	3rd Qu.: 2.9827	3rd Qu.: 2.901	3rd Qu.: 0.95674	3rd Qu.: 0.95740	3rd Qu.: 0.32312	3rd Qu.: 0.32301	3rd Qu.: 0.95674	3rd Qu.: 0.95740	3rd Qu.: 0.005910	3rd Qu.: 0.0148883	3rd Qu.:0	3rd Qu.:0	3rd Qu.: 0.32312	3rd Qu.: 0.32301	3rd Qu.: 0.074592	3rd Qu.: 0.071302	3rd Qu.: 0.0165572	3rd Qu.: 0.0168291	3rd Qu.: 0.85000	3rd Qu.: 0.85000	3rd Qu.: 0.85000	3rd Qu.: 0.85000	3rd Qu.: 0.85000	3rd Qu.: 0.85000	3rd Qu.: 0.85000	3rd Qu.: 0.85000	3rd Qu.: 0.760	3rd Qu.: 0.85000	3rd Qu.: NA	3rd Qu.: NA	3rd Qu.: 0.85000	3rd Qu.: 0.85000	3rd Qu.: 0.85000	3rd Qu.: 0.85000	3rd Qu.: 0.85000	3rd Qu.: 0.85000
Max. :2015-07-19 19:45:00	Max. :2015-07-19	Max. :2015	FRA D1 : 1256	Max. :1600.00	Max. : 8.250	Max. :3.9000	Max. :4.890	Win :17025	Max. :2992.00	Max. : 1392.0000	Max. : 2.6550	Max. : 3.040000	Max. : 4864.000	Max. :2.052	Max. :2.052	Max. :0.9053	Max. :0.9616	Max. :1.81204	Max. : 0.92314	Max. :1	Max. :1	Max. :18.221	Max. :28.7399	Max. :6.3234	Max. :9.5749	Max. :0.47018	Max. :0.38763	Max. :6.3234	Max. :9.5749	Max. :0.44300	Max. :1.535e-01	Max. :0	Max. :0	Max. :0.47018	Max. :0.38763	Max. :0.089810	Max. :1.214e-01	Max. :2.212e-02	Max. :4.636e-02	Max. :18.221	Max. :28.740	Max. :6.323	Max. :9.575	Max. :0.7735	Max. :0.8016	Max. :6.323	Max. :9.575	Max. :0.48620	Max. :0.153497	Max. :0	Max. :0	Max. :0.7735	Max. :0.8016	Max. :0.17384	Max. :0.30363	Max. :0.04928	Max. :0.12847	Max. : 3.7786	Max. : 4.863	Max. : 1.31621	Max. : 1.59310	Max. : 0.35309	Max. : 0.46989	Max. : 1.31621	Max. : 1.59310	Max. : 0.146565	Max. : 0.0395793	Max. :0	Max. :0	Max. : 0.35309	Max. : 0.46989	Max. : 0.093727	Max. : 0.196716	Max. : 0.0304529	Max. : 0.0849179	Max. : 2.65000	Max. : 2.65000	Max. : 1.95000	Max. : 2.65000	Max. : 2.65000	Max. : 2.65000	Max. : 1.95000	Max. : 2.65000	Max. : 0.970	Max. : 2.65000	Max. : NA	Max. : NA	Max. : 2.65000	Max. : 2.65000	Max. : 2.65000	Max. : 2.65000	Max. : 2.65000	Max. : 2.65000
NA	NA	NA	(Other):32136	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA’s :3	NA	NA’s :26	NA	NA	NA	NA’s :26	NA	NA’s :20781	NA	NA’s :41055	NA’s :41055	NA	NA	NA	NA	NA	NA

table 4.3.2.3A : 41055 x 94 : Summary of Reversed rEMProbB Kelly models year 2011~2015.

From above table summary, we can know the range of risk management applicable to various adjusted Kelly models. Now we try to compare the Profit & Loss from below table.

PL of Reversed rEMProbB Kelly Models

Stakes of Firm A at Agency A (2011~2015) ($0,000)

table 4.3.2.3B : 19 x 5 : PL of Reversed rEMProbB Kelly models year 2011~2015.

Mean with Min-Max Range Fractional Models

Due to there has no league risk management profile, here I try to use the mean value of stakes on every single league as the baseline and set the min and max value to simulate 100 times.

~~table 4.3.2.3C : 19 x 5 : Summary of Reversed rEMProbB Kelly models year (mean value of stakes with min-max range as staking adjuster) 2011~2015.~~

~~From above table summary, we can know the range of risk management applicable to various adjusted Kelly models.~~ Now we try to compare the Profit & Loss from below table.

PL of Reversed rEMProbB Kelly Models (Mean with Min-Max Adjusted Stakes)

Stakes of Firm A at Agency A (2011~2015) ($0,000)

table 4.3.2.3D : 19 x 5 : PL of Reversed rEMProbB Kelly models (mean value of stakes with min-max range as staking adjuster) year 2011~2015.

Mean with sd Range Fractional Models

Due to there has no league risk management profile, here I try to use the median value of stakes on every single league as the baseline.

~~table 4.3.2.3E : 19 x 5 : Summary of Reversed rEMProbB Kelly models year (mean value of stakes with sd range as staking adjuster) year 2011~2015.~~

~~From above table summary, we can know the range of risk management applicable to various adjusted Kelly models.~~ Now we try to compare the Profit & Loss from below table.

PL of Reversed rEMProbB Kelly Models (Mean with sd Adjusted Stakes)

Stakes of Firm A at Agency A (2011~2015) ($0,000)

table 4.3.2.3F : 19 x 5 : PL of Reversed rEMProbB Kelly models (mean value of stakes with sd range as staking adjuster) year 2011~2015.

4.3.3 Weighted Fractional Kelly Ⓜodels

In previous section I measure the data from 2011~2015 as static analysis. Well, now I try to seperates as annum data base and get the optimal weight value for next year use. Due to I dont know if the weight function is needed for staking models since sports consultancy firm had applied Poison models weith weight function. As we know a dice will have $\frac{1}{6}$ chance to open one of the outcome, however theoretical probabilities doesn’t not correct since there have papers which applied bernoulli distribution to test the outcome with a certain iteration, the outcome might be 0.499 and 0.501 for over under but not 0.5 for each, that is due to the some effect like the balance of the dice, the flat level of the table, the wind, momentum and etc. I don’t pretand to know and only simulate it by obseravation to guess the optimal value.

Due to fractional Kelly model is independent models (for example : half-Kelly will be half-Kelly staking model, and full-Kelly will be only full-Kelly model across the years as we made comparison in section [4.3.2 Fractional Kelly odel].), now we need to make it weighted fractional model. Similar with my prevous Rmodel which applied on Poisson model. Due to the calculation of the settlement and result on the win and loss of Asian Handicap is different with Fixed odds, the probabilities of the outcome will be descrete and the measurement of the likelihood result required in order to maximize the profit. here we need to add an additional parameter controller to adjust the staking amount on every single match.

Now I try to will simulate an enhanced Kelly model on staking which take the effect of the outcome of the result into calculation below controller parameter $\phi(r)$ fit into $equation\ 4.3.3.1$ to control the leverage ratio.3131 similar theory apply on investment portfolio while it might turn to be nested controller parameters across different soccer leagues.

\[\phi(r) = exp(w_{i}\rho_{i}) \cdots equation\ 4.3.3.1\] Where $X = x_{i,2,3...n}$ is the original staking amount by Kelly model. Meanwhile, the $r$ value is the optimal parameter controller for staking.

\[r\begin{Bmatrix} =& Win\\ =& Half\ Win\\ =& Push\\ =& Half\ Loss\\ =& Loss\end{Bmatrix} \cdots equation\ 4.3.3.2\]

Here I try to diversified the weight parameters on equation 4.3.3.2. The first year data will be the baseline for further years analysis. The $\phi(r)$ function is a contant variable which using previous years’s data to apply on current year staking model. You can also use other method to find yours.

Due to we unable foreseen the result before a soccer match started, here I tried to categorise from -1, 0.75, -0.5… 1 as a set of handicap.

table 4.3.3.1 : sample data of weighted handicap

Weighted Value Estimation for Weighted Kelly Models

Weighted Table (2011~2015)

~~table 4.3.3.1 : 5 x 8 : The static weight parameter from year 2011~2015.~~

Above table 4.3.3.13232 table 4.3.3.1 is wrong due to we cannot foreseen the result of a soccer match before kick-off, here I rewrote two kind of weight functions for handicap parameters just a static weighted parameter or constant across a year, you can simulate by apply the Epectation Maximization to get a dynamic vector of weight parameters across the soccer matches. The later section will conduct a monte carlo simulation from 2011 until 2015 to get the best fit outcome.

Stakes based reversed Kelly models

Weighted Fractional Models

Stakes based reversed Kelly models are the application of the parameter from reversion of the stakes where add-on some modified version Kelly models. I tried to adjust by add a constant weight value theta to get the outcome of PL result.

PL of Stakes Reversed based Kelly Models

Stakes of Firm A at Agency A (2012~2015) ($0,000)

table 4.3.3.2A : 19 x 27 : Summary of Stakes reversed weighted 1 Kelly models 1 year 2012~2015.

Now we try to look at a vector dres weighted values.

PL of Stakes Reversed based Kelly Models

Stakes of Firm A at Agency A (2012~2015) ($0,000)

table 4.3.3.2B : 19 x 27 : Summary of Stakes reversed weighted 2 Kelly models 2 year 2012~2015.

Reversed rEMProbB based Kelly models

Weighted Fractional Models

rEMProbB (real EM Probabilities Back) are the application of the parameter from reversion of the stakes where add-on some modified version Kelly models. For the EM probabilities based models, I had just simply adjusted by added a constant theta to get the different outcome of Profit & Loss.

PL of Reversed Prob Kelly Models

Stakes of Firm A at Agency A (2012~2015) ($0,000)

table 4.3.3.3A : 19 x 11 : Summary of Reversed rEMProbB weighted 1 Kelly models 2 year 2012~2015.

Now we try to look at a vector dres weighted values.

PL of Reversed Prob Kelly Models

Stakes of Firm A at Agency A (2012~2015) ($0,000)

table 4.3.3.3B : 19 x 11 : PL of Reversed rEMProbB weighted 2 Kelly models 2 year 2012~2015.

4.3.4 Dynamic Fractional Kelly Ⓜodel

Comparison

Due to the weighted models only analyse on year 2012~2015, here I need to summarise the static and weigthed data to do comparison from the profit and loss prior to further section.

From the tables in section 4.3.2 Fractional Kelly Ⓜodel and 4.3.3 Weighted Fractional Kelly Ⓜodels we tried to weighted the Kelly models to increase the profit. Now we try to do a dynamic weighted parameters. For the staking and money management which is portfolio management, I leave it to 4.5 Staking Ⓜodel and Ⓜoney Ⓜanagement.

Stakes based reversed Kelly models

PL of Stakes Reversed based Kelly Models

Stakes of Firm A at Agency A (2012~2015) ($0,000)

table 4.3.4.1A : 19 x 27 : PL of Reversed stakes dynamic 1 Kelly models 1 year 2012~2015.

PL of Stakes Reversed based Kelly Models

Stakes of Firm A at Agency A (2012~2015) ($0,000)

table 4.3.4.1B : 19 x 27 : PL of Reversed stakes dynamic 2 Kelly models 1 year 2012~2015.

Reversed rEMProbB based Kelly models

PL of Reversed Prob Kelly Models

Stakes of Firm A at Agency A (2012~2015) ($0,000)

table 4.3.4.2A : 19 x 11 : PL of Reversed rEMProbB dynamic 1 Kelly models 2 year 2012~2015.

PL of Reversed Prob Kelly Models

Stakes of Firm A at Agency A (2012~2015) ($0,000)

table 4.3.4.2B : 19 x 11 : PL of Reversed rEMProbB dynamic 2 Kelly models 2 year 2012~2015.

4.3.5 Bank Roll

There has few points we need to consider, there the we need to retrieve the initial investment capital $BR$:

risk averse from ruin3333 The daily lost cannot over the investment fund, otherwise will be bankrupt before growth, Kelly model is sort of risky at the beginning period but bacame stable as time goes by.
Initial invested capital
The differences of time zone (British based sports consultancy firm A)
The financial settlement time of Asian operators (daily financial settlement time 12:00PM Hong Kongnese GMT+8 Credit market with rebates)3434 Here I follow the kick-off time from GMT+8 1200 until next morning 1159 (or the American Timezone which is GMT - 4) considered as a soccer betting date. Re-categorise the soccer financial settlement date. Due to I have no the history matches dataset from bookmakers. The scrapped spbo time is not stable (always change, moreover there just an information website) where firm A is the firm who placed bets with millions HKD (although the kick-off time might also changed after placed that particular bet), therefore I follow the kick-off time of the firm A.

graph 4.3.5.1 : Sample data of bank roll and fund growth for basic Kelly model. ($1 = $10,000)

Above Graph is a basic Kelly model, we can know the initial fund size is not united.

Due to our bank roll cannot be less than 0, otherwie will be ruined. Therefore I added the initial balance of the account from the min value of variable SPL which is the balance before place bets must be more than 0. Otherwise unable to place bets. 4.5.1 Risk Management will united the initial fund size and Kelly portion across the league profiles.

The file BankRoll.csv states the profit and loss of the staking. You can see the

As I mentioned at the begining of the research paper, the stakes only reflects the profit and loss of agency A but not firm A. Firm A might have deal with 10~50 or even more agencies and the data from year 2011 is not the initial investment year. You are feel free to download the file. We will discuss the inventory management to reduce the risk.

graph 4.3.5.1 has Event label to marking a specific event on a specifi date or time while I just leave it and only mark on high volatily event dates. From BankRoll.csv we observe the end of soccer sesson in May 2011 dat %>% filter(DateUS >= '2011-05-14' & DateUS <= '2011-05-21') has a seriourly crash. We can investigate more details about the loss matches from the data (or filter the range of the bets in the data table inside Part I).

Comparison of Summarized Kelly Investment Funds

table 4.3.5.2 : Summary of 110 Kelly main funds.

From table 4.5.1.1 we can know the risk of the all investmental funds. You are feel free to browse over KellyApps for more details. You can also refer to Faster Way of Calculating Rolling Realized Volatility in R to measure the volatility of the fund as here I omit it at this stage.

shinyapp 4.3.5.1 : Kelly sportsbook investment fund. Kindly click on KellyApps to use the ShinyApp.

shinyapp 4.3.5.1 : Kelly sportsbook investment fund. Kindly click on KellyApps3535 The shinyApp contain both basic fund management and also portfolio management which is in later section 4.5.1 Risk Management. to use the ShinyApp.

4.4 Poisson Ⓜodel

4.4.1 Niko Marttinen (2001)

Data has been collected over the last four seasons in the English Premier League. These include 1997-1998, 1998-1999, 1999-2000 and 2000-2001 seasons. We have also collected the season 2000-2001 data from the main European football betting leagues, such as English Division 1, Division 2 Division 3, Italian Serie A, German Bundesliga and Spanish Primera Liga…

quote 4.4.1.1 : the dataset for the studies (source : Niko Marttinen (2001)).

Niko Marttinen (2001)3636 Kindly refer to 1th paper in Reference for industry knowdelege and academic research portion for the paper. has enhanced the Dixon and Coles (1996) which are :

Basic Poisson model : Independence Poisson model for both home and way teams with a constant home advantage parameter.
Independent home advantages model : Seperate the home advantage parameter depends on the teams accordingly.
Split season model : Split a soccer league season to be 1st half and 2nd half season.
1. Scores plus Poisson model.

From above models, the author has compare the efficiency and the best fit model for scores prediction as below.

figure 4.4.1.1 : Comparison of various Poison models (source : Niko Marttinen (2001)).

From figure 4.4.1.1 above, the author compare the deviance of the models3737 Kindly refer to Generalized Linear Models in R, Part 2: Understanding Model Fit in Logistic Regression Output, devianceTest and Use of Deviance Statistics for Comparing Models to learn about the method of comparison.

figure 4.4.1.2 : Comparison of various mixed Poison models II (source : Niko Marttinen (2001)).

figure 4.4.1.3 : Comparison of various mixed Poison models III (source : Niko Marttinen (2001)).

From above models, the author list the models and states that even though pick the worst model among the models still more accurate than bookmaker while E(Score)&Dep&Weighted is the best.

figure 4.4.1.4 : Comparison of various odds modelling models (source : Niko Marttinen (2001)).

Besides, Niko Marttinen (2001) not only choose Poison model throughly as the odds modelling model but also compare to below models :-

ELO ratings.
multinomial ordered probit model.

He concludes that the multinomial ordered probit model is the best fit model but the software for fitting is not generally available. Meanwhile, the Poisson model is more versatile than probit logit model based on the dataset accross the European soccer leagues.3838 There has a lot of papers with regard to application of logit probit models on soccer betting, might read through and made comparison with my ®Model ®γσ, Eng Lian Hu (2016). I used to read though the logit probit and there has a complicated parameters setting for various effects like : wheather, players’ condition, couch, pitch condition and even though the distance travel and the players’ stamina modelling.

You can read for more details from paper 4.3.1.1 : Niko Marttinen (2001).

4.4.2 Dixon and Coles (1996)

Here we introduce the Dixon and Coles (1996) model and its codes. You are freely learning from below links if interest.

Due to the soccer matches randomly getting from different leagues, and also not Bernoulli win-lose result but half win-lose etc as we see from above. Besides, there were mixed Pre-Games and also In-Play soccer matches and I try to filter-up the sample data to be only English soccer leagues as shinyApps. I don’t pretend to know the correct answer or the model from firm A. However I take a sample presentation Robert Johnson (2011)3939 Kindly refer to 23th paper in 7.4 References from one of consultancy firm which is Dixon-Coles model and omitted the scoring process section.

4.4.3 ®γσ, Eng Lian Hu (2016)

Below is my previous research paper which was more sophiscated than Dixon-Coles model. You can refer it and I will just omit the section as mentioned at the beginning section of this staking validation research paper.

paper 4.4.3.1 : ®γσ, Eng Lian Hu (2016)

By refer to 4.4.1 Niko Marttinen (2001), I’ve add the enhancement on Odds Modelling and Testing Inefficiency of Sports Bookmakers as future improvement while the weighted function might be enhanced soonly.

Here I cannot reverse computing from barely $\rho_i^{EM}$ without know the $\lambda_{ij}$ and $\gamma$ values. Meanwhile, the staked matches is a descrete random soccer teams accross all leagues and tournaments. Therefore I just simply use reverse EM probabilities by mean value of edge in previous section Kelly.

\[X_{ij} = pois(\gamma \alpha_{ij} \beta_{ij} ); Y_{ij} = pois(\alpha_{ij} \beta_{ij}) \cdots equation\ 4.4.1\]

4.4.4 Combination Handicap

In order to minimzie the risk, I tried to validate the odds price range invested by firm A.4040 As I used to work in AS3388 which always take bets from Starlizard where they only placed bets within the odds price range from 0.70 ~ -0.70. They are not placed bets on all odds price in same edge. The sportbook consulatancy firms might probably not place same amount of stakes on same edge, lets take example as below :-

$Odds_{em}$ = 0.40 while $Odds_{BK}$ = 0.50, The edge to firm will be 0.5 ÷ 0.4 = 1.25
$Odds_{em}$ = 0.64 while $Odds_{BK}$ = 0.80, The edge to firm will be 0.8 ÷ 0.64 = 1.25

We know above edge is same but due to the probability of occurance an event/goal at 0.4 is smaller than 0.64. In 4.3.3 Weighted Fractional Kelly Ⓜodels I tried to use a weight function to measure the effect of Win-All, Win-Half, Push, Loss-Half, Loss and also Cancelled and there has generated a higher profit.

Again, I don’t pretend to know the correct models but here I try to simulate the occurance of the combination and independent handicaps by testing below distribution.

categorise handicap set (from start of previous session until latest soccer matches as Weight function which has talked in 4.3.3 Weighted Fractional Kelly Ⓜodels)
normal distribution
binomial distribution
Poison distribution

Due to there have 110 different weighted Kelly models I test, here I put the application of distribution in specific English soccer league which will compare with my Rmodel in Application of Kelly Criterion model in Sportsbook Investment - Part II.

Comparison Chart

figure 4.4.4.1 : Binomial vs. Poisson

figure 4.4.4.2 : Difference Between Binomial and Poisson Distribution

You can know about the use of the distribution through below articles.

figure 4.4.4.1 : Binomial vs. Poisson
figure 4.4.4.2 : Difference Between Binomial and Poisson Distribution
Difference between Poisson and Binomial distributions
Statistical Distributions (e.g. Normal, Poisson, Binomial) and their uses

figure 4.4.4.3 : Diagram of network among probability distributions

From above diagram we can know the network and relationship among probability distributions. Besides, we can try to refer to below probability distribution listing attach with R codes and examples for pratice :

Probability Distributions in R (Stat 5101, Geyer)

figure 4.4.4.4 : R functions in probability distributions

Here I try to bootstrap/resampling the scores of matches of the dataset to test the Kelly model and get the mean/likelihood value. Boostrapping the scores and staking model will be falling in the following sections [4.5 Staking Ⓜodel and Ⓜoney Management] and 4.6 Expectation Ⓜaximization and Staking Simulation.

4.5 Staking Ⓜodel and Ⓜoney Ⓜanagement

4.5.1 Risk Management

graph 4.5.1.1 : Sample data of candle stick chart for fund growth of firm A via agent A. ($1 = $10,000)

Above table shows the return rates of investment fund from firm A via agent A. We know the initial unvestment fund from $47,788,740.00 growth to $405,511,999.00 within year 2011-01-07, 2015-07-19 with a high return rates 848.55%. Well, I try to apply Kelly model for stakes management which is in previous section 4.3 Kelly Ⓜodel.

From table 4.3.5.2 we can know the risk of the all investmental funds. You are feel free to browse over KellyApps4141 shinyapp 4.3.5.1 for more details.

In order to equalise the initial fund size, here I united it as $22,100.00 which is get from max value among the 110 funds.

From above shinyApps, we can know the initial fund required and the risk as well as the return of investment for us to follow the firm A with application of a time series onto the Kelly staking models.

I had built and test 110 Kelly main funds (split to indipedent fund will be altogether 6,194 funds) but now need to set a baseline for every single leagues and simulate again the steps in 4.3 Kelly Ⓜodel to made a comparison based on the portion of stakes from the initial pools for every single league as we can refer to table 4.3.2.1.

League Stakes Profiling

Staking allocation and portfolio ($0,000)

table 4.5.1.2 : 177 x 3 : A simple new league stakes profile.

table 4.5.1.2 is just a simple baseline profile for league risk management to test the Kelly models. Here I try to adjust few points to do comparison :

united the initial fund size
set a baseline staking portfolio acrosss the leagues
application of various Kelly but not reversed models

4.5.2 Markowitz Portfolio

Galema, Plantinga and Scholtens (2008)4242 You are feel free to refer to Reference for industry knowdelege and academic research portion for the paper. in 7.4 References for further details introduce Fama-MacBeth regressions4343 FM-regression is a Capital asset pricing model (CAPM) which estimate the rates of return of an asset. for portfolio management. The method estimates the betas and risk premia for any risk factors that are expected to determine asset prices. The method works with multiple assets across time (panel data). The parameters are estimated in two steps:

First regress each asset against the proposed risk factors to determine that asset’s beta for that risk factor.
Then regress all asset returns for a fixed time period against the estimated betas to determine the risk premium for each factor.

Assumptions of CAPM

All investors:

Aim to maximize economic utilities (Asset quantities are given and fixed).
Are rational and risk-averse.
Are broadly diversified across a range of investments.
Are price takers, i.e., they cannot influence prices.
Can lend and borrow unlimited amounts under the risk free rate of interest.
Trade without transaction or taxation costs.
Deal with securities that are all highly divisible into small parcels (All assets are perfectly divisible and liquid).
Have homogeneous expectations.
Assume all information is available at the same time to all investors.

Here I skip above section and will leave it as future study due to it will estimate below point which invilve in financial and accounting :

active investors, inactive investors and potential investors.
rates of return of an asset (which is my previous courses in Coursera Improving Business Finances and Operations but not yet complete the specialization)
governmental taxes.

Martin Spann and Bernd Skiera (2009)4444 Kindly refer to 19th paper in Reference for industry knowdelege and academic research portion for the paper. under 7.4 References applied a basic probability sets on the draw games and also the portion of win and loss. The author simply measured the portion of the draw result with win/loss to get the edge to place a bet. However it made a loss on Italian operator Oddset due to the 25% high vigorish but profitable in 12%. Secondly, the bets placed on fixed odds but not Asian Handicap and also a fixed amount $100.

video 4.5.2.1 : Calculated Bets - computers, gambling, mathematical modeling to Win (part 4 of 4)

table 4.3.2.1 shows a risk portfolio for every single league which is roughly similar with Parimutuel Betting but a portion among the initial fund required. In order to equalise the intial fund size. next section I will united it and also application of resampling method to get the optimal league risk profile.

paper 4.5.2.1 : Magnus Erik Hvass Pedersen (2014)

张丹 (2016)4545 Kindly refer to 7th paper inside Reference for technical research on programming and coding portion for the paper. in 7.4 References for further details provides couple of r package for investment and analysis in financial market. You can also refer to Introduction of R Packages.

4.5.3 Optimal Kelly Portfolio

4.5.4 Investors’ Fund Refill

Kelly model will be a good risk averse investment portfolio. As we know normally a mutual fund or any investment fund will advise investors credit a certain money into the pool regularly. For this section I keep it as the next study which is in Application of Kelly Criterion model in Sportsbook Investment - Part II as an dynamic staking baseline model upon injection of new fund into the pool. It will includes the :

bonus issue or dividends
refill or pump-in money into the pool
fund management and admin fees

equation 4.5.4.1 : Economic Order Quantity (EOQ)

Base on above euqation, there has some criteria as below :

$C$ is the total cycle-inventory cost per annum which is the invest or pump in figure into the investment pool.
$Q$ is the fund size which pump into the investment pool. (For example: normally the investment fund or insurance company will advise the investors regularly credit a certain money into whose investment account.)
$H = 1$ due to there has no holding cost per annum unless there is inactive account which will be charges a certain amount of the administration fee where it is not apply to active players.
$D$ is the betting stakes per annum.
$S = 1$ due to there has no setup costs per lot. (unless we count in the bank charges, for example : western union, Entropay, bank transfer fee, etc)

You are feel free to know about inventory management and Kelly fund portfolio management via Module 3: Inventory and Supply Chain Management.

4.6 Expectation Ⓜaximization and Staking Simulation

4.6.1 Truncated Bivariate Normal Distribution

Before I simulate all Kelly funds by resampling the scores, here I using few different trancated bivariate normal distribution4646 You can refer to few r packages which are MASS, mvtnorm and tmvtnorm. and the weight parameters to get the optimal randomise scores. tmvtnorm - A Package for the Truncated Multivariate Normal Distribution

Due to there has negative lambda values which unable proceed a scores based on basic bivariate normal distribution. Here I try to build few models in order to get the best fit model for soccer scores resampling.

1st bivariate normal distribution
- adjust all negative values to be 0
- measure the mean of negative values and add to all positive values as average.
2nd bivariate normal distribution
- adjust all negative values to be 0
- measure the mean of negative values and substrated by all positive values as average.
- multiply all values by the portion of baseline which is the raw dataset.
3rd bivariate normal distribution
- adjust all negative values to be 0
- apply nested normal distribution on the mean and sd of all negative values and distribute to all positive values as average in rnorm.
- apply looping to get the likehood values.
4th bivariate normal distribution
- apply truncated adjustment with set a set of lower and upper values.
- set min as lower interval and max as upper interval.
- multiply all values by the portion of baseline which is the raw dataset.
5th bivariate normal distribution
- apply truncated adjustment with set a set of lower and upper values.
- manually set min as lower interval and 3, 2.34747 3 goals for home team and 2.3 goals for away team as upper interval.
6th bivariate normal distribution
- apply truncated adjustment with set a set of lower and upper values.
- manually set 1, 04848 1 goal for home team and 0 goal for away team as lower interval and 2, 2.34949 3 goals for home team and 2.3 goals for away team as upper interval.
7th bivariate normal distribution
- apply truncated adjustment with set a set of lower and upper values.
- set min values as lower interval and max as upper interval.
- multiply all values by the portion of baseline which is the raw dataset.
8th bivariate normal distribution
- apply truncated adjustment with set a set of lower and upper values.
- set min values as lower interval and mean values as upper interval.
- apply looping to get the likehood values by stepwise adding 0.0001 to upper interval.
9th bivariate normal distribution
- apply truncated adjustment with set a set of lower and upper values.
- set 1st quantile values as lower interval and 3rd quantile values as upper interval.
- apply looping to get the likehood values by stepwise exoanding 0.0001 to both lower and upper interval.
10th bivariate normal distribution
- apply truncated adjustment with set a set of lower and upper values.
- set mean values as lower interval and 3rd quantile values as upper interval.
- apply looping to get the likehood values by stepwise exoanding 0.0001 to both lower and upper interval.

graph 4.6.1.1A : comparison of random scoring models.

graph 4.6.1.1B : comparison of random scoring models.

graph 4.6.1.1C : comparison of random scoring models.

graph 4.6.1.1D : comparison of random scoring models.

graph 4.6.1.1E : comparison of random scoring models.

graph 4.6.1.1F : comparison of random scoring models.

graph 4.6.1.1G : comparison of random scoring models.

graph 4.6.1.1H : comparison of random scoring models.

graph 4.6.1.1I : comparison of random scoring models.

graph 4.6.1.1J : comparison of random scoring models.

Truncated Bivariate Normal Distribution

Mean values and comparison among randomize bivariate scoring models

table 4.5.2.1A : 22 x 4 : Comparison of truncated bivariate normal distributions.

Truncated Bivariate Normal Distribution

variance among randomize bivariate scoring models

table 4.5.2.1B : 22 x 5 : Comparison of truncated bivariate normal distributions.

Truncated Bivariate Normal Distribution

summary among randomize bivariate scoring models

table 4.5.2.1C : 22 x 5 : Comparison of truncated bivariate normal distributions.

table 4.5.2.1D : 11 x 12 : Comparison of truncated bivariate normal distributions.

From above models, we know that opt10 is the best fit model and I’ll apply it in further section 4.6.2 Resampling Scores and Stakes. You are feel free to refer to below articles for further understanding:

4.6.2 Resampling Scores and Stakes

From the article 凯利模式资金管理5050 You might refer to Application of Kelly Criterion model in Sportsbook Investment as well. we know the application of generalization of Kelly criterion for uneven payoff games.

\[G: = \mathop {\lim }\limits_{N \to \infty } \frac{1}{N}\log\left( {\frac{{{S_N}}}{{{S_0}}}} \right) \cdots equation\ 4.3.2\]

In order to get the optimal value, I apply the bootrapping and resampling method.

\[L(\rho) = \prod_{i=1}^{n} (x_{i}|\rho) \cdots equation\ 4.3.4\]

Now we look at abpve function from a different perspective by considering the observed values $x_{1}, x_{2}, x_{3}… x_{n}$ to be fixed parameters of this function, whereas $\rho$ will be the function’s variable and allowed to vary freely; this function will be called the likelihood.

5. ®esult

Section 5.1 Comparison of the ®esults - Comparison of the Returns of Staking Models.
Section [5.2 Ⓜarket Basket] - Analyse the Hedging or Double up Invest by Firm A.

5.1 Comparison of the ®esults

5.1.1 Comparison of Fully Followed Bets

Dixon and Pope (2003) apply linear model to compare the efficiency of the odds prices offer by first three largest Firm A, B and C in UK.

BRSum <- readRDS('./KellyApps/data/BRSum.rds') %>% tbl_df

BRSum  %>% datatable(
  caption = "Table 5.1.1.1 : Summary of Kelly Main Funds ('0,000)", 
  escape = FALSE, filter = 'top', rownames = FALSE, 
  extensions = list('ColReorder' = NULL, 'RowReorder' = NULL, 
                    'Buttons' = NULL, 'Responsive' = NULL), 
  options = list(dom = 'BRrltpi', autoWidth = TRUE,  scrollX = TRUE, 
                 lengthMenu = list(c(10, 50, 100, -1), c('10', '50', '100', 'All')), 
                 ColReorder = TRUE, rowReorder = TRUE, 
                 buttons = list('copy', 'print', 
                                list(extend = 'collection', 
                                     buttons = c('csv', 'excel', 'pdf'), 
                                     text = 'Download'), I('colvis'))))

## Warning in instance$preRenderHook(instance): It seems your data is too
## big for client-side DataTables. You may consider server-side processing:
## http://rstudio.github.io/DT/server.html

table 5.1.1.1 : 2282 x 119 : Comparison of Kelly investmemt fund.

5.1.2 Comparison of Missing Bets

Due to I have no followed bets data, therefore there has no any clue but based on my previous experience in Telebiz, Caspo and Global Solution Sdn Bhd as list in my CV I try to adjust setting of 90% successful following rates and 90% of average odds compare to firm A.

~~According to above result, in order to made it workable in real life. Here I simulate whole Kelly function with monte carlo method to get the return.~~

Here I skip this sub-section due to no following bets dataset and somemore there is a profit sharing investment business.

5.2 Market Basket

By refer to ®γσ, Eng Lian Hu (2016)5151 Kindly refer to 28th paper in Reference for industry knowdelege and academic research portion for the paper. under 7.4 References, here I apply the arules and arulesViz packages to analyse the market basket of the bets.

## Loading required package: Matrix

## 
## Attaching package: 'Matrix'

## The following object is masked from 'package:tidyr':
## 
##     expand

## 
## Attaching package: 'arules'

## The following object is masked from 'package:dplyr':
## 
##     recode

## The following objects are masked from 'package:base':
## 
##     abbreviate, write

## Loading required package: grid

## 
## Attaching package: 'grid'

## The following object is masked from 'package:BBmisc':
## 
##     explode

## 
## Attaching package: 'igraph'

## The following object is masked from 'package:arules':
## 
##     union

## The following object is masked from 'package:BBmisc':
## 
##     normalize

## The following object is masked from 'package:formattable':
## 
##     normalize

## The following objects are masked from 'package:dplyr':
## 
##     as_data_frame, groups, union

## The following objects are masked from 'package:purrr':
## 
##     compose, simplify

## The following object is masked from 'package:tidyr':
## 
##     crossing

## The following object is masked from 'package:tibble':
## 
##     as_data_frame

## The following objects are masked from 'package:lubridate':
## 
##     %--%, union

## The following objects are masked from 'package:stats':
## 
##     decompose, spectrum

## The following object is masked from 'package:base':
## 
##     union

##              1860 Munchen 1FC Koln AA Gent AaB Aalborg Aachen
## 1860 Munchen           20        0       0           0      0
## 1FC Koln                0       35       0           0      0
## AA Gent                 0        0      20           0      0
## AaB Aalborg             0        0       0          14      0
## Aachen                  0        0       0           0     15

Equipped with this knowledge, let’s see what products tend to compliment each other with high lift (i.e. purchase of one product would lead to purchase of another with high probability) and what products tend to be substitutes:

##                     Total Goals - under Total Goals - over Leicester
## Total Goals - under                  NA          0.1068912 0.0000000
## Total Goals - over           0.10689122                 NA 0.3859765
## Leicester                    0.00000000          0.3859765        NA
## Liverpool                    0.09666446          0.7522156 0.0000000
## Guingamp                     0.19782493          0.5497921 0.0000000
##                      Liverpool  Guingamp
## Total Goals - under 0.09666446 0.1978249
## Total Goals - over  0.75221561 0.5497921
## Leicester           0.00000000 0.0000000
## Liverpool                   NA 0.0000000
## Guingamp            0.00000000        NA

rules <- apriori(trans3, parameter = list(sup = 0.001, conf = 0.5, target = 'rules', minlen = 1))

## Apriori
## 
## Parameter specification:
##  confidence minval smax arem  aval originalSupport maxtime support minlen
##         0.5    0.1    1 none FALSE            TRUE       5   0.001      1
##  maxlen target   ext
##      10  rules FALSE
## 
## Algorithmic control:
##  filter tree heap memopt load sort verbose
##     0.1 TRUE TRUE  FALSE TRUE    2    TRUE
## 
## Absolute minimum support count: 23 
## 
## set item appearances ...[0 item(s)] done [0.00s].
## set transactions ...[1298 item(s), 23329 transaction(s)] done [0.00s].
## sorting and recoding items ... [198 item(s)] done [0.00s].
## creating transaction tree ... done [0.00s].
## checking subsets of size 1 2 done [0.34s].
## writing ... [0 rule(s)] done [0.00s].
## creating S4 object  ... done [0.00s].

summary(rules)

## set of 0 rules

itemsets <- apriori(trans3, parameter = list(
  support = .001, minlen = 2, maxlen = 2, target = 'frequent' # to mine for itemsets
  ))

## Apriori
## 
## Parameter specification:
##  confidence minval smax arem  aval originalSupport maxtime support minlen
##          NA    0.1    1 none FALSE            TRUE       5   0.001      2
##  maxlen            target   ext
##       2 frequent itemsets FALSE
## 
## Algorithmic control:
##  filter tree heap memopt load sort verbose
##     0.1 TRUE TRUE  FALSE TRUE    2    TRUE
## 
## Absolute minimum support count: 23 
## 
## set item appearances ...[0 item(s)] done [0.00s].
## set transactions ...[1298 item(s), 23329 transaction(s)] done [0.00s].
## sorting and recoding items ... [198 item(s)] done [0.00s].
## creating transaction tree ... done [0.00s].
## checking subsets of size 1 2

## Warning in apriori(trans3, parameter = list(support = 0.001, minlen =
## 2, : Mining stopped (maxlen reached). Only patterns up to a length of 2
## returned!

##  done [0.00s].
## writing ... [1 set(s)] done [0.00s].
## creating S4 object  ... done [0.00s].

I tried to split the transaction ID by matchID but the result seems unfavorable, I’ll look into it beyond the future.

6. Conclusion

Section 6.1 Conclusion - Conclusion of this Research Paper.
Section 6.2 Future Works - Future Research or Enhancement.

6.1 Conclusion

Due to the data-sets I collected just one among all agents among couple sports-bookmakers 4lowin. Here I cannot determine if the sample data among the population…

JA : What skills and academic training (example: college courses) are valuable to sports statisticians?

KW : I would say there are three sets of skills you need to be a successful sports statistician:

Quantitative skills - the statistical and mathematical techniques you’ll use to make sense of the data. Most kinds of coursework you’d find in an applied statistics program will be helpful. Regression methods, hypothesis testing, confidence intervals, inference, probability, ANOVA, multivariate analysis, linear and logistic models, clustering, time series, and data mining/machine learning would all be applicable. I’d include in this category designing charts, graphs, and other data visualizations to help present and communicate results.

Technical skills - learning one or more statistical software systems such as R/S-PLUS, SAS, SPSS, Stata, Matlab, etc. will give you the tools to apply quantitative skills in practice. Beyond that, the more self-reliant you are at extracting and manipulating your data directly, the more quickly you can explore your data and test ideas. So being adept with the technology you’re likely to encounter will help tremendously. Most of the information you’d be dealing with in sports statistics would be in a database, so learning SQL or another query language is important. In addition, mastering advanced spreadsheet skills such as pivot tables, macros, scripting, and chart customization would be useful.

Domain knowledge - truly understanding the sport you want to analyze professionally is critical to being successful. Knowing the rules of the game; studying how front offices operate; finding out how players are recruited, developed, and evaluated; and even just learning the jargon used within the industry will help you integrate into the organization. You’ll come to understand what problems are important to the GM and other decisionmakers, as well as what information is available, how it’s collected, what it means, and what its limitations are. Also, I recommend keeping up with the discussions in your sport’s analytic community so you know about the latest developments and what’s considered the state of the art in the public sphere. One of the great things about being a sports statistician is getting to follow your favorite websites and blogs as a legitimate part of your job!

source : Preparing for a Career as a Sports Statistician: Two Interviews with People in the Field

In this Part II research paper I try to add a section which is filtered out only English soccer leagues and the revenue and profit & loss all sessional based but not annum based to make it applicable to my future staking in real world. The proportional staking and also money management on the staking pools. You are feel free to browse from the content page Betting Strategy and Model Validation.

Journey to West

The statistical analysis on sportsbook and soccereconomics is popular in US, Europe and Ocean Pacific but not yet in Asia. Here I am learning from the western professional sportsbook consultancy firms and do shares with those who like scientific analysis on soccer sports.

If you get interest to be a punter, you are feel free to read over below presentation paper from a British consultancy firm to know the requirement to be a professional gambler.

Mark Dixon

graph 6.1.1 : The pioneer of sportsbook statistical analytical ATASS — founder : Mark Dixon

You are feel free to know further about the history of Mr Mark Dixon from Boffins -vs- Bookies (The Man Who Broke the World Leading Bookmakers).

6.2 Future Works

Niko Marttinen (2001) has conducted a very detail and useful but also applicable betting system in real life. There has a ordered probit model which shows a high accuracy predictive model compare to his Poisson (Escore) model. Well, the ®γσ, Lian Hu ENG (2016)5252 The research modelling with testing the efficiency of odds price which had completed in year 2010. Kindly refer to 3rd paper in Reference for industry knowdelege and academic research portion for the paper. under 7.4 References has build a weight inflated diagonal poisson model which is more complicated and shophitiscated and later ®γσ, Lian Hu ENG (2014)5353 Kindly refer to 4th paper inside Reference for industry knowdelege and academic research portion for the paper. under 7.4 References. However there has an automatically and systematically trading system which wrote in VBA + S-Plus + Excel + SQL5454 the betting system has stated in his paper. which is very useful as reference. The author use VBA to automac the algorithmic trading while there has no Asian Handicap and Goal Line odds price data to simulate compare to mine. While currently the shinyapps with RStudioConnect can also build an algorithmic trading system. However the session timeout issue5555 The connection timeout issue might be a big issue for real time algorithmic trading might need to consider. The shinydashboard example from ｮStudio might probably cope with the issue.

John Fingleton & Patrick Waldron (1999) applied Shin model to test the portion of hedge funds and smart punters. As I stated in 4.2 Linear Ⓜodel, the sparkR, RHadoop and noSQL require in order to analyse the high volume betslips dataset. Its interesting and will conduct the research if all betslips of bookmaker(s) is(are) available in the future.

From the 4.3 Kelly Ⓜodel we test the staking model, the table 4.2.1 we apply the linear models and choose the best fit model based on the edge of odds price. 4.4 Poisson Ⓜodel we try to reverse the odds price placed to get the probabilities of scoring different scores. Now we try to test the return of staking on different handicap (ex: 0, 0.25, 0.5, 0.75, 1 etc.) to know which handicap earn the most. Nowadays the hotest matches of four major leagues provides few handicaps market, there will be another case study and research to increase the profit base on same probabilities and also edge but staking on different handicap. The dataset will be collect for research beyond the future. The effects of Win-Half and Loss-Half might probably more effective by application of Poison models since it is a descrete outcome while I take it as a known linear effects this time due to the odds price of Handicap we placed always within a range from 0.7 to 1.25.

I will be apply Shiny to write a dynamic website to utilise the function as web based apps. I am currently conducting another research on Analyse the Finance and Stocks Price of Bookmakers which is an analysis on the public listed companies and also anonymous companies revenue and profit & loss. You are welcome to refer SHOW ME SHINY and build your own shinyapps.

I will also write as a package to easier load and log.

7. Appendices

Section 7.1 Documenting File Creation - Information of the Paper.
Section 7.2 Versions’ Log - Version Log of the Paper.
Section 7.3 Speech and Blooper - Speech and Blooper during Conducting the Research.
Section 7.4 References - Reference Papers for the Paper.

7.1 Documenting File Creation

It’s useful to record some information about how your file was created.

File creation date: 2015-07-22
File latest updated date: 2017-10-15
R version 3.4.2 (2017-09-28)
R version (short form): 3.4.2
rmarkdown package version: 1.6.0.9004
tufte package version: 0.2
File version: 1.0.0
Author Profile: ®γσ, Eng Lian Hu
GitHub: Source Code
Additional session information

[1] “2017-10-15 23:03:10 JST”

7.2 Versions’ Log

File pre-release version: 0.9.0
- file created
- Applied ggplot2, ggthemes, directlabels packages for ploting. For example, the graphs applied in Section [2. Data].
File pre-release version: 0.9.1
- Added Natural Language Analysis which is research for teams’ name filtering purpose.
- Changed from knitr::kable to use datatble from DT::datatable to make the tables be dynamic.
- Changed from ggplot2 relevant packages to googleVis package to make graph dynamic.
- Completed chapter [3. Summarise the Staking Model].
File pre-release version: 0.9.2 - “2016-02-20 09:41:49 JST”
- Added Section 7.2 Versions’ Log and Section 7.3 Speech and Blooper since retest the coding
- Added Section [4. Staking Model] .
File pre-release version: 0.9.3 - “2016-02-05 05:24:35 EST”
- Modified DT::datatable to make the documents can be save as xls/csv
- Added log file for version upgraded
File pre-release version: 0.9.3.1 - 2016-06-22 13:36:33 JST
- Reviewed previous version, DT::datatable updated new version replaced Button extension from TableTools, removed sparkline and htmlwidget
- Applied linear regression to test the efficiency of staking model by consultancy firm A
File pre-release version 0.9.4 - 2016-09-28 00:15:24 JST
- Added linear regression and shinyApp to test the effects on staking
File pre-release version 0.9.5 - 2016-12-20 02:26:19 JST
- Added Kelly criterion for 110 main-funds with each has 19 sub-funds.

7.3 Speech and Blooper

Firstly I do appreciate those who shade me a light on my research. Meanwhile I do happy and learn from the research.

Due to the rmarkdown file has quite some sections and titles, you might expand or collapse the codes by refer to Code Folding and Sections for easier reading.

There are quite some errors when I knit HTML:

let say always stuck (which is not response and consider as completed) at 29%. I tried couple times while sometimes prompt me different errors (upgrade Droplet to larger RAM memory space doesn’t helps) and eventually apply rm() and gc() to remove the object after use and also clear the memory space.
Need to reload the package suppressAll(library('networkD3')) which in chunk decission-tree-A prior to apply function simpleNetwork while I load it in chunk libs at the beginning of the section 1. Otherwise cannot found that particlar function.
The rCharts::rPlot() works fine if run in chunk, but error when knit the rmarkdown file. Raised an issue : Error : rCharts::rPlot() in rmarkdown file.
xtable always shows LaTeX output but not table. Raised a question in COS : 求助！knitr Rmd pdf 中文编译 2016年8月19日下午9:56 7 楼.Here I try other packages like textreg and stargazer. You can refer to Test version to view the output of stargazer function and the source codes I reserved but added eval = FALSE in chunks named lm-summary and lm-anova to unexecute the codes.
I refer to R Shiny: Rendering summary.ivreg output and tried to plot the output table, but there has no bottom statistical information like Residual standard error, Degree of Freedom, R-Squared, F-statistical value and also p-value, therefore I use R Shiny App for Linear Regression, Issue with Render Functions which simply renderPrint() the verbatimTextOutput() in shinyapp 4.2.1.
I tried to raise an issue about post the shinyapps to RStudioConnect at Unable publish to RStudio Connect : Error in yaml::yaml.load(enc2utf8(string), …) : Reader error: control characters are not allowed: #81 at 276 #115. You might try to refer to the gif files in #issue 115 for further information. I tried couple times and find the solution but there has no an effective solution and only allowed post to ｮPubs.com where I finally decide to seperate the dynamic shinyApp into another web url.
Remark : When I rewrite Report with ShinyApps : Linear Regression Analysis on Odds Price of Stakes and would like to post to ®StudioConnect, the wizard only allowed me post to rPubs.com (but everyone know rPubs only allow static document which is not effort to support Shinyapp). Therefore kindly refer to https://beta.rstudioconnect.com/content/1766/. You might download and run locally due to web base version always affected by wizards and sometimes only view datatable but sometimes only can view googleVis while sometimes unable access.
Using formattable and plotly simultaneously and Possible namespace issue with plotly::last_plot() #41 solved the formattable issue.
The analysis in Part I might slightly different with Part II due to the timezone issue.
- The daily financial settlement time is EST 0000 or HKT 1200.
- The filtered data for observation in Part II for statistical analysis purpose while Part I is just summarise and breakdown the bets which includes all bets.
I am currently work as a customer service operator and self research as a smart punter. Hope my sportsbook hedge fund company website Scibrokes® running business soon…

Terminator II

7.4 References

Reference for industry knowdelege and academic research portion for the paper.

Reference for technical research on programming and coding portion for the paper.

Powered by - Copyright® Intellectual Property Rights of Scibrokes®個人の経営企業

Betting Strategy and Ⓜodel Validation - Part II

Betting Model Analysis on Sportsbook Consultancy Firm A

®γσ, Lian Hu 白戸則道®

2017-10-15

Abstract

1. Introduction to the Betting Stategics

2. Data

3. Summarise the Staking Model

4. Staking Ⓜodel

4.1 Basic Equation

Profit and Loss of Investment

Annual Stakes and Profit and Loss of Firm A at Agency A (2011-2015) ($0,000)

Profit and Loss of Investment

Stakes and Profit and Loss of Firm A at Agency A (2011~2015) ($0,000)

4.2 Linear Ⓜodel

4.3 Kelly Ⓜodel

4.3.1 Basic Kelly Ⓜodel

4.3.2 Fractional Kelly Ⓜodel

League Stakes Profiling

Stakes of Firm A at Agency A (2011~2015) ($0,000)

Stakes based reversed Kelly models

PL of Stakes Reversed based Kelly Models

Stakes of Firm A at Agency A (2011~2015) ($0,000)

PL of Reversed rEMProbB Kelly Models (Mean with min-max Adjusted Stakes)

Stakes of Firm A at Agency A (2011~2015) ($0,000)

PL of Reversed Stakes based Kelly Models (Mean with sd Adjusted Stakes)

Stakes of Firm A at Agency A (2011~2015) ($0,000)

Reversed rEMProbB based Kelly models

PL of Reversed rEMProbB Kelly Models

Stakes of Firm A at Agency A (2011~2015) ($0,000)

PL of Reversed rEMProbB Kelly Models (Mean with Min-Max Adjusted Stakes)

Stakes of Firm A at Agency A (2011~2015) ($0,000)

PL of Reversed rEMProbB Kelly Models (Mean with sd Adjusted Stakes)

Stakes of Firm A at Agency A (2011~2015) ($0,000)

4.3.3 Weighted Fractional Kelly Ⓜodels

Weighted Value Estimation for Weighted Kelly Models

Weighted Table (2011~2015)

Stakes based reversed Kelly models

PL of Stakes Reversed based Kelly Models

Stakes of Firm A at Agency A (2012~2015) ($0,000)

PL of Stakes Reversed based Kelly Models

Stakes of Firm A at Agency A (2012~2015) ($0,000)

Reversed rEMProbB based Kelly models

PL of Reversed Prob Kelly Models

Stakes of Firm A at Agency A (2012~2015) ($0,000)

PL of Reversed Prob Kelly Models

Stakes of Firm A at Agency A (2012~2015) ($0,000)

4.3.4 Dynamic Fractional Kelly Ⓜodel

Comparison

Stakes based reversed Kelly models

PL of Stakes Reversed based Kelly Models

Stakes of Firm A at Agency A (2012~2015) ($0,000)

PL of Stakes Reversed based Kelly Models

Stakes of Firm A at Agency A (2012~2015) ($0,000)

Reversed rEMProbB based Kelly models

PL of Reversed Prob Kelly Models

Stakes of Firm A at Agency A (2012~2015) ($0,000)

PL of Reversed Prob Kelly Models

Stakes of Firm A at Agency A (2012~2015) ($0,000)

4.3.5 Bank Roll

4.4 Poisson Ⓜodel

4.4.1 Niko Marttinen (2001)

4.4.2 Dixon and Coles (1996)

4.4.3 ®γσ, Eng Lian Hu (2016)

4.4.4 Combination Handicap

4.5 Staking Ⓜodel and Ⓜoney Ⓜanagement

4.5.1 Risk Management

League Stakes Profiling

Staking allocation and portfolio ($0,000)

4.5.2 Markowitz Portfolio

4.5.3 Optimal Kelly Portfolio

4.5.4 Investors’ Fund Refill

4.6 Expectation Ⓜaximization and Staking Simulation

4.6.1 Truncated Bivariate Normal Distribution

Truncated Bivariate Normal Distribution

Mean values and comparison among randomize bivariate scoring models

Truncated Bivariate Normal Distribution

variance among randomize bivariate scoring models

Truncated Bivariate Normal Distribution

summary among randomize bivariate scoring models