Idealized scientific research:
Empirical finance research in academia:
Ideally research would involve common objectives, where conclusions can generate public goods, hence the idea behind publishing.
Does this make sense in empirical finance?
Empirical finance research in industry:
This is due to the fact that asymmetric information in markets is what creates opportunities for individuals.
Question: If incentives are sometimes aligned against collaborative and open-source empirical finance research, should we expect research to be replicable?
Follow-up question: If incentives are that publishing is highly rewarded, can we expect published research to be “genuine” research?
It is an important issue in empirical finance:
Data mining, aka p-hacking:
In conclusion, as good statisticians, our goal is to avoid data mining from our research methodology while limiting statistical bias when appropriate, and to recognize these issues when evaluating other researchers’ results.
Numerous elements can cause results to be biased and invalid:
He concludes that it is likely for most scientific findings to be more false than true, being accurate measures of prevailing bias rather than actual rigorous results.
We have seen that in the context of empirical finance, financial incentives and flexibility in design can play a large role in driving results.
Why are most medical findings false?
Definitions
See xkcd
Bayes Theorem implies
\begin{align*} PPV & =\Pr\left( H_{0}=F|S=f\right) =\frac{\Pr\left( H_{0}=F,S=f\right)}{\Pr\left( S=f\right)}\\ &=\frac{\left( 1-\beta\right) \cdot\Pr\left( H_{0}=F\right)}{\left( 1-\beta\right) \cdot\Pr\left( H_{0}=F\right) +\alpha\cdot\Pr\left( H_{0}=T\right)}\\ &=\frac{\left( 1-\beta\right) \cdot R}{\left( 1-\beta\right) \cdot R+\alpha} \end{align*}
\therefore Research findings are more likely right than wrong
\iff\left( 1-\beta\right) \cdot R>\alpha\approx0.05
Implications
Repeated Testing
n \equiv number of H_{0} tested (but not reported) to produce 1 statistically significant result that is reported
Now,
PPV = \frac{\left( 1-\beta^{n}\right) \cdot R}{\left( 1-\beta^{n}\right) \cdot R+\left[ 1-\left( 1-\alpha\right) ^{n}\right]} \approx \frac{\left( 1-\beta^{n}\right) \cdot R}{\left( 1-\beta^{n}\right) \cdot R+\left[ 1-.95^{n}\right]}
Research findings are more likely right than wrong
\iff\left( 1-\beta^{n}\right) \cdot R>1-\left( 1-\alpha\right) ^{n}
Raising n can make it harder to satisfy this condition!
Asset pricing studies seek to answer a variety of research questions, but they usually share:
Sufficient grounds for researchers to be concerned about MHT in financial predictability studies.
Inference can only be made by making explicit assumptions about the data-generating process and the ability of researchers to filter unsound strategies.
In their study, MHT-adjusted thresholds for t-statistics of time-series alpha and cross-sectional FM regressions slopes are 3.8 and 3.4 respectively, implying 1,028 and 4,790 tests to be attempted to have a 0.50 probability of meeting the threshold.
Important There is no absolute standard of significance for hypothesis testing, only generally accepted guidelines for statistical research.
Important Also consider that authors usually need a theory, motivation or story to get published. Similar in a corporate finance.
➡️ This raises the R in Ioannidis’s formulas.
Many other potential issues are of concern in the field of empirical finance. We will discuss three:
Understanding the variations between the significance of the Fama-French factors depending on the time they were downloaded.
They find that factor returns differ substantially depending on factor vintage, which causes a third of the anomaly long-short portfolios to lose statistical significance.
How is that possible?
Question If such small deviations in methodology have such an important impact on study results, what other hidden issues could exist which are currently overlooked by researchers?

HML is the most affected factor

α of well-diversified portfolios are significantly affected.

The first crowd-sourced empirical paper in Economics/Finance. The project seeks to expose the variation across researchers for results they report independently testing the same hypotheses on the same sample.
343 authors from 34 countries answering the same asset pricing questions:


Harvey (2022): yes, but incentives make it less of a problem than in academia.
I believe p-hacking is less of a problem in asset management than in academia—in particular, less of a problem in the proprietary research that is the foundation for a product. The reasons are simple. First, in the presence of a performance fee, the asset management company’s research needs to be optimized in a way that maximizes the chances of repeatable performance. This means the asset manager does not choose the best-performing backtest, because it is the one that is most likely to be overfit. If the manager were to launch a backtest-overfitted strategy, it would likely fail and thereby generate no performance fees. The second reason is reputation. Academic tenure has no equivalent in asset management. If an asset manager’s products disappoint because of overfitting, the firm’s investors will flee. This market mechanism naturally minimizes the overfitting. That said, asset management companies still produce a substantial amount of low-quality research. Similar to the academic research, investors need to be skeptical.
One of the groups that has attempted to measure the robustness of past asset pricing studies by re-assessing the strength of established factors.
Study design:
Conclusion Most anomalies are not replicable empirically, and those that are successfully replicated are so with magnitudes that are often lower.
Emphasis on the impact of controlling for micro-cap stocks:
Is it realistic to consider strategies to be significant almost uniquely on the behavior of this (small) sub-sample of equities?
Most practical implementations of researched investing strategy will not prioritize micro-caps in their design!
Conclusion While the influence of micro-caps is overwhelming in asset pricing studies, it is almost nonexistent in actual money management.
For Chen and Zimmerman, past replications studies measure predictability deviating from the methods employed in original research. - If the method is different, for sure the results will be different!
Important Replication must be undertaken with the exact same conditions as in the original study. The methodology must be followed precisely and assumptions be extremely limited.

Importance of ethical practices amongst scientists:
There is no escaping it, if your initial method is wrong, every conclusion that ensues is wrong.
The authors hereby retract the above article, published in print in the April 2020 issue of The Journal of Finance. A replication study finds that the replication code provided in the supplementary information section of the article does not reproduce some of the central findings reported in the article. Upon reexamination of the work, the authors confirmed that the replication code does not fully reproduce the published results and were unable to provide revised code that does. Therefore, the authors conclude that the published results are not reliable and that the responsible course of action is to retract the article and return the Brattle Group Distinguished Paper Prize that the article received. The authors deeply regret the damage this caused to the journal and the scholarly community. The specific contributions of the authors to the article were as follows: the first and second author provided the theoretical hypothesis; all three authors jointly designed the empirical approach and identification strategy; the third author constructed and handled the data, implemented the empirical analysis, and provided the empirical results as well as the replication data and code. The third author states that the original data and code that produced the published results were lost. The first and second author were not notified of the loss of the original data and code at the time it occurred and had no prior knowledge of the issues with the replication data and code provided to the journal.
As technological advances become prevalent in corporate and academic research, scientists have the possibility of making their work fit for replication with adequate documentation and rigorous exposition.
The Turing Way Project:
➡️ When researchers employ transparency in their research - in other words, when they properly document and share the data and processes associated with their analyses - the broader research community is able to save valuable time when reproducing or building upon published results.




You can read and write Excel files with pandas.
You can also use Excel with Python:
Book (O’Reilly): Python for Excel
MATH60230