05-11-2023 · Insight

Machine learning in finance: Why and how?

With the rise in popularity of machine learning (ML), it’s no surprise that investment practitioners are attracted to this potential industry game changer.

That’s why this new research report by Robeco is an important contribution to the conversation. It provides detailed analysis of the huge upside of this technological advancement, but also expresses a much-needed note of caution for the finance and investment industry.

    Authors

  • Mike Chen - Head of Next Gen Research

    Mike Chen

    Head of Next Gen Research

  • Weili Zhou - Head of Quant Investing & Research

    Weili Zhou

    Head of Quant Investing & Research

So, why ML in investing?

The answer is simple: the promise of higher risk-adjusted returns. In general, industry practitioners have found that ML-derived alpha models outperform more traditional, linear models in predicting cross-sectional equity returns.

In this report, the authors explore the interaction between ML and other known quantitative techniques. They examine non-linear effects which are a key factor behind the superior performance of ML. The interaction effect between accounting red flags and equity returns is an interesting illustration of this.

Another notable feature of ML algorithms is their so called ‘knowledge discovery’ abilities. Given that they’re designed to look for relationships, an ML algorithm can decipher the link between inputs and outputs without specifying a hypothesis.

Investor beware…

It’s worth mentioning that financial markets differ from other areas of modern life where ML has made tremendous strides. Financial markets’ inherent complexity throws up notable challenges. For instance, the Robeco research highlights the low signal-to-noise ratio of financial data. This means for a given security, any one metric is generally not a huge determinant of how that security will perform.

Another challenge relates to amount of data, a significant driver of an ML algorithm’s power. While it may feel like there is a vast amount of data in finance, it’s actually relatively small compared with other areas where ML has thrived.

Finally, financial markets are adaptive. This means they ‘learn’ over time, enabling investors to change their approach as required. However, because ML tends to perform well with static systems, this throws up potential difficulties.

Overcoming challenges in application

Due to the short sample data history, overfitting and spurious correlations can occur. While common techniques to overcome this may not work as well in finance, human intuition and economic domain knowledge can help significantly. In the full Robeco paper, you’ll see the authors’ illustration of this using the SHapley Additive exPlanation (SHAP) value.

Robeco suggests that ML investors build a robust data and code infrastructure to tackle potential replicability challenges. This would include a robust system for code version control and documentation of all tested iterations and hypotheses.

Quant investors will be familiar with lookahead bias and the more general problem of data leakage. This phenomenon occurs when data used in the training set contains information that can be used to infer the prediction, information that would otherwise not be available to the ML model in live production. The report outlines various techniques to counteract this potential pitfall.

Machine learning models can be difficult to understand and explain, especially via performance attribution. The full Robeco report offers the fundamental approach to recent explainable machine learning work, including helpful diagrams.

As technology advances, so do the opportunities for quantitative investors. By incorporating more data and leveraging advanced modelling techniques, we can develop deeper insights and enhance decision-making.

Sample ML applications in finance

The Robeco report analyses practical applications of ML to financial investing. It takes a comprehensive dive into each area of study:

  1. Predicting cross-sectional stock returns
    The ML algorithms used here compare the relative, rather than absolute, returns of securities to predict whether a security’s price will rise or fall. The report delves into five consistent results that practitioner and academic studies have found, starting with the outperformance of ML algorithm prediction versus the traditional linear approach.

  2. Predicting stock crashes
    This application differs slightly in focusing on the worst-performing stocks only, offering potential benefits for conservative strategies which can then exclude securities most likely to crash. The performance of this ML algorithm is shown to be greater than that of traditional approaches, with notable implications for stock selection when looking on a sectoral basis.

  3. Predicting fundamental variables
    Company fundamentals have a major influence on stock price and performance. Fundamentals, being more stable, may be easier to predict than stock returns. ML models give more accurate earnings forecasts. One key conclusion is that ensemble models, traditional or machine learned, were more accurate than individual models alone.

  4. Natural Language Processing (NLP) in multiple languages
    In general, a global portfolio may invest in 20-30 different countries, while a typical investor may understand only two or three languages, if that. NLP can be used to overcome this barrier. Taking Chinese as an example, NLP can be used to detect investment terms in both standard Chinese and slang. This can aid understanding of foreign language investment blogs for instance.

Let's keep the conversation going

Keep track of fast-moving events in sustainable and quantitative investing, trends and credits with our newsletters.

Stay updated
Robeco

Robeco aims to enable its clients to achieve their financial and sustainability goals by providing superior investment returns and solutions.

Important information
The Robeco Capital Growth Funds have not been registered under the United States Investment Company Act of 1940, as amended, nor or the United States Securities Act of 1933, as amended. None of the shares may be offered or sold, directly or indirectly in the United States or to any U.S. Person (within the meaning of Regulation S promulgated under the Securities Act of 1933, as amended (the “Securities Act”)). Furthermore, Robeco Institutional Asset Management B.V. (Robeco) does not provide investment advisory services, or hold itself out as providing investment advisory services, in the United States or to any U.S. Person (within the meaning of Regulation S promulgated under the Securities Act).
This website is intended for use only by non-U.S. Persons outside of the United States (within the meaning of Regulation S promulgated under the Securities Act who are professional investors, or professional fiduciaries representing such non-U.S. Person investors. By clicking “I Agree” on our website disclaimer and accessing the information on this website, including any subdomain thereof, you are certifying and agreeing to the following: (i) you have read, understood and agree to this disclaimer, (ii) you have informed yourself of any applicable legal restrictions and represent that by accessing the information contained on this website, you are not in violation of, and will not be causing Robeco or any of its affiliated entities or issuers to violate, any applicable laws and, as a result, you are legally authorized to access such information on behalf of yourself and any underlying investment advisory client, (iii) you understand and acknowledge that certain information presented herein relates to securities that have not been registered under the Securities Act, and may be offered or sold only outside the United States and only to, or for the account or benefit of, non-U.S. Persons (within the meaning of Regulation S under the Securities Act), (iv) you are, or are a discretionary investment adviser representing, a non-U.S. Person (within the meaning of Regulation S under the Securities Act) located outside of the United States and (v) you are, or are a discretionary investment adviser representing, a professional non-retail investor.


Access to this website has been limited so that it shall not constitute directed selling efforts (as defined in Regulation S under the Securities Act) in the United States and so that it shall not be deemed to constitute Robeco holding itself out generally to the public in the U.S. as an investment adviser. Nothing contained herein constitutes an offer to sell securities or solicitation of an offer to purchase any securities in any jurisdiction. We reserve the right to deny access to any visitor, including, but not limited to, those visitors with IP addresses residing in the United States. This website has been carefully prepared by Robeco. The information contained in this publication is based upon sources of information believed to be reliable. Robeco is not answerable for the accuracy or completeness of the facts, opinions, expectations and results referred to therein. Whilst every care has been taken in the preparation of this website, we do not accept any responsibility for damage of any kind resulting from incorrect or incomplete information. This website is subject to change without notice. The value of the investments may fluctuate. Past performance is no guarantee of future results. If the currency in which the past performance is displayed differs from the currency of the country in which you reside, then you should be aware that due to exchange rate fluctuations the performance shown may increase or decrease if converted into your local currency. For investment professional use only. Not for use by the general public.