Quant modeling can use non-numerical data, too

10-12-2021 | Insight

We find that text analysis can predict the risk and return characteristics of corporate bonds.

  • Patrick Houweling
    Co-Head of Quant Fixed Income and Lead Portfolio Manager
  • Robbert-Jan 't Hoen
    Researcher

Speed read

  • More than 80% of all corporate information is in an unstructured form
  • Academic literature finds that text in SEC filings predicts the return and volatility of stocks
  • We show that results from the literature carry over to corporate bonds

It is estimated that over 80% of all business-relevant information is in an unstructured form, such as text, video, or audio.1 However, financial models traditionally only use numerical data, such as market prices and company accounting data. Hence, tapping into the large pool of unstructured data has the potential of enriching existing models. We investigated the opportunities that unstructured data present for investing in corporate bonds.2

Mining abundant sources of non-numerical data

Textual data is an important source of non-numerical data. Examples include news articles, social media posts, transcripts of management presentations and corporate reports. Until recently, usage of such text sources in analysis required human intervention to code attributes into numerical form – a slow and tedious process. Nowadays, due to advances in natural language processing (NLP) and the immense growth in computing power, text mining techniques can be used to systematically analyze vast amounts of text data.

Academics as well as practitioners have started analyzing text data for the purpose of predicting the risks and returns of stocks and bonds. One strand of research investigates the information content of corporate reports filed by publicly listed companies in the US with the Securities and Exchange Commission (SEC). Of these SEC filings, most attention is directed towards the annual (Form 10-K) and quarterly (Form 10-Q) reports. The reports are very extensive, owing to laws and regulations that prohibit companies from making materially false or misleading statements, and from omitting material information that would render disclosures misleading. Along with the numerical data from the financial statements, these filings contain large volumes of unstructured textual information.

The information in 10-Ks and 10-Qs should enable any investor to fully understand the state of a company. In practice, however, valuable information in these reports is easily overlooked, because of the daunting challenge of reading and grasping many pages of formal and often very technical text.3 These reports therefore provide an attractive avenue of research for the application of computer-based text analysis.

Data collection and pre-processing

We obtain all 10-Ks and 10-Qs of publicly listed US issuers of corporate bonds in the Bloomberg US Corporate Investment Grade and High Yield ex. Financials indices. The sample covers the period from 1994 to 2017 and contains a total of 212,400 filings, of which 57,952 are 10-Ks and 154,448 10-Qs.

Figure 1 | Filing size

Source: Robeco, EDGAR. Sample period 1994-2017.

To facilitate later analyses, we first clean each document so that only the text, numbers and symbols in the main body of the original filing remain. Figure 1 shows the average size of the cleaned files over time, as measured by the total number of characters. As expected, we find that 10-Ks are, on average, significantly larger than 10-Qs. Moreover, there is a strong upward trend in the size of 10-Ks and 10-Qs. This is driven largely by the gradual increase over time in required disclosures.
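The cleaning step is not described in detail, so the following is only a minimal sketch of one way to do it, assuming the raw filing is an HTML-like string. The function names `clean_filing` and `filing_size` are hypothetical, and real EDGAR filings would need more handling (exhibits, embedded tables, binary attachments) than is shown here.

```python
import re
from html import unescape


def clean_filing(raw: str) -> str:
    """Strip markup from a raw filing and collapse whitespace.

    Illustrative only: assumes HTML-like input and keeps all visible
    text, numbers and symbols.
    """
    # Drop script and style blocks, including their contents.
    text = re.sub(r"(?is)<(script|style)\b.*?</\1>", " ", raw)
    # Remove any remaining tags but keep the text between them.
    text = re.sub(r"(?s)<[^>]+>", " ", text)
    # Decode entities such as &amp; into plain characters.
    text = unescape(text)
    # Collapse runs of whitespace into single spaces.
    return re.sub(r"\s+", " ", text).strip()


def filing_size(raw: str) -> int:
    """Size of a filing as the number of characters in the cleaned text."""
    return len(clean_filing(raw))
```

Averaging `filing_size` per year and per form type would give a series comparable in spirit to the one plotted in Figure 1.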

Text analysis

The next step in the research is to process the cleaned text data so that it becomes understandable to a computer. A commonly used method to convert text into a numerical format is the Bag-of-Words (BoW) model. BoW is an NLP technique that reduces the complexity of text data by removing information about word order and context. All that remains of each filing is a list of term frequencies, i.e., the number of times each unique word appears. The idea behind the model is that the more frequently a term is used, the more important it is.4
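A Bag-of-Words representation can be sketched in a few lines. This is an illustration under stated assumptions, not the exact pipeline used in the research: the tokenizer (lowercase alphabetic runs) is a simplification, and the tiny `STOP_WORDS` set is a placeholder standing in for a full stop word list.

```python
import re
from collections import Counter

# Placeholder stop words for illustration; a real list is much longer.
STOP_WORDS = frozenset({"the", "a", "an", "of", "and", "to", "in", "is"})


def bag_of_words(text: str, stop_words=STOP_WORDS) -> Counter:
    """Return term frequencies, discarding word order and stop words."""
    # Lowercase the text and keep runs of letters as tokens.
    tokens = re.findall(r"[a-z]+", text.lower())
    # Count every token that is not a stop word.
    return Counter(t for t in tokens if t not in stop_words)
```

For example, `bag_of_words("The risk of default is a risk.")` keeps only `risk` (twice) and `default` (once); all information about where those words appeared in the sentence is gone.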

Changers and non-changers

A recently published academic article documents that the similarity of a company’s consecutive 10-Ks and 10-Qs is a significant predictor of stock return and stock return volatility: companies that make more changes to the text of their report compared to their previous report (which the article labels as ‘changers’) underperform companies with fewer changes (labeled as ‘non-changers’) by a wide margin.5 The rationale for this finding is that firms tend to repeat what they reported previously and that they are only required to change the text if there are material changes to the company or to its circumstances over the reporting period. Changes in the text are thus interpreted as being negative. Although extensive text changes are not necessarily a bad sign, analysis does show that these are mostly related to negative events and negative future stock returns.

In our research, we test whether a similar effect exists for corporate bonds. If the degree of similarity between consecutive 10-Ks and 10-Qs is truly linked to firm performance, then we expect to see this reflected in corporate bond returns as well. To gauge the similarity between reports, we compare the text of each report with that of the corresponding report published a year earlier, i.e., a 10-K is compared with the previous year’s 10-K, and a 10-Q with the 10-Q for the same quarter of the previous year.
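The text does not say which similarity measure is used. As one common choice, and purely as an assumption here, cosine similarity can be computed between the term-frequency counts of the current report and those of the report from a year earlier. The function name `cosine_similarity` and the use of `Counter` inputs are illustrative.

```python
import math
from collections import Counter


def cosine_similarity(a: Counter, b: Counter) -> float:
    """Cosine similarity between two term-frequency vectors.

    Returns 1.0 for identical word distributions and 0.0 when the two
    documents share no terms (or when either document is empty).
    """
    # Only terms present in both documents contribute to the dot product.
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)
```

Under this measure, a score close to 1 corresponds to a "non-changer" and a lower score to a "changer". Because it depends only on word frequencies, the score is insensitive to reordering of unchanged text.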

We evaluate the performance of changers versus non-changers on our sample of US investment grade and high yield issuers over the 1997-2017 period. Our hypothetical investment strategy for this research goes long in the bonds of the companies whose reports showed the fewest changes, and goes short in the bonds of the firms with the most changes.
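The long-short construction can be sketched as a simple sort on the similarity score. The details below are assumptions made for illustration and are not taken from the study: the 20% cut-off for each leg, equal weighting within a leg, and the function names `long_short_legs` and `long_short_return`.

```python
from statistics import fmean


def long_short_legs(similarity: dict, fraction: float = 0.2):
    """Split issuers into a long leg (fewest changes) and a short leg (most changes).

    `similarity` maps an issuer identifier to its report similarity score;
    `fraction` is the assumed share of issuers placed in each leg.
    """
    if not 0 < fraction <= 0.5:
        raise ValueError("fraction must be in (0, 0.5]")
    # Highest similarity first: these are the non-changers.
    ranked = sorted(similarity, key=similarity.get, reverse=True)
    n = max(1, int(len(ranked) * fraction))
    if 2 * n > len(ranked):
        raise ValueError("not enough issuers to form two separate legs")
    return ranked[:n], ranked[-n:]


def long_short_return(long_leg, short_leg, returns: dict) -> float:
    """Equal-weighted return of the long leg minus that of the short leg."""
    long_ret = fmean([returns[i] for i in long_leg])
    short_ret = fmean([returns[i] for i in short_leg])
    return long_ret - short_ret
```

In a backtest, the legs would be re-formed each period from the latest available filings and `long_short_return` evaluated on the subsequent period's bond returns.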

We find that, in investment grade as well as in high yield, non-changers have outperformed changers by over 50bps per year and have been less risky than changers, resulting in higher Sharpe ratios for non-changers. Overall, we find that the degree of similarity between consecutive reports has predictive power for corporate bond risk and return, with stronger statistical significance in investment grade than in high yield.
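For reference, the Sharpe ratio is the average excess return divided by the volatility of that excess return. A minimal sketch follows, assuming monthly excess returns as input and square-root-of-time annualization. Both are assumptions, since the return frequency and annualization convention are not stated here, and `annualized_sharpe` is a hypothetical helper name.

```python
import math
import statistics


def annualized_sharpe(excess_returns, periods_per_year: int = 12) -> float:
    """Annualized Sharpe ratio from a series of periodic excess returns."""
    mean = statistics.fmean(excess_returns)
    # Sample standard deviation; requires at least two observations.
    vol = statistics.stdev(excess_returns)
    if vol == 0:
        raise ValueError("volatility is zero; Sharpe ratio is undefined")
    return mean / vol * math.sqrt(periods_per_year)
```

A portfolio with both a higher average return and lower volatility, as reported for non-changers, will have a higher value under this definition.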

1 http://breakthroughanalysis.com/2008/08/01/unstructured-data-and-the-80-percent-rule/
2 This insight is based on an extract from the paper “Continuous innovation in factor credit strategies”, April 2021, by Patrick Houweling, Frederik Muskens and Robbert-Jan 't Hoen.
3 Loughran & McDonald, 2014, “Measuring readability in financial disclosures”, The Journal of Finance, 69(4), 1643-1671.
4 We filter out uninformative words using the popular stop word list of Loughran and McDonald: https://sraf.nd.edu/textual-analysis/resources/#StopWords
5 Cohen, Malloy & Nguyen, 2020, “Lazy prices”, The Journal of Finance, 75(3), 1371-1415.

Important information

The contents of this document have not been reviewed by the Securities and Futures Commission ("SFC") in Hong Kong. If you are in any doubt about any of the contents of this document, you should obtain independent professional advice. This document has been distributed by Robeco Hong Kong Limited (‘Robeco’). Robeco is regulated by the SFC in Hong Kong.
This document has been prepared on a confidential basis solely for the recipient and is for information purposes only. Any reproduction or distribution of this documentation, in whole or in part, or the disclosure of its contents, without the prior written consent of Robeco, is prohibited. By accepting this documentation, the recipient agrees to the foregoing.
This document is intended to provide the reader with information on Robeco’s specific capabilities, but does not constitute a recommendation to buy or sell certain securities or investment products. Investment decisions should only be based on the relevant prospectus and on thorough financial, fiscal and legal advice. Please refer to the relevant offering documents for details including the risk factors before making any investment decisions.
The contents of this document are based upon sources of information believed to be reliable. This document is not intended for distribution to or use by any person or entity in any jurisdiction or country where such distribution or use would be contrary to local law or regulation.
Investment involves risks. Historical returns are provided for illustrative purposes only and do not necessarily reflect Robeco’s expectations for the future. The value of your investments may fluctuate. Past performance is no indication of current or future performance.
