The concept of getting the best data is not new. During the Napoleonic wars, a group of European bankers set up a Europe-wide network of messengers and carrier pigeon stations to gather battleground information as quickly as possible.
Since then, investors have found other ways to obtain information to gain an advantage. Traditional financial data typically consists of financial statements and other company disclosures, combined with brokerage information. Most come from a few centralized sources, led by the company itself, sell-side analysts, and stock exchanges.
But since the mass adoption of the internet and smartphones in the 21st century, alternative data is now more popular. This can come from various sources, such as satellite images, credit card spending, and even web blog comments. Research shows that there are about 180 data marketplaces, 2,000 data providers, and more than 200,000 datasets.1 Average spending to acquire this data has also risen, as shown in the charts below:
Source: Robeco, AlternativeData.org, and Azcoitia, Iordanou, and Laoutaris (2021)
The four Vs
Compared to traditional financial data, alternative data sources are characterized by the four Vs:
Volume: The amount of information from alternative financial data sources dwarfs that from standard financial data
Velocity: Alternative financial data creation and dissemination is much faster
Variety: An endless variety of sources, such as figures, text, images, or even audio/video files that are much richer
Veracity: It can be crowdsourced and disseminated without rigorous controls and may be false or misleading.
So, the alternative data marketplace is a rich ecosystem with pros and cons. Therefore, investors who use alternative data need the skills to extract valuable insights and avoid misleading information.
Applying the data
Alternative data also allows users to gain a different insight or perspective than investors who only use traditional data. A good example can be seen in the blogs that Chinese retail investors use to gain insights into the A-shares market.2 Investors who read these blogs have an informational advantage in short-term investor flows compared to those who wait for the publication of the following quarterly report.
And alternative data can be essential when it comes to sustainable investing. Traditional data on issues such as carbon emissions or gender diversity comes from corporations and tends to be backward-looking. Understanding how a company plans to align itself with the energy transition requires a more forward-looking approach. And looking at executives’ biographical data and employees’ profiles on networking sites allows analysts to get a better sense of a company’s actual levels of diversity.
Alternative data in investment processes
So, how do institutional investors use alternative data in their investment processes? This can, broadly speaking, be split into quantitative or fundamental approaches. Quant investors were the early adopters and had been using alternative data on a large scale. Their investment process involves establishing an investment hypothesis, which is tested through statistical and mathematical analysis of financial data – the classical scientific approach.
The wide availability of alternative data broadens the types of investment hypotheses that quant investors can investigate. For example, suppose we want to examine if better employee morale leads to a company’s long-term outperformance. This question is impossible to answer if you only use financial statements or stock price data. However, using information from websites like Glassdoor means researchers can examine whether companies with high employee sentiment will outperform those with lower morale.3
Conversely, fundamental investors have mostly not adopted alternative data since investment decisions are based on the judgment of individual managers. Instead, they examine financial statements, speak to company management and observe the popularity of various products and services. Thus, much of the intangible information encapsulated by alternative data is observed directly via human activities.
Frequently asked questions
There remain frequently asked questions, the most common of which is where can investors source alternative data? The most straightforward approach is to use alternative data aggregators and brokers, such as Neudata or Eagle Alpha. Another popular channel is service providers such as Bloomberg and FactSet, who now also provide alternative data alongside traditional financial data.
Another popular question is how much does it cost? Not as much as one might imagine. In the early 2010s, alternative data vendors often asked for upwards of a million dollars, even for datasets with a low breadth and non-granular information. Now that it has become more mainstream, the cost has come down to below USD 100,000, and the median dataset price is currently around USD 17,000 per year.4
So, what is required to process it? Many investors realize that the bottleneck is not finding alternative data, but rather having the technical skills and infrastructure needed to onboard and process it all. Whereas traditional financial data can be stored in spreadsheets and processed with simple statistical tools, alternative data requires more sophisticated tools such as machine learning.
Finally, which dataset should be chosen? Although investors can often trial alternative data for free, the time required to properly investigate if it actually adds value is still significant. From our experience, experienced alternative data users typically only onboard five to ten new datasets per year. Part of the reason for this low quantity of data onboard is the time requirement.
You still need fund management
Ultimately, the investors create real value in the investment process by transforming the data of whatever kind into actionable portfolio decisions, irrespective of whether they are part of a quantitative or fundamental investment style.
Based on alternative data, we can, though, ask more exciting research questions and create more value through enhanced performance and investment solutions that are better aligned with client goals.
This article is an excerpt of a special topic in our five-year outlook.
1Azcoitia, S., Iordanou, C., and Laoutaris, N. (2021). “What is the price of data? A measurement study of commercial data marketplaces”, ArXiv working paper 2111.04427.
2Chen, M., Lee, J., and Mussalli, G. (2020). “Teaching machines to understand Chinese investment slang”, Journal of Financial Data Science 2(1), 116-125.
3Filbeck, G., and Zhao, X. (2022). “Glassdoor best places to work: how do they work for shareholders?”, Studies in Economics and Finance (forthcoming).
4Neudata (2022). “What is the Price of Data?”, Neudata Literature Review
当資料は情報提供を目的として、Robeco Institutional Asset Management B.V.が作成した英文資料、もしくはその英文資料をロベコ・ジャパン株式会社が翻訳したものです。資料中の個別の金融商品の売買の勧誘や推奨等を目的とするものではありません。記載された情報は十分信頼できるものであると考えておりますが、その正確性、完全性を保証するものではありません。意見や見通しはあくまで作成日における弊社の判断に基づくものであり、今後予告なしに変更されることがあります。運用状況、市場動向、意見等は、過去の一時点あるいは過去の一定期間についてのものであり、過去の実績は将来の運用成果を保証または示唆するものではありません。また、記載された投資方針・戦略等は全ての投資家の皆様に適合するとは限りません。当資料は法律、税務、会計面での助言の提供を意図するものではありません。 ご契約に際しては、必要に応じ専門家にご相談の上、最終的なご判断はお客様ご自身でなさるようお願い致します。 運用を行う資産の評価額は、組入有価証券等の価格、金融市場の相場や金利等の変動、及び組入有価証券の発行体の財務状況による信用力等の影響を受けて変動します。また、外貨建資産に投資する場合は為替変動の影響も受けます。運用によって生じた損益は、全て投資家の皆様に帰属します。したがって投資元本や一定の運用成果が保証されているものではなく、投資元本を上回る損失を被ることがあります。弊社が行う金融商品取引業に係る手数料または報酬は、締結される契約の種類や契約資産額により異なるため、当資料において記載せず別途ご提示させて頂く場合があります。具体的な手数料または報酬の金額・計算方法につきましては弊社担当者へお問合せください。 当資料及び記載されている情報、商品に関する権利は弊社に帰属します。したがって、弊社の書面による同意なくしてその全部もしくは一部を複製またはその他の方法で配布することはご遠慮ください。 商号等： ロベコ・ジャパン株式会社 金融商品取引業者 関東財務局長（金商）第２７８０号 加入協会： 一般社団法人 日本投資顧問業協会