Back to all posts

Text Mining Corporate Filings by Yin Luo

Web scraping, distributed cloud computing, NLP, and machine learning techniques (in short, text mining) can be applied to systematically analyze corporate filings from the EDGAR database. Equipped with his own NLP algorithms, Yin Luo, Vice Chairman at Wolfe Research, studies a wide range of models based on corporate filing data: measuring the document tone or sentiment with finance oriented lexicons; investigating the changes in the language structure; computing the proportion of numeric versus textual information, and estimating the word complexity in corporate filings; and lastly, using machine learning algorithms to quantify the informative contents. His NLP-based stock selection signals have strong and consistent performance, with low turnover and slow decay, and is uncorrelated to traditional factors.

You can now watch Yin's full talk from QuantCon NYC 2017.

Text Mining Unstructured Corporate Filing Data

QuantCon 2018 Returns to NYC

Don't miss QuantCon NYC 2018 April 27th-28th! The event focuses on on algorithmic trading and portfolio optimization, and how data science, alternative data sets, and machine learning, can help you craft and improve on your trading strategies. Confirmed speakers and talks include:

  • Optimizing Trading Strategies without Overfitting by Dr. Ernest Chan, Managing Member at QTS Capital Management LLC.
  • Automation of Equity Markets, the Evolution of High Frequency Trading and the Applicability of Deep Learning by Robert Litzenberger, Professor Emeritus at Wharton, and Alexander Litzenberger, Carnegie Mellon Student
  • Return Predictability and Market-Timing: A One-Month Model by Petra Bakosova, Chief Operating Officer at Hull Investments
  • The Crucial Role of Custom/Alternative Data in Finding Alpha (Using Text-Mined ESG Data as an example) by Dr. Stephen Malinak, Chief Data and Analytics Officer at TruValue Labs

As part of the Quantopian community we would like to offer you a 10% discount on any ticket by using code QCommunityQuantCon2018 at checkout! RSVP here.



The material on this website is provided for informational purposes only and does not constitute an offer to sell, a solicitation to buy, or a recommendation or endorsement for any security or strategy, nor does it constitute an offer to provide investment advisory services by Quantopian.

In addition, the material offers no opinion with respect to the suitability of any security or specific investment. No information contained herein should be regarded as a suggestion to engage in or refrain from any investment-related course of action as none of Quantopian nor any of its affiliates is undertaking to provide investment advice, act as an adviser to any plan or entity subject to the Employee Retirement Income Security Act of 1974, as amended, individual retirement account or individual retirement annuity, or give advice in a fiduciary capacity with respect to the materials presented herein. If you are an individual retirement or other investor, contact your financial advisor or other fiduciary unrelated to Quantopian about whether any given investment idea, strategy, product or service described herein may be appropriate for your circumstances. All investments involve risk, including loss of principal. Quantopian makes no guarantees as to the accuracy or completeness of the views expressed in the website. The views are subject to change, and may have become unreliable for various reasons, including changes in market conditions or economic circumstances.