By 2022, the cumulative data volume had soared to approximately 97 zettabytes1. To put this into perspective, one zettabyte amounts to 10^21 bytes, equivalent to roughly one trillion books' worth of data. However, even this comparison fails to fully showcase the sheer size of stored data. 

The magnitude of this size requires efficient storage, management, and implementation to get the best available information in the shortest time possible. In recent times, storage needs have been addressed by cloud technology, which uses remote servers eliminating local drive/datacentre storage.

In terms of data management, structured data can be stored and retrieved in tables. Unstructured data like photos, emails, and documents which are growing exponentially require an additional layer of data called ‘metadata’ which provides an ID for such datasets. At an aggregate level, even metadata may fail to cover all the unstructured data which is now being tackled by artificial intelligence and machine learning models. These algorithms are designed to study and train on the vast unstructured data and find out patterns of matching data. AI and ML algorithms continuously improve their efficiency through feedback mechanisms. As they process more data and receive feedback on their performance, they adapt and refine their algorithms to achieve higher accuracy and effectiveness in identifying patterns and making predictions.

Very recently, AI has also been utilized for predictive analysis, where historical data is analyzed to forecast future trends or outcomes. By analyzing patterns and correlations within the data, AI algorithms can make predictions about future events or behaviors, enabling organizations to make informed decisions and take proactive measures.

BlueStar® Big Data & AI Index (BDAI), which tracks the performance of companies that are involved in the Big Data and Artificial Intelligence industries was up ~77% in YoY.


For more information on our family of indexes, visit


About the Author:

Kinjal Shukla is the Index Researcher and Data Engineer at MarketVector. She is responsible for the design and development of MarketVector indexes and for creating data infographics. Kinjal has cleared FRM Level 1 having come from a risk profile in Barclays, India. She has a degree in a Master of Science in Financial Engineering from Stevens Institute of Technology, USA, and an MBA-Tech degree from NMIMS-MPSTME University, India.


For informational and advertising purposes only. The views and opinions expressed are those of the authors but not necessarily those of MarketVector Indexes GmbH. Opinions are current as of the publication date and are subject to change with market conditions. Certain statements contained herein may constitute projections, forecasts, and other forward-looking statements, that do not reflect actual results. It is not possible to invest directly in an index. Exposure to an asset class represented by an index is available through investable instruments based on that index. MarketVector Indexes GmbH does not sponsor, endorse, sell, promote, or manage any investment fund or other investment vehicle that is offered by third parties and that seeks to provide an investment return based on the performance of any index. The inclusion of a security within an index is not a recommendation by MarketVector Indexes GmbH to buy, sell, or hold such security, nor is it considered to be investment advice.