You are currently viewing 10 Best Datasets for Finance [2023]

10 Best Datasets for Finance [2023]

The finance industry relies heavily on data analysis to make informed investment decisions and manage risk. With the rapid advancement of technology, the availability of financial data has increased exponentially. In this article, we will be exploring the 10 best datasets for finance in 2023.

Dataset NameSizeDownload LinkDescription
S&P 500 Stock Prices2,807 recordshttps://www.kaggle.com/dgawlik/nyse#prices.csvThis dataset includes information on S&P 500 stock prices from 2010 to 2020.
Global Financial Crisis3,656 recordshttps://www.kaggle.com/wesseb/global-financial-crisis-2008-to-2009This dataset includes information on the global financial crisis of 2008 and 2009, including stock prices and macroeconomic indicators.
Housing Prices14,60 recordshttps://www.kaggle.com/c/home-data-for-ml-courseThis dataset includes information on housing prices in King County, Washington from 2014-2015.
Crypto Market8,000 recordshttps://www.kaggle.com/sudalairajkumar/cryptocurrencypricehistoryThis dataset includes information on cryptocurrency prices from 2013 to 2018.
Bank Marketing45,211 recordshttps://archive.ics.uci.edu/ml/datasets/bank+marketingThis dataset includes information on a bank marketing campaign, including customer demographics and response to marketing efforts.
Stock Market1,600 recordshttps://www.kaggle.com/szrlee/stock-time-series-20050101-to-20171231This dataset includes information on stock market prices and volume from 2005 to 2017.
Financial Distress1,167 recordshttps://www.kaggle.com/shebrahimi/financial-distressThis dataset includes information on financial distress in US companies from 1996 to 2016.
Credit Card Fraud Detection284,807 recordshttps://www.kaggle.com/mlg-ulb/creditcardfraudThis dataset includes information on credit card transactions, with a high percentage of fraudulent transactions.
Santander Customer Transaction Prediction200,000 recordshttps://www.kaggle.com/c/santander-customer-transaction-predictionThis dataset includes information on customer transactions, with a minority of positive classifications (i.e. customer will make a transaction).
Loan Default887 recordshttps://www.kaggle.com/kashnitsky/topic-4-linear-models-and-sgdr-practice-timeThis dataset includes information on loan defaults, with a high imbalance between positive and negative classifications (i.e. loans that defaulted and loans that did not).

Leave a Reply