This dataset contains demographics and passenger information from 891 of the 2224 passengers and crew on board the Titanic. The sinking of the RMS Titanic is one of the most infamous shipwrecks in history. On April 15, 1912, during her maiden voyage, the Titanic sank after colliding with an iceberg, killing 1502 out of 2224 passengers and crew. Thanks to Kaggle and encyclopedia-titanica for the dataset.

titanic is an R package containing data sets providing information on the fate of passengers on the fatal maiden voyage of the ocean liner "Titanic", summarized according to economic status (class), sex, age and survival.

Analyzing the Titanic Dataset in Dataiku. Purpose: to perform a data analysis on a sample Titanic dataset. One example is an analysis of the famous Titanic data set that was the subject of a Kaggle data science competition. Importing the dataset in Dataiku is pretty easy: a single drag-and-drop of the file is required, and from there Dataiku automatically guesses the charset and other parameters of the file (comma separated, etc.).

Kaggle titanic dataset : https: ... To work on the data, you can load the CSV either in Excel or in pandas, for example with df = pd.read_csv('train.csv'). We checked the data types of the columns in the Titanic dataset. In this problem you will use real data from the Titanic to calculate conditional probabilities and …

A wealth of curated data sets, available in different formats (including CSV suitable for Excel), including "number of Prussian cavalry soldiers killed by horse kicks (1875 to 1894)", "Global-mean monthly, seasonal, and annual temperatures since 1880", and many more.
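
The pandas route mentioned above (loading the CSV, checking column data types, and estimating conditional probabilities) can be sketched roughly as follows. This is a minimal illustration, not the original analysis. It assumes the Kaggle-style column names Survived, Pclass, Sex and Age, and it reads a handful of made-up rows from an in-memory string so that it runs without train.csv. The helper conditional_survival is a hypothetical name introduced here.

```python
import io

import pandas as pd

# Made-up sample rows for illustration only; these are NOT real passenger records.
# The column names follow the Kaggle train.csv layout (an assumption here).
SAMPLE_CSV = """PassengerId,Survived,Pclass,Sex,Age
1,0,3,male,22
2,1,1,female,38
3,1,3,female,26
4,1,1,female,35
5,0,3,male,35
6,0,3,male,
7,0,1,male,54
8,1,2,female,27
"""

# With the real file this would be: df = pd.read_csv('train.csv')
df = pd.read_csv(io.StringIO(SAMPLE_CSV))

# Check the data type pandas inferred for each column.
print(df.dtypes)


def conditional_survival(frame: pd.DataFrame, column: str, value) -> float:
    """Estimate P(Survived = 1 | column == value) as a relative frequency."""
    subset = frame[frame[column] == value]
    if subset.empty:
        raise ValueError(f"no rows with {column} == {value!r}")
    # Survived is coded 0/1, so its mean is the share of survivors.
    return float(subset["Survived"].mean())


p_female = conditional_survival(df, "Sex", "female")
p_male = conditional_survival(df, "Sex", "male")
p_third = conditional_survival(df, "Pclass", 3)
print(p_female, p_male, p_third)
```

Because Survived is coded as 0 or 1, the mean over a filtered subset is the relative frequency of survival given the condition. The missing Age value in the sample makes pandas infer a floating-point type for that column, which is the kind of detail a data type check surfaces.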