
Titanic Machine Learning from Disaster Analysis

   

Added on  2019-09-22

4 Pages | 506 Words | 325 Views
Data:

Title: Titanic: Machine Learning from Disaster
Link: https://www.kaggle.com/c/titanic/data

Overview:

The data has been split into two groups:
    training set (train.csv)
    test set (test.csv)

The training set is used to build your machine learning models. The test set should be used to see how well your model performs on unseen data. For the test set, we do not provide the ground truth for each passenger. It is your job to predict these outcomes. For each passenger in the test set, use the model you trained to predict whether or not they survived the sinking of the Titanic.

Data Dictionary

Variable    Definition                                     Key
survival    Survival                                       0 = No, 1 = Yes
Pclass      Ticket class                                   1 = 1st, 2 = 2nd, 3 = 3rd
Sex         Sex
Age         Age in years
Sibsp       # of siblings / spouses aboard the Titanic
Parch       # of parents / children aboard the Titanic
Ticket      Ticket number
Fare        Passenger fare
Cabin       Cabin number
embarked    Port of Embarkation                            C = Cherbourg, Q = Queenstown, S = Southampton

Variable Notes

pclass: A proxy for socio-economic status (SES)
    1st = Upper
    2nd = Middle
    3rd = Lower

age: Age is fractional if less than 1. If the age is estimated, it is in the form of xx.5.

sibsp: The dataset defines family relations in this way...
    Sibling = brother, sister, stepbrother, stepsister
    Spouse = husband, wife (mistresses and fiancés were ignored)

parch: The dataset defines family relations in this way...
    Parent = mother, father
    Child = daughter, son, stepdaughter, stepson
    Some children travelled only with a nanny, therefore parch=0 for them.

Question 2:

Please check the twbx file.

Question 3:
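
Returning to the data dictionary and variable notes above: the keys listed there can be applied when reading the CSV files. The following is only a minimal sketch, not the method used in this analysis. It assumes the CSV headers are spelled Survived, Pclass, Age, SibSp, Parch and Embarked; the function names, the "relatives_aboard" feature and the sample rows are all invented for illustration.

```python
import csv
import io

# Code-to-label mappings taken from the data dictionary above.
SURVIVAL = {"0": "No", "1": "Yes"}
PCLASS_SES = {"1": "Upper", "2": "Middle", "3": "Lower"}
EMBARKED_PORT = {"C": "Cherbourg", "Q": "Queenstown", "S": "Southampton"}


def decode_row(row):
    """Translate one raw CSV row (a dict of strings) into readable values."""
    age_text = row.get("Age", "")
    age = float(age_text) if age_text else None  # Age may be blank
    return {
        # Test-set rows carry no ground truth, so this is None for them.
        "survived": SURVIVAL.get(row.get("Survived", "")),
        "ses": PCLASS_SES.get(row["Pclass"]),
        "age": age,
        # Heuristic only: the notes say estimated ages take the form xx.5,
        # while ages below 1 are fractional for a different reason.
        "age_estimated": age is not None and age >= 1 and age % 1 == 0.5,
        # Hypothetical derived feature: sibsp + parch.
        "relatives_aboard": int(row["SibSp"]) + int(row["Parch"]),
        "port": EMBARKED_PORT.get(row.get("Embarked", "")),
    }


def decode_csv(text_stream):
    """Decode every row of a CSV stream that has a header line."""
    return [decode_row(r) for r in csv.DictReader(text_stream)]


# Invented sample rows for illustration only; not real passenger records.
SAMPLE = """Survived,Pclass,Sex,Age,SibSp,Parch,Embarked
1,1,female,28.5,1,0,C
0,3,male,,0,2,S
"""

decoded = decode_csv(io.StringIO(SAMPLE))
for record in decoded:
    print(record)
```

To run the same decoding on the real file, the StringIO object could be replaced with open("train.csv", newline=""), assuming the file has been downloaded from the link above. Note that parch=0 does not necessarily mean a child travelled alone, since the notes say children accompanied only by a nanny are recorded that way.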