Skip to content

Commit 8a86542

Browse files
committed
Fix various typos
1 parent 0a172c3 commit 8a86542

1 file changed

Lines changed: 5 additions & 15 deletions

File tree

content/titanic/CaseStudy_Titanic-Solution.ipynb

Lines changed: 5 additions & 15 deletions
Original file line numberDiff line numberDiff line change
@@ -70,7 +70,7 @@
7070
"| --- | --- | --- |\n",
7171
"| survival | Survival | 0 = No, 1 = Yes |\n",
7272
"| pclass | Ticket class\t| 1 = 1st, 2 = 2nd, 3 = 3rd |\n",
73-
"| sex | Sex | male/femail |\t\n",
73+
"| sex | Sex | male/female |\t\n",
7474
"| Age | Age | in years |\n",
7575
"| sibsp | # of siblings / spouses aboard the Titanic | |\n",
7676
"| parch | # of parents / children aboard the Titanic | |\n",
@@ -104,7 +104,7 @@
104104
"source": [
105105
"### Load Data\n",
106106
"\n",
107-
"This dataset is in titanic.csv. Make sure the file is in current folder. Please download the file from [here](https://github.com/data-lessons/python-business/tree/gh-pages/data) if you haven't done so yet."
107+
"This dataset is in titanic.csv. Make sure the file is in current folder."
108108
]
109109
},
110110
{
@@ -934,7 +934,7 @@
934934
"cell_type": "markdown",
935935
"metadata": {},
936936
"source": [
937-
"##### Task7: Plot Perished vs. Survived Bar for Male and Femail\n",
937+
"##### Task7: Plot Perished vs. Survived Bar for Male and Female\n",
938938
"We will use seaborn countplot() again, but set argument `hue` to 'Survived'."
939939
]
940940
},
@@ -1727,7 +1727,7 @@
17271727
"### Feature Engineering\n",
17281728
"We'll create a new column FamilySize. There are 2 columns related to family size, parch indicates parent or children number, Sibsp indicates sibling and spouse number.\n",
17291729
"\n",
1730-
"Take one name 'Asplund' as example, we can see that total family size is 7(Parch + SibSp + 1), and each family member has same Fare, which means the Fare is for the whole group. So family size will be an important feature to predict Fare. There're only 4 Asplunds out of 7 in the dataset becasue the dataset is only a subset of all passengers."
1730+
"Take one name 'Asplund' as example, we can see that total family size is 7 (Parch + SibSp + 1), and each family member has same Fare, which means the Fare is for the whole group. So family size will be an important feature to predict Fare. There're only 4 Asplunds out of 7 in the dataset becasue the dataset is only a subset of all passengers."
17311731
]
17321732
},
17331733
{
@@ -2054,17 +2054,7 @@
20542054
"\n",
20552055
"## Step 4: Modeling\n",
20562056
"\n",
2057-
"Now we have a relatively clean dataset(Except for Cabin column which has many missing values). We can do a classification on Survived to predict whether a passenger could survive the desaster or a regression on Fare to predict ticket fare. This dataset is not a good dataset for regression. But since we don't talk about classification in this workshop we will construct a linear regression on Fare in this exercise."
2058-
]
2059-
},
2060-
{
2061-
"cell_type": "markdown",
2062-
"metadata": {},
2063-
"source": [
2064-
"##### Task16: Contruct a regresson on Fare\n",
2065-
"Construct regression model with statsmodels.\n",
2066-
"\n",
2067-
"Pick Pclass, Embarked, FamilySize as independent variables."
2057+
"Now we have a relatively clean dataset (except for the **Cabin** column which has many missing values). We can do a classification on Survived to predict whether a passenger could survive the disaster or a regression on Fare to predict ticket fare. This dataset is not a good dataset for regression. But since we don't talk about classification in this workshop we will construct a linear regression on Fare in this exercise."
20682058
]
20692059
},
20702060
{

0 commit comments

Comments
 (0)