Need to perform Data mining using spread sheet

The sinking of the Titanic is one of the most infamous shipwrecks in history. On April 15,

1912, during her maiden voyage, the widely considered “unsinkable” RMS Titanic sank after

colliding with an iceberg. Unfortunately, there weren’t enough lifeboats for everyone

onboard, resulting in the death of 1502 out of 2224 passengers and crew. While there was

some element of luck involved in surviving, it seems some groups of people were more likely

to survive than others. We would want to build a predictive model to predict what people

are more likely to survive Titanic sinking? The data is grouped according to whether or not a

person survived (1=significant, 0=insignificant). Download the data from D2L, and the

following steps that will guide you how to build a data mining model:

A. For any dataset, we need to clean our data first before doing any data analysis. There

are 8 steps that we discussed in Topic 1 – Data Preparation needed to perform.

However, to simplify this step we will perform only data transformation and clean

our missing data:

i. There are 4 missing variables in our data set. Variable ‘Cabin’ missed 80%

values. Therefore, we won’t use it in our models. Replaced missing values of

variable ‘Embarked’ with the most common value, missing values of variable

‘Age’ and variable ‘Price’ with average values (0.5 point).

ii. Since variable ‘Sex’ and ‘Embarked’ are categorical variable, we will need to

transform them. Transform variable ‘Sex’ into dummy variable (value 0 and

1), and variable ‘Embarked’ into numeric variable (value 1, 2, and 3) (0.5


B. Next step, we will need to perform cross-validation by perform partitioning our data.

Use Analytic Solver’s standard data partition command to partition the data into a

training set (with 50% of the observations), validation set (with 30% of the

observations), and test set (with 20% of the observations) using the default seed of

12345. (1 point)

C. Perform discriminant analysis, logistic regression, k-nearest neighbor (with

normalized inputs), single classification tree (with normalized inputs and at least 4

observations per terminal node), and manual neural network (use normalized inputs

and a single hidden layer with 3 nodes) to create a classifier for this data. How

accurate is this procedure on the training, validation, and test data sets? (1 point).

Навыки: Интеллектуальный анализ данных, Анализ и обработка данных, Big Data Sales, Excel, Python

О клиенте:
( 0 отзыв(-а, -ов) ) Cedar Park, United States

ID проекта: #34322682

9 фрилансеров(-а) готовы выполнить эту работу в среднем за $19


MASTERS IN COMPUTER SCIENCE AND SOFTWARE ARCHITECT EXCEL EXPERT Hi there, I have carefully gone through your project description and I would like to help you with this. Let me know if you have any more info that may h Больше

$20 USD за 7 дней(-я)
(1 отзыв)

Hi. Thank you for your title and I feel I am ready for your project right now. I saw your title for a position as the developer for your project. I have experienced with 12+ years of website development using HTML, CSS Больше

$20 USD за 7 дней(-я)
(1 отзыв)

As I am working in shifts in my current job, I can give proper time to this project, Currently I am working as SAP Operator ,so I have work experience of 5 years.

$20 USD за 7 дней(-я)
(0 отзывов(-а))

am a hard-working and driven individual who isn't afraid to face a challenge. I'm passionate about my work and I know how to get the job done. I would describe myself as an open and honest person who doesn't believe in Больше

$20 USD за 7 дней(-я)
(0 отзывов(-а))

Hello, I will do your task in the next few hours. I will provide a sample. kindly award the project. Thank you

$20 USD за 7 дней(-я)
(0 отзывов(-а))

Dear Client My name is Salvo and I am Python&Excel Expert. I read your description carefully and I think it is just fixed job for me. I have 6 years of experience and finished many kinds of Python&Excel project. I have Больше

$20 USD за 7 дней(-я)
(0 отзывов(-а))

Hello, I am a research scholar working in the area of Machine Learning. I have read and understood your problem statement and I have the relevant knowledge and skills to implement the same. I have worked on numerous M Больше

$25 USD за 3 дней(-я)
(0 отзывов(-а))

I am looking for work as a freelancer. Doing graphic design, data entry, content writing, etc... You can trust me 100%….I will do the work you assign with great responsibility.

$20 USD за 7 дней(-я)
(0 отзывов(-а))

Hello! I am Ahmed, I saw your project and I can help you as a Data Entry to finish your project. I will be happy if you choose me to work with you. Thanks

$10 USD за 1 день
(0 отзывов(-а))