Prepare for your exams
Get points
Guidelines and tips

Prepare for your exams

Study with the several resources on Docsity

Earn points to download

Earn points by helping other students or get them with a premium plan

Guidelines and tips

Sell on Docsity

Log in Sign up

Prepare for your exams

Study with the several resources on Docsity

Find documents

Prepare for your exams with the study notes shared by other students like you on Docsity

Search Store documents

The best documents sold by students who completed their studies

Search through all study resources

Docsity AINEW

Summarize your documents, ask them questions, convert them into quizzes and concept maps

Explore questions

Clear up your doubts by reading the answers to questions asked by your fellow students

Earn points to download

Earn points by helping other students or get them with a premium plan

Share documents

20 Points

For each uploaded document

Answer questions

5 Points

For each given answer (max 1 per day)

All the ways to get free points

Get points immediately

Choose a premium plan with all the points you need

Study Opportunities

Choose your next study program

Get in touch with the best universities in the world. Search through thousands of universities and official partners

Community

Ask the community

Ask the community for help and clear up your study doubts

University Rankings

Discover the best universities in your country according to Docsity users

Free resources

Our save-the-student-ebooks!

Download our free guides on studying techniques, anxiety management strategies, and thesis advice from Docsity tutors

From our blog

Exams and Study

Go to the blog

Machine Learning Assignment – M.Tech | PCA, K-Means, Logistic Regression (Questions), Assignments of Machine Learning

Birla Institute of Technology and Science Machine Learning

This is a Questions only from Machine Learning assignment for M.Tech Computer Science, ideal for students from Amity, BITS, or other work-integrated programs. The assignment covers core machine learning concepts with clear explanations, step-by-step calculations, and practical application of algorithms. Eigenvalues & Eigenvectors in PCA Application in dimensionality reduction Interpretation and mathematical calculation Evaluation Metrics for Predictive Models Precision, Recall, Accuracy Campaign case study analysis with detailed breakdown K-Means Clustering Manual clustering of 2D data points Cluster assignment using initial centers Principal Component Analysis (PCA) Finding the first principal component Eigenvector interpretation for a 2D dataset Logistic Regression Probability calculation using given β₀ and β₁ Real-world use case: Exam performance prediction

Typology: Assignments

2024/2025

Available from 06/28/2025

rishav-raj-singh-1 🇮🇳

3 documents

1 / 2

This page cannot be seen from the preview

Don't miss anything!

Assignment (20 Marks)

Q.1 Marks 4

In the context of machine learning, eigenvalues and eigenvectors are crucial concepts,

particularly in methods like Principal Component Analysis (PCA), Singular Value

Decomposition (SVD), and various linear transformations in feature space. These

mathematical tools help in understanding and reducing dimensionality, improving algorithm

performance, and extracting meaningful patterns from the data.

Consider a scenario where a data scientist is working on a large dataset to predict customer

churn. To simplify the predictive modeling, the scientist wants to reduce the dataset's

dimensionality without losing significant information. This is often achieved through PCA,

which requires computation of eigenvalues and eigenvectors.

To grasp the underlying concepts, the scientist starts with a simpler problem — computing the

eigenvalues of a smaller matrix. This matrix could represent a simplified transformation in a

reduced feature space. Here is the matrix in question:

A=

[

1−1 0

−1 2 1

0 1 1

]

a) Calculate Eigenvalues: The objective is to compute eigenvalues of matrix A. This will provide

insights into the variance explained by the components when matrix A is used in PCA.

b) Interpret Results: Understanding the eigenvalues helps in determining the importance of

each principal component. What is the relation between the sum of the Eigenvalues with

matrix A? What is the relation between product of the Eigenvalues with matrix A?

Q.2 Marks 4

As a senior data analyst at XYZ Corp, a leader in retail, you've been tasked with evaluating the

efficacy of a recently concluded, multifaceted campaign. This campaign, aimed at boosting the

sales of a high-profile product, spanned multiple channels including digital (emails and social

media), traditional media (TV and print), and various experimental platforms.

The primary objective is to assess the performance of a predictive model designed to identify

potential buyers among those exposed to the campaign. This assessment is crucial, as its

outcomes will guide the strategic planning for upcoming marketing efforts.

The dataset collected is extensive, including:

Engagement metrics such as number of ad impressions, clicks, and time spent on

product pages.

Viewer demographics including age, location, and previous purchase history.

Follow-up actions, specifically whether viewers purchased the product post-ad

exposure.

Additional engagement metrics unrelated to the campaign, like overall website traffic

and user interaction across non-campaign content.

Extended Data Description:

Partial preview of the text

Download Machine Learning Assignment – M.Tech | PCA, K-Means, Logistic Regression (Questions) and more Assignments Machine Learning in PDF only on Docsity!

Assignment (20 Marks)

Q.1 Marks 4

In the context of machine learning, eigenvalues and eigenvectors are crucial concepts, particularly in methods like Principal Component Analysis (PCA), Singular Value Decomposition (SVD), and various linear transformations in feature space. These mathematical tools help in understanding and reducing dimensionality, improving algorithm performance, and extracting meaningful patterns from the data. Consider a scenario where a data scientist is working on a large dataset to predict customer churn. To simplify the predictive modeling, the scientist wants to reduce the dataset's dimensionality without losing significant information. This is often achieved through PCA, which requires computation of eigenvalues and eigenvectors. To grasp the underlying concepts, the scientist starts with a simpler problem — computing the eigenvalues of a smaller matrix. This matrix could represent a simplified transformation in a reduced feature space. Here is the matrix in question:

A =

[

0 1 1 ]

a) Calculate Eigenvalues : The objective is to compute eigenvalues of matrix A. This will provide insights into the variance explained by the components when matrix A is used in PCA. b) Interpret Results : Understanding the eigenvalues helps in determining the importance of each principal component. What is the relation between the sum of the Eigenvalues with matrix A? What is the relation between product of the Eigenvalues with matrix A?

Q.2 Marks 4

As a senior data analyst at XYZ Corp, a leader in retail, you've been tasked with evaluating the efficacy of a recently concluded, multifaceted campaign. This campaign, aimed at boosting the sales of a high-profile product, spanned multiple channels including digital (emails and social media), traditional media (TV and print), and various experimental platforms. The primary objective is to assess the performance of a predictive model designed to identify potential buyers among those exposed to the campaign. This assessment is crucial, as its outcomes will guide the strategic planning for upcoming marketing efforts. The dataset collected is extensive, including:  Engagement metrics such as number of ad impressions, clicks, and time spent on product pages.  Viewer demographics including age, location, and previous purchase history.  Follow-up actions, specifically whether viewers purchased the product post-ad exposure.  Additional engagement metrics unrelated to the campaign, like overall website traffic and user interaction across non-campaign content. Extended Data Description:

 From the campaign, 1,000 viewers who clicked on the ads were randomly selected for detailed tracking.  The predictive model used in the campaign anticipated that 350 of these tracked viewers would purchase the product.  Post-campaign analysis confirmed that 200 of these viewers actually purchased the product, suggesting a need to evaluate the predictive accuracy of the model.  The model's specific predictions indicated that out of the 350 viewers it identified as likely purchasers, only 200 completed a purchase. This indicates potential overfitting or misalignment with actual consumer behavior.  Furthermore, the model identified 500 viewers who would not purchase the product, and this was accurately confirmed post-campaign.  Ancillary data noted include increased general web traffic from non-targeted geographic areas during the campaign period, possibly indicating broader market penetration or incidental traffic increases.  Other observed metrics included a 10-second average increase in site visit duration compared to the previous period, a 7% increase in social media followers, and a 5% improvement in brand recognition according to post-campaign surveys. From the complex array of data provided, task is to calculate Precision, Recall, and Accuracy of marketing campaign’s predictive model. Begin by dissecting the narrative to extract and categorize the true positives (TP), true negatives (TN), false positives (FP), and false negatives (FN) from the described outcomes. Evaluate effectiveness of the model in terms of these metrics, considering broad spectrum of collected data and the viewer behavior analysis.

Q.3 Marks 4

Cluster the following twelve points (with (x, y) representing locations) into four clusters: (3,10), (3,6), (10,5), (6,8), (8,6), (7,5), (2,3), (5,9), (9,7), (11,4), (3,7), (4,10) Take initial cluster centers as (3,10), (6,8), (2,3), and (10,5). Provide the final clusters after clustering with K-means clustering.

Q.4 Marks 4

Consider the following two-dimensional data points: (1,4), (2,3), (3,6), (4,4), (5,7), (6,5). Find the first principal component using PCA.

Q.5 Marks 4

A university conducted a study to understand the relationship between the number of practice tests completed and the probability of passing the final exam. A logistic regression model was fitted, and the parameters obtained were β 0 = −5 and β 1 = 0.8. Calculate the probability that a student will pass the final exam if they complete: a) 3 practice tests b) 8 practice tests

Machine Learning Assignment – M.Tech | PCA, K-Means, Logistic Regression (Questions), Assignments of Machine Learning

Related documents

Partial preview of the text

Download Machine Learning Assignment – M.Tech | PCA, K-Means, Logistic Regression (Questions) and more Assignments Machine Learning in PDF only on Docsity!

Assignment (20 Marks)

Q.1 Marks 4

A =

[

0 1 1 ]

Q.2 Marks 4

Q.3 Marks 4

Q.4 Marks 4

Q.5 Marks 4