MS6711 assignment writing, Python programming done to order

Date: 2025-03-07  Source: 合肥網 hfw.cc  Author: hfw.cc



MS6711 Data Mining
Homework 2
Instruction
This homework contains both coding and non-coding questions. Please submit two files:
1. One Word or PDF document with answers and plots for ALL questions, without coding details.
2. One Jupyter notebook with your code.
Questions 1 and 2 are about concepts; questions 3 to 6 are about coding.
Problem 1 [20 points]
We perform best subset, forward stepwise, and backward stepwise selection on the same dataset with p
predictors. For each approach, we obtain p + 1 models containing 0, 1, 2, ..., p predictors. Explain your
answers.
1. Which of the three models with the same number k of predictors has the smallest training RSS?
2. Which of the three models with the same number k of predictors has the smallest test RSS? (best
subset, forward, backward, or cannot determine?)
3. True or False: The predictors in the k-variable model identified by forward stepwise are a subset of
the predictors in the (k + 1)-variable model identified by forward stepwise selection.
4. True or False: The predictors in the k-variable model identified by best subset are a subset of the
predictors in the (k + 1)-variable model identified by best subset selection.
5. True or False: The lasso, relative to OLS, is less flexible and hence will give improved prediction
accuracy when its increase in bias is less than its decrease in variance.
Problem 2 [20 points]
Suppose we estimate the lasso coefficients by minimizing
$$\|Y - X\beta\|_2^2 + \lambda \|\beta\|_1$$
for a particular value of λ. For each part below, indicate which of (a) to (e) is correct and explain your answer.
1. As we increase λ from 0, the training RSS will
(a) Increase initially, and then eventually start decreasing in an inverted U shape.
(b) Decrease initially, and then eventually start increasing in a U shape.
(c) Steadily increase.
(d) Steadily decrease.
(e) Remain constant.
2. Repeat 1. for test RSS.
3. Repeat 1. for variance.
4. Repeat 1. for (squared) bias.
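As a small aid for reading the objective in Problem 2, the sketch below evaluates the RSS term plus the L1 penalty for a given coefficient vector. It only evaluates the objective; it does not minimize it. The function name `lasso_objective` and the toy numbers are made up for illustration and are not part of the assignment.

```python
def lasso_objective(X, y, beta, lam):
    """Return ||y - X beta||_2^2 + lam * ||beta||_1 for plain Python lists."""
    rss = 0.0
    for row, target in zip(X, y):
        # Fitted value for this observation: inner product of the row with beta.
        fitted = sum(x_ij * b_j for x_ij, b_j in zip(row, beta))
        rss += (target - fitted) ** 2
    l1_penalty = sum(abs(b_j) for b_j in beta)
    return rss + lam * l1_penalty


# Toy data, invented purely for illustration.
X = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]
y = [1.0, 2.0, 3.0]
beta = [0.5, -0.25]

print(lasso_objective(X, y, beta, lam=0.0))  # RSS term only
print(lasso_objective(X, y, beta, lam=2.0))  # RSS plus the L1 penalty
```

With `lam=0.0` the function returns the plain residual sum of squares, so the difference between the two printed values is exactly `lam` times the sum of absolute coefficients.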
Problem 3 [20 points]
These data record the level of atmospheric ozone concentration from eight daily meteorological measurements
made in the Los Angeles basin in 1976. We have the 330 complete cases. We want to find
climate/weather factors that impact ozone readings. Ozone is a hazardous byproduct of burning fossil
fuels and can harm lung function. The variables in the data set for this problem are:
Variable name   Definition
ozone           Long Maximum Ozone
vh              Vandenberg 500 mb Height
wind            Wind speed (mph)
humidity        Humidity (%)
temp            Sandburg AFB Temperature
ibh             Inversion Base Height
dpg             Daggett Pressure Gradient
ibt             Inversion Base Temperature
vis             Visibility (miles)
doy             Day of the Year
[Note: I recommend that you use R for this question, since Python does not have a package for
forward / backward selection. See the code example on Canvas. Alternatively, you may use the sample
Python code I provided.]
1. Report result of linear regression using all variables. Note that ozone is the response variable to
predict. What variables are significant?
2. Report the selected variables using the following model selection approaches.
(a) All subset selection.
(b) Forward stepwise
(c) Backward stepwise
3. Compare the outcome of these methods with the significant variables found in the full linear regression in question 1.
4. Potentially, other transformations of the covariates might be important. What happens if you do all
subset selection using both the original variables and their squares? That is, for every variable $X$, include
both $X$ and $X^2$ in the linear regression model for all subset selection.
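For readers working in Python, the forward stepwise approach in item 2(b) can be sketched with NumPy alone. This is a minimal illustration, not the sample code mentioned in the note: it greedily adds the predictor that lowers training RSS the most, and the helper names `rss_of_fit` and `forward_stepwise` and the synthetic data are assumptions made for the example. Choosing among the resulting model sizes would still need a separate criterion.

```python
import numpy as np


def rss_of_fit(X, y, columns):
    """Training RSS of an OLS fit with an intercept plus the given columns."""
    design = np.column_stack([np.ones(len(y))] + [X[:, j] for j in columns])
    coef, *_ = np.linalg.lstsq(design, y, rcond=None)
    residuals = y - design @ coef
    return float(residuals @ residuals)


def forward_stepwise(X, y):
    """Return a list of (selected column indices, training RSS), one per step."""
    remaining = list(range(X.shape[1]))
    selected, path = [], []
    while remaining:
        # Greedy step: add the single predictor that lowers training RSS the most.
        best = min(remaining, key=lambda j: rss_of_fit(X, y, selected + [j]))
        selected.append(best)
        remaining.remove(best)
        path.append((list(selected), rss_of_fit(X, y, selected)))
    return path


# Synthetic data for illustration only: y depends on columns 0 and 2.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 4))
y = 3.0 * X[:, 0] - 2.0 * X[:, 2] + rng.normal(scale=0.1, size=200)

path = forward_stepwise(X, y)
for columns, rss in path:
    print(columns, round(rss, 2))
```

For item 4, the same function could be run on an expanded matrix such as `np.hstack([X, X ** 2])`, which appends the square of every column.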
Problem 4 [20 points]
In this exercise, we will predict the number of applications received using the other variables in the College
data set.
Private       Public/private school indicator
Apps          Number of applications received
Accept        Number of applicants accepted
Enroll        Number of new students enrolled
Top10perc     New students from top 10% of high school class
Top25perc     New students from top 25% of high school class
F.Undergrad   Number of full-time undergraduates
P.Undergrad   Number of part-time undergraduates
Outstate      Out-of-state tuition
Room.Board    Room and board costs
Books         Estimated book costs
Personal      Estimated personal spending
PhD           Percent of faculty with Ph.D.
Terminal      Percent of faculty with terminal degree
S.F.Ratio     Student faculty ratio
perc.alumni   Percent of alumni who donate
Expend        Instructional expenditure per student
Grad.Rate     Graduation rate
1. Split the data set into a training set and a test set.
2. Fit a linear regression model using OLS on the training set, and report the test error obtained.
3. Fit a ridge regression model on the training set, with λ chosen by cross-validation. Report the test
error obtained.
4. Fit a lasso model on the training set, with λ chosen by cross-validation. Report the test error
obtained, along with the number of non-zero coefficient estimates.
5. Fit a PCR model on the training set, with the number of components M chosen by cross-validation. Report
the test error obtained, along with the value of M selected by cross-validation.
6. Fit a PLS model on the training set, with the number of components chosen by cross-validation. Report
the test error obtained, along with the number of components selected by cross-validation.
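The workflow in steps 1 and 3 (hold out a test set, choose λ by cross-validation on the training set only, then report test error) can be sketched as follows. This is an illustrative NumPy version using the closed-form ridge solution on synthetic data; the helper names `ridge_fit` and `cv_mse`, the λ grid, and the fold count are assumptions, and in practice a library routine and standardized predictors would usually be used.

```python
import numpy as np


def ridge_fit(X, y, lam):
    """Closed-form ridge on centered data, so the intercept is not penalized."""
    x_mean, y_mean = X.mean(axis=0), y.mean()
    Xc, yc = X - x_mean, y - y_mean
    beta = np.linalg.solve(Xc.T @ Xc + lam * np.eye(X.shape[1]), Xc.T @ yc)
    return y_mean - x_mean @ beta, beta


def cv_mse(X, y, lam, n_folds=5, seed=0):
    """Average held-out MSE of ridge regression over n_folds random folds."""
    idx = np.random.default_rng(seed).permutation(len(y))
    folds = np.array_split(idx, n_folds)
    errors = []
    for k in range(n_folds):
        val = folds[k]
        train = np.concatenate([folds[j] for j in range(n_folds) if j != k])
        intercept, beta = ridge_fit(X[train], y[train], lam)
        errors.append(np.mean((y[val] - (intercept + X[val] @ beta)) ** 2))
    return float(np.mean(errors))


# Synthetic stand-in for the real data (illustration only).
rng = np.random.default_rng(1)
X = rng.normal(size=(300, 10))
y = X @ rng.normal(size=10) + rng.normal(size=300)

# Step 1: split into training and test sets.
perm = rng.permutation(len(y))
train_idx, test_idx = perm[:200], perm[200:]
X_tr, y_tr, X_te, y_te = X[train_idx], y[train_idx], X[test_idx], y[test_idx]

# Step 3: pick lambda by cross-validation on the training set, then score on the test set.
lambdas = [0.01, 0.1, 1.0, 10.0, 100.0]
best_lam = min(lambdas, key=lambda lam: cv_mse(X_tr, y_tr, lam))
intercept, beta = ridge_fit(X_tr, y_tr, best_lam)
test_mse = float(np.mean((y_te - (intercept + X_te @ beta)) ** 2))
print(best_lam, round(test_mse, 3))
```

Calling `ridge_fit` with `lam=0.0` reduces to ordinary least squares, which gives a baseline for step 2 using the same split.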
Problem 5 [20 points]
We will now try to predict per capita crime rate in the Boston data set.
crim      per capita crime rate by town.
zn        proportion of residential land zoned for lots over 25,000 sq.ft.
indus     proportion of non-retail business acres per town.
chas      Charles River dummy variable (= 1 if tract bounds river; 0 otherwise).
nox       nitrogen oxides concentration (parts per 10 million).
rm        average number of rooms per dwelling.
age       proportion of owner-occupied units built prior to 1940.
dis       weighted mean of distances to five Boston employment centres.
rad       index of accessibility to radial highways.
tax       full-value property-tax rate per $10,000.
ptratio   pupil-teacher ratio by town.
black     1000(Bk − 0.63)^2, where Bk is the proportion of blacks by town.
lstat     lower status of the population (percent).
medv      median value of owner-occupied homes in $1000s.
1. Try out some of the regression methods explored in this chapter, such as best subset selection, the
lasso, ridge regression, PCR and partial least squares. Present and discuss results for the approaches
that you consider.
2. Propose a model (or set of models) that seems to perform well on this data set, and justify your
answer. Make sure that you are evaluating model performance using validation set error, cross-validation,
or some other reasonable alternative, as opposed to using training error.
3. Does your chosen model involve all of the features in the data set? Why or why not?
Problem 6 [20 points]
In a bike sharing system the process of obtaining membership, rental, and bike return is automated
via a network of kiosk locations throughout a city. In this problem, you will try to combine historical
usage patterns with weather data to forecast bike rental demand in the Capital Bikeshare program in
Washington, D.C.
You are provided hourly rental data collected from the Capital Bikeshare system spanning two years.
The file Bike train.csv, as the training set, contains data for the first 19 days of each month, while
Bike test.csv, as the test set, contains data from the 20th to the end of the month. The dataset includes
the following information:
daylabel                 day number ranging from 1 to 731
year, month, day, hour   hourly date
season                   1 = spring, 2 = summer, 3 = fall, 4 = winter
holiday                  whether the day is considered a holiday
workingday               whether the day is neither a weekend nor a holiday
weather                  1 = clear, few clouds, partly cloudy
                         2 = mist + cloudy, mist + broken clouds, mist + few clouds, mist
                         3 = light snow, light rain + thunderstorm + scattered clouds, light rain
                         4 = heavy rain + ice pellets + thunderstorm + mist, snow + fog
temp                     temperature in Celsius
atemp                    'feels like' temperature in Celsius
humidity                 relative humidity
wind speed               wind speed
count                    number of total rentals, outcome variable to predict
Predictions will be evaluated using the root mean squared error (RMSE), calculated as
$$\mathrm{RMSE} = \sqrt{\frac{1}{n} \sum_{i=1}^{n} (y_i - \hat{y}_i)^2}$$
where $y_i$ is the true count, $\hat{y}_i$ is the prediction, and n is the number of entries to be evaluated.
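The RMSE defined above can be computed directly from paired lists of true and predicted counts. The sketch below uses only the standard library; the function name `rmse` and the example numbers are illustrative, not taken from the assignment data.

```python
import math


def rmse(y_true, y_pred):
    """Root mean squared error between two equal-length sequences."""
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    squared_errors = [(t - p) ** 2 for t, p in zip(y_true, y_pred)]
    return math.sqrt(sum(squared_errors) / len(squared_errors))


# Made-up counts and predictions, for illustration only.
print(rmse([3, 5, 8], [2, 5, 10]))
```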
Build a model on the training dataset to predict the bikeshare counts for the hours recorded in the test
dataset. Report your prediction RMSE on the test set.
Some tips
• This is a relatively open question; you may use any model you learned in this class.
• It will be helpful to examine the data graphically to spot any seasonal pattern or temporal trend.
• There is one day in the training data with a weird atemp record and another day with abnormal
humidity. Find those rows and think about what you want to do with them. Is there anything
unusual in the test data?
• It might be helpful to transform the count to log(count + 1). If you do that, do not forget to
transform your predicted values back to counts.
• Think about how you would include each predictor into the model, as continuous or as categorical?
• Is there any transformation of the predictors or interactions between them that you think might be
helpful?
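The log(count + 1) tip can be sketched with the standard library functions `math.log1p` and `math.expm1`, which are exact inverses of each other. The helper names below are hypothetical, and clipping back-transformed predictions at zero is an assumption of this sketch (rental counts cannot be negative), not something the assignment prescribes.

```python
import math


def to_log_scale(counts):
    """Transform raw counts to log(count + 1) before fitting a model."""
    return [math.log1p(c) for c in counts]


def to_count_scale(log_predictions):
    """Map predictions made on the log scale back to counts."""
    # expm1 inverts log1p; negative results are clipped to zero (sketch assumption).
    return [max(0.0, math.expm1(v)) for v in log_predictions]


# Made-up hourly counts, for illustration only.
counts = [0, 1, 16, 250]
log_counts = to_log_scale(counts)
recovered = to_count_scale(log_counts)
print(log_counts)
print(recovered)
```

Evaluating RMSE on the log scale would give a different number from the one defined for this problem, so the back-transform should happen before the error is computed.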
Try to summarize your exploration of the data and your modeling process. You may fit a few models and
choose one of them. You will receive points based on your write-up and test RMSE. This is not a
competition among the class to achieve the minimal RMSE, but your result should be in a reasonable
range.

