IEMS 5730 Spring 2024 Homework 2
Release date: Feb 23, 2024
Due date: Mar 11, 2024 (Monday) 11:59:00 pm
We will discuss the solution soon after the deadline. No late homework will be accepted!
Every Student MUST include the following statement, together with his/her signature in the submitted homework.
I declare that the assignment submitted on Elearning system is original except for source material explicitly acknowledged, and that the same or related material has not been previously submitted for another course. I also acknowledge that I am aware of University policy and regulations on honesty in academic work, and of the disciplinary guidelines and procedures applicable to breaches of such policy and regulations, as contained in the website http://www.cuhk.edu.hk/policy/academichonesty/.
Signed (Student_________________________) Date:______________________________ Name_________________________________ SID_______________________________
Submission notice:
● Submit your homework via the elearning system.
● All students are required to submit this assignment.
General homework policies:
A student may discuss the problems with others. However, the work a student turns in must be created COMPLETELY by oneself ALONE. A student may not share ANY written work or pictures, nor may one copy answers from any source other than one’s own brain.
Each student MUST LIST on the homework paper the name of every person he/she has discussed or worked with. If the answer includes content from any other source, the student MUST STATE THE SOURCE. Failure to do so is cheating and will result in sanctions. Copying answers from someone else is cheating even if one lists their name(s) on the homework.
If there is information you need to solve a problem, but the information is not stated in the problem, try to find the data somewhere. If you cannot find it, state what data you need, make a reasonable estimate of its value, and justify any assumptions you make. You will be graded not only on whether your answer is correct, but also on whether you have done an intelligent analysis.
Submit your output, explanation, and your commands/scripts in one SINGLE pdf file.

 Q1 [20 marks + 5 Bonus marks]: Basic Operations of Pig
You are required to perform some simple analysis using Pig on the n-grams dataset of Google books. An ‘n-gram’ is a phrase with n words. The dataset lists all n-grams present in books from books.google.com along with some statistics.
In this question, you only use the Google Books 1-gram dataset (a 1-gram is a single word). Please go to References [1] and [2] to download the two data files. Each line in these files has the following format (TAB separated):
bigram year match_count volume_count
An example for 1-grams would be:
circumvallate 1978 335 91
circumvallate 1979 261 95
This means that in 1978 (1979), the word "circumvallate" occurred 335 (261) times overall, across 91 (95) distinct books.
(a) [Bonus 5 marks] Install Pig in your Hadoop cluster. You can reuse your Hadoop cluster from IEMS 5730 HW#0 and refer to the following link to install Pig 0.17.0 on the master node of your Hadoop cluster:
http://pig.apache.org/docs/r0.17.0/start.html#Pig+Setup
Submit the screenshot(s) of your installation process.
If you choose not to do the bonus question in (a), you can use any well-installed Hadoop cluster, e.g., the IE DIC or a Hadoop cluster provided by Google Cloud/AWS [5, 6, 7], to complete the following parts of the question:
(b) [5 marks] Upload these two files to HDFS and join them into one table.
(c) [5 marks] For each unique bigram, compute its average number of occurrences per year. In the above example, the result is:
circumvallate (335 + 261) / 2 = 298
Notes: The denominator is the number of years in which that word has appeared. Assume the data set contains all the 1-grams in the last 100 years, and the above records are the only records for the word 'circumvallate'. Then the average value is:
(335 + 261) / 2 = 298, instead of (335 + 261) / 100 = 5.96
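Before writing the Pig script, the averaging rule above can be checked locally with a short Python sketch (the in-memory records below are just the example rows from this question, not the real data files):

```python
from collections import defaultdict

# Each record: (word, year, match_count, volume_count), TAB-separated in the real files.
records = [
    ("circumvallate", 1978, 335, 91),
    ("circumvallate", 1979, 261, 95),
]

# Sum match_count and count the years each word appears in, then divide:
# the denominator is the number of years with a record, not a fixed 100.
totals = defaultdict(lambda: [0, 0])  # word -> [sum of match_count, number of years]
for word, year, match_count, volume_count in records:
    totals[word][0] += match_count
    totals[word][1] += 1

averages = {word: total / years for word, (total, years) in totals.items()}
print(averages["circumvallate"])  # (335 + 261) / 2 = 298.0
```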
(d) [10 marks] Output the 20 bigrams with the highest average number of occurrences per year, along with their corresponding average values, sorted in descending order. If multiple bigrams have the same average value, write down any one you like (that is, break ties as you wish).
You need to write a Pig script to perform this task and save the output into HDFS.
Hints:
● This problem is very similar to the word-counting example shown in the lecture notes of Pig. You can reuse the code there with only minor changes to perform this task.
Q2 [20 marks + 5 bonus marks]: Basic Operations of Hive
In this question, you are asked to repeat Q1 using Hive and then compare the performance between Hive and Pig.
(a) [Bonus 5 marks] Install Hive on top of your own Hadoop cluster. You can reuse your Hadoop cluster in IEMS 5730 HW#0 and refer to the following link to install Hive 2.3.8 over the master node of your Hadoop cluster.
https://cwiki.apache.org/confluence/display/Hive/GettingStarted
Submit the screenshot(s) of your installation process.
If you choose not to do the bonus question in (a), you can use any well-installed Hadoop cluster, e.g., the IE DIC or a Hadoop cluster provided by Google Cloud/AWS [5, 6, 7].
(b) [20 marks] Write a Hive script to perform exactly the same task as that of Q1, using the same datasets stored in HDFS. Rerun the Pig script on this cluster, compare the performance of Pig and Hive in terms of overall run-time, and explain your observations.
Hints:
● Hive stores its tables on HDFS, and those locations need to be bootstrapped first:
$ hdfs dfs -mkdir /tmp
$ hdfs dfs -mkdir /user/hive/warehouse
$ hdfs dfs -chmod g+w /tmp
$ hdfs dfs -chmod g+w /user/hive/warehouse
● While working with the interactive shell (or otherwise), you should first test on a small subset of the data instead of the whole data set. Once your Hive commands/scripts work as desired, you can then run them on the complete data set.
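Since HiveQL is close to standard SQL, the grouping query for this task can be prototyped on a tiny sample using SQLite from Python before running it on the cluster (the table and column names below are illustrative, not prescribed by the assignment):

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE ngrams (word TEXT, year INTEGER, match_count INTEGER, volume_count INTEGER)"
)
conn.executemany(
    "INSERT INTO ngrams VALUES (?, ?, ?, ?)",
    [("circumvallate", 1978, 335, 91), ("circumvallate", 1979, 261, 95)],
)

# Average occurrences per year: total match_count divided by the
# number of distinct years in which the word appears.
rows = conn.execute(
    """
    SELECT word, SUM(match_count) * 1.0 / COUNT(DISTINCT year) AS avg_per_year
    FROM ngrams
    GROUP BY word
    ORDER BY avg_per_year DESC
    LIMIT 20
    """
).fetchall()
print(rows)  # [('circumvallate', 298.0)]
```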
 
 Q3 [30 marks + 10 Bonus marks]: Similar Users Detection in the MovieLens Dataset using Pig
Similar-user detection has drawn a lot of attention in the machine learning field; it aims to group users with similar interests, behaviors, actions, or general patterns. In this homework, you will implement a similar-user-detection algorithm for an online movie rating system. Basically, users who give similar ratings to the same movies may have common tastes or interests and can be grouped as similar users.
To detect similar users, we need to calculate the similarity between each user pair. In this homework, the similarity between a given pair of users (e.g. A and B) is measured as the total number of movies both A and B have watched divided by the total number of movies watched by either A or B. The following is the formal definition of similarity: Let M(A) be the set of all the movies user A has watched. Then the similarity between user A and user B is defined as:
Similarity(A, B) = |M(A) ∩ M(B)| / |M(A) ∪ M(B)| ...........(**)
where |S| means the cardinality of set S.
(Note: if |𝑀(𝐴)∪𝑀(𝐵)| = 0, we set the similarity to be 0.)
The following figure illustrates the idea:
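The definition in (**) is the Jaccard similarity of the two watch sets; a direct Python rendering, including the zero-union convention from the note above, is:

```python
def similarity(movies_a: set, movies_b: set) -> float:
    """Jaccard similarity between two users' watched-movie sets, per (**)."""
    union = movies_a | movies_b
    if not union:  # |M(A) ∪ M(B)| = 0 -> similarity defined as 0
        return 0.0
    return len(movies_a & movies_b) / len(union)

print(similarity({1, 2, 3}, {2, 3, 4}))  # 2 shared out of 4 total -> 0.5
print(similarity(set(), set()))          # 0.0 by convention
```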
Two datasets [3][4] with different sizes are provided by MovieLens. Each user is represented by its unique userID and each movie is represented by its unique movieID. The format of the data set is as follows:
<userID>, <movieID>
Write a program in Pig to detect the TOP K similar users for each user. You can use the cluster you built for Q1 and Q2, or you can use the IE DIC or one provided by Google Cloud/AWS [5, 6, 7].
(a) [10 marks] For each pair of users in datasets [3] and [4], output the number of movies they have both watched.
For your homework submission, you need to submit i) the Pig script and ii) the list of the 10 pairs of users having the largest number of movies watched by both users in the pair within the corresponding dataset. The format of your answer should be as follows:
<userID A>, <userID B>, <the number of movies both A and B have watched> //top 1
...
<userID X>, <userID Y>, <the number of movies both X and Y have watched> //top 10
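The pair counting in part (a) mirrors a self-join of the ratings relation on movieID. A small in-memory sketch of that logic (the (userID, movieID) pairs below are synthetic, not from the MovieLens data):

```python
from collections import defaultdict
from itertools import combinations

# Synthetic (userID, movieID) records, standing in for datasets [3]/[4].
ratings = [(1, 10), (1, 20), (2, 10), (2, 20), (3, 20)]

# Group users by movie, then emit every user pair that shares the movie
# (the Pig equivalent: JOIN the relation with itself on movieID, then GROUP).
viewers = defaultdict(set)
for user, movie in ratings:
    viewers[movie].add(user)

co_watched = defaultdict(int)  # (userA, userB) with userA < userB -> shared-movie count
for users in viewers.values():
    for a, b in combinations(sorted(users), 2):
        co_watched[(a, b)] += 1

top = sorted(co_watched.items(), key=lambda kv: -kv[1])[:10]
print(top)  # [((1, 2), 2), ((1, 3), 1), ((2, 3), 1)]
```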
(b) [20 marks] By modifying/extending part of your code in part (a), find the Top-K (K=3) most similar users (as defined by Equation (**)) for every user in datasets [3], [4]. If multiple users have the same similarity, you can just pick any three of them.
(c)
Hint:
1. In part (b), to facilitate the computation of the similarity measure defined in (**), you can use the inclusion-exclusion principle, i.e., |M(A) ∪ M(B)| = |M(A)| + |M(B)| − |M(A) ∩ M(B)|.
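With inclusion-exclusion, the union size never needs to be materialized: given the co-watch count c from part (a) and each user's total movie count, the similarity is c / (|M(A)| + |M(B)| − c). A sketch of that arithmetic:

```python
def similarity_from_counts(count_a: int, count_b: int, co_watched: int) -> float:
    """Similarity (**) via inclusion-exclusion: |A ∪ B| = |A| + |B| - |A ∩ B|."""
    union = count_a + count_b - co_watched
    return co_watched / union if union else 0.0  # union of 0 -> similarity 0

# User A watched 3 movies, B watched 4, 2 in common -> 2 / (3 + 4 - 2)
print(similarity_from_counts(3, 4, 2))  # 0.4
```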