代写CS 61B、代做java编程设计
Project 2C: Ngordnet Enhancements | CS 61B Spring 2024
1/11
CS 61B
Projects / Project 2C: Ngordnet Enhancements
Each assignment will have an FAQ linked at the top. You can also access it by adding “/faq” to
the end of the URL. The FAQ for Project 2C is located here.
In this project, you?ll complete your implementation of the NGordnet for k!=0 and
commonAncestors case.
As this is a quite new project, there may be occasional bugs or confusion with the spec. If you
notice anything of this sort, please post on Ed.
DANGER
Please read through the 2B spec before starting 2C.
DANGER
THE SETUP FOR THIS PROJECT IS DIFFERENT THAN THE OTHER LABS / PROJECTS.
PLEASE DO NOT SKIP THIS STEP!
Project 2C: Ngordnet Enhancements
FAQ
Checkpoint & Design Doc Due 03/15/2024
Coding Due 04/01/2024
Project Setup
Skeleton Setup
Similar to other assignments in this class, run git pull skeleton main to get the skeleton
code for this project.
1
NOTE: You?ll notice that this skeleton is (almost) the exact same as the Project 2B
skeleton. This is intentional.
a
Download the data files for this project using this link and move them into your proj2c
folder on the same level as src .
2
Project 2C: Ngordnet Enhancements | CS 61B Spring 2024
2/11
Once you are done, your proj2c directory should look like this:
WARNING
While you can (and should!) certainly design for 2C in advance, we suggest only starting to
code after you get a full score on Project 2B just in case your implementation has any
subtle bugs in it.
WARNING
IMPORTANT NOTE: You should really complete Project 2B/C: Checkpoint first before
starting coding, or even designing your project. It will be helpful for your understanding of
the project. We will also require you to submit a design document to Gradescope. More
details about the design document can be found in Deliverables and Scoring.
This part of the project is designed for you to come up with an efficient and correct design for
your implementation. The design you come up with will be very important to handle these
cases. Please read the 2B & 2C spec carefully before starting your design document.
Copy your implementation from 2A for ngrams , including TimeSeries and NGramMap , into
the proj2c folder.
3
Copy your implementation from 2B into the proj2c folder, since k!=0 &
commonAncestors will depend on your implementation from 2A and 2B.
4
proj2c
├── data
│ ├── ngrams
│ └── wordnet
├── src
│ ├── <2B helper files>
│ ├── browser
│ ├── main
│ ├── ngrams
│ │ ├── <Your NGramMap implementation from 2A>
│ │ └── <Your TimeSeries implementation from 2A>
│ └── plotting
├── static
└── tests
Copy
Getting Started
Project 2C: Ngordnet Enhancements | CS 61B Spring 2024
3/11
We?ve created two wonderful tools that you can (and should!) use to explore the dataset, see
how the staff solution behaves for specific inputs, and get expected outputs for your unit tests
(see Testing Your Code). We?ll link them here, as well as in other relevant parts of the spec.
Wordnet Visualizer: Useful for visually understanding how synsets and hyponyms work and
testing different words/lists of words for potential test case inputs. Click on the “?” bubbles
to learn how to use the various features of this tool!
Staff Solution Webpage: Useful for generating expected outputs for different test case
inputs. Use this to write your unit tests!
TASK
Read through the entire 2B/C spec and complete Project 2B/C: Checkpoint
After finishing the checkpoint, complete Design Document
In Project 2B, we handled the situation where k == 0 , which is the default value when the
user does not enter a k value.
Your required task is to handle the case where the user enters k . k represents the maximum
number of hyponyms that we want in our output. For example, if someone enters the word
“dog”, and then enters k = 5 , your code would return at most 5 words.
To choose the 5 hyponyms, you should return the k words which occurred the most times in
the time range requested. For example, if someone entered words = ["food", "cake"] ,
startYear = 1950 , endYear = 1990 , and k = 5 , then you would find the 5 most popular
words in that time period that are hyponyms of both food and cake. Here, the popularity is
defined as the total number of times the word appears over the entire time period requested.
The words should then be returned in alphabetical order. In this case, the answer is [cake,
cookie, kiss, snap, wafer] if we?re using top_14377_words.csv , total_counts.csv ,
synsets.txt , and hyponyms.txt .
DANGER
Be sure you are getting the words that appear with the highest counts, not the highest
weights. Otherwise, you will run into issues that are very difficult to debug!
Note that if the frontend doesn?t supply a year, default values of startYear = 1900 and endYear
= 2020 are provided by NGordnetQueryHandler.readQueryMap .
It might be hard to figure out the hyponyms of the words with k != 0 so we are providing
data that is easier to visualize! Below, you?ll see a modified version for EECS class
?
?
Handling k != 0
Project 2C: Ngordnet Enhancements | CS 61B Spring 2024
4/11
requirements, inspired by HKN. We have also provided the data that represents the graph
below ( frequency-EECS.csv , hyponyms-EECS.txt , synsets-EECS.txt ). If someone entered
words = ["CS61A"] , startYear = 2010 , endYear = 2020 , and k = 4 , you should receive
"[CS170, CS61A, CS61B, CS61C]" . This frequency-EECS.csv is a bit different from the
previous one since it has values with the same frequencies. We highly recommend you to take
a look at frequency-EECS.csv . Also, while you are designing your implementation, bear this in
mind that we can give you words with the same frequencies.
Project 2C: Ngordnet Enhancements - EECS Course Guide Edited 1 month ago
If a word never occurs in the time frame specified, i.e. the count is zero, it should not be
returned. In other words, if k > 0 , we should not show any words that do not appear in the
ngrams dataset.
If there are no words that have non-zero counts, you should return an empty list, i.e. [] .
If there are fewer than k words with non-zero counts, return only those words. For example if
you enter the word "potato" and enter k = 15 , but only 7 hyponyms of "potato" have
non-zero counts, you?d return only 7 words.
This task will be a little trickier since you?ll need to figure out how to pass information around
so that the HyponymsHandler knows how to access a useful NGramMap .
TASK
Project 2C: Ngordnet Enhancements | CS 61B Spring 2024
5/11
Modify your HyponymsHandler and the rest of your implementation to deal with the k !=
0 case.
WARNING
EECS-course guide is not available on the interactive web staff solution so it won?t return
anything if you give the input CS61A .
DANGER
DO NOT MAKE A STATIC NGRAMMAP FOR THIS TASK! It might be tempting to simply
make some sort of public static NGramMap that can be accessed from anywhere in your
code. This is called a "global variable".
We strongly discourage this way of thinking about programming, and instead suggest that
you should be passing an NGramMap to either constructors or methods. We?ll come back
to talking about this during the software engineering lectures.
Until you use the autograder, you?ll need to construct your own test cases. We provided
one in the previous section: words = ["food", "cake"] , startYear = 1950 , endYear =
1990 , k = 5 .
When constructing your own test cases, consider making your own input files. Using the
large input files we provide is extremely tedious.
Up until now, we have only been concerned with finding the common hyponyms of words. For
the last part of this project, your task is to find the common ancestors.
That is, given a set of words, what words contain the given set of words as hyponyms?
For example, consider synsets16.txt and hyponyms16.txt from 2B:
Tips
?
?
Finding Common Ancestors
Project 2C: Ngordnet Enhancements | CS 61B Spring 2024
6/11
If we find the ancestors of "adjustment" , we should get "[adjustment, alteration, event,
happening, modification, natural_event, occurrence, occurrent]" , as shown in the
graph below.
This also should apply to words in multiple contexts, as seen with "change" :
Project 2C: Ngordnet Enhancements | CS 61B Spring 2024
7/11
The ancestors of "change" should be "[act, action, alteration, change, event,
happening, human_action, human_activity, modification, natural_event, occurrence,
occurrent]" .
We can also ask for the common ancestors of sets of words, which can reveal some neat
relationships!
Here, we find the common ancestors of the words = ["change", "adjustment"] . The result
should be "[alteration, event, happening, modification, natural_event, occurrence,
Project 2C: Ngordnet Enhancements | CS 61B Spring 2024
8/11
occurrent]" , which are all the words in the graph that contain both "change" and
"adjustment" as hyponyms. Note that "alteration" and "modification" are also included
in the result, contrary to what you might expect, as explained below.
Note: Be sure to take a word intersection rather than a node intersection just as in 2B, so the
common ancestors of ["test_subject", "math"] in the following graph should return "
[subject]" , as "subject" contains both "test_subject" and "math" as hyponyms, even
though "test_subject" and "math" are not directly connected in the graph.
We may also ask for common ancestors of three or more words.
Note that the outputs are in alphabetical order, and keep in mind that k != 0 can also apply
to this task.
Your query handling needs to remain efficient for common ancestors (i.e., the timeouts applied
to 2B still apply here). This means that going through every single word and checking if it
contains all the words in the query as hyponyms will be too slow on the larger datasets!
You will need to modify your HyponymsHandler class to account for the type of query, i.e.,
hyponyms or common ancestors. This should look similar to how you found startYear ,
endYear , or k , and this will be specified for you with NgordnetQueryType.HYPONYMS or
NgordnetQueryType.ANCESTORS , respectively.
TASK
Modify your HyponymsHandler and the rest of your implementation to handle common
ancestor queries in addition to hyponym queries.
As mentioned before, you should not need to copy-paste your code or do anything too drastic
to handle this task. Consider how you can use the same data structures and methods from
before to solve this problem, perhaps with a few tweaks.
NgordnetQueryType
Design Tips
Project 2C: Ngordnet Enhancements | CS 61B Spring 2024
9/11
Helper methods are your friends! If you find yourself writing similar code more than once,
consider making a helper method that you can call from both places that does the common
work for you.
For Project 2C, the only required deliverable is the HyponymsHandler.java file, in addition to
any helper classes. However, we will not be directly grading these classes, since they can vary
from student to student.
Project 2B/C: Checkpoint: 5 points - Due March 15th
Project 2C Coding: 25 points - Due April 1st
HyponymsHandler popularity-hardcoded: 20%, k != 0
HyponymsHandler popularity-randomized: 30%, k != 0
HyponymsHandler common-ancestors: 50%
In addition to Project 2C, you will also have to turn in your design document. This will be worth
5 points and it is due March 15th. The design document?s main purpose is to serve as a
foundation for your project. It is important to think and ideate before coding. What we are
looking for in the design document:
Identify the data structures we have learned in the class that you will be using in your
implementation.
Pseudocode / general overview of your algorithm for your implementation.
Your design document should be around 1 - 2 pages long. Design document will be mainly
graded on effort, thought and completion.
Please make a copy of this template and submit to Gradescope.
Don?t worry if you decide to change your design document after. You are free to do so! We
want you to think about the implementation before coding therefore we require you to submit
your design as the part of the project.
The token limiting policy for this project will be as follows: You will start with 8 tokens, each of
which has a 24-hour refresh time.
We?ve provided you with two short unit test files for this project in the proj2c/tests
directory:
TestOneWordKNot0Hyponyms.java
Deliverables and Scoring
Project 2C: Ngordnet Enhancements | CS 61B Spring 2024
10/11
TestCommonAncestors.java
These test files are not comprehensive; in fact, they each only contain one sanity check
test. You should fill each file with more unit tests, and also use them as a template to create
two new test files for the respective cases.
If you need help figuring out what the expected outputs of your tests should be, you should
use the two tools that we linked in the Getting Started section.
Use the small files while testing! This decreases the startup time to run Main.java and
makes it easier to reason about the code. If you?re running Main.java , these files are set in
the first few lines of the main method. For unit tests, the file names are passed into the
getHyponymsHandler method.
You can run Main.java with the debugger to debug different inputs quickly. After clicking
the “Hyponyms” button, your code will execute with the debugger - breakpoints will be
triggered, you can use the variables window, etc.
There are a lot of moving parts to this project. Don?t start by debugging line-by-line.
Instead, narrow down which function/region of your code is not working correctly then
search more closely in those lines.
Check the FAQ for common issues and questions.
Throughout this assignment, we?ve had you use your front end to test your code. Our grader is
not sophisticated enough to pretend to be a web browser and call your code. Instead, we?ll
need you to provide a method in the proj2c_testing.AutograderBuddy class that provides a
handler that can deal with hyponyms requests.
When you ran git pull skeleton main at the start of this spec, you should have received a
file called AutograderBuddy.java
Just like 2B, open AutograderBuddy.java and fill in the getHyponymsHandler method such
that it returns a HyponymsHandler that uses the four given files. Your code here will probably
be similar to your code in Main.java .
Now that you?ve created proj2c.testing.AutograderBuddy , you can submit to the
autograder. If you fail any tests, you should be able to replicate them locally as JUnit tests by
building on the test files above. If any additional datafiles are needed, they will be added to this
section as links.
Submitting Your Code
Project 2C: Ngordnet Enhancements | CS 61B Spring 2024
11/11
If you?d like to go above and beyond in this project (and even explore some front-end
development), read through the Optional Features spec!
The WordNet part of this assignment is loosely adapted from Alina Ene and Kevin Wayne?s
Wordnet assignment at Princeton University.
Optional Extra Features
Acknowledgements
请加QQ:99515681 邮箱:99515681@qq.com WX:codinghelp
- Telegram怎么批量私信?电报群发拉群营销软件上线!
- 聊城物流网:连接世界的物流枢纽,助力城市腾飞
- Ig博主采集工具,Instagram全球采集软件,ins群发营销助手
- 北京爱尔英智眼科丁雪主任教你这样自测白内障,不抓瞎!
- WhatsApp全球拉群,ws协议号一键注册/ws群发策略/ws养号技巧
- FIT1047代做、Python/c++程序语言代写
- instagram爆粉神器,智能采集用户,助你实现社交引流!
- Vision Pro有了,谁是空间计算时代的“安卓机皇”?
- 北斗筛选全球app云筛的艺术:日筛选量可达2亿-3亿的号码量筛选
- COMP3310代做、代写C++, Java/Python编程
- COM6521代做、代写c/c++编程设计
- 掌上汽车:引领智能出行新风尚
- 数字革命 WhatsApp拉群新功能引领市场热潮 我们的工具告诉你原因
- 有知情人士透露,京东内部仍在频繁接触知名行业专家董宇辉
- Instagram引流神器 - ins采集软件/ig采集助手/ins群发助手
- 数字领袖 专家亲述 WhatsApp拉群营销工具如何激发好奇心 助业务成功腾飞
- Instagram群发筛选软件,Ins群发注册工具,助你轻松营销!
- 数字大法宝 科技魔法师揭秘 在WhatsApp拉群营销工具的世界里开启业务新篇章
- 全方位Telegram代群发,助力品牌全球曝光
- 代做COMP10002、c++编程设计代写
- 全新网络通信标识推广方案,Line协议号注册器助您一站式解决!
- Instagram群发软件 - ins自动登录/ig采集指定地区/ins群发助手/ig采集神器
- WhatsApp群发软件,ws/WhatsApp营销软件/ws协议号出售/ws筛选器
- 低成本引流推广!Ins自动私信工具,Instagram营销软件推荐!
- 代写CSC 330、代做C/C++编程语言
- telegram群发软件,tg营销软件,2024年海外获客神器上线!
- WhatsApp群发软件,ws营销软件/ws协议号/ws拉群/ws业务咨询大轩
- 海外推广利器,Line协议号注册器为您的品牌在国际市场闪耀登场!
- 解锁数字商海的全新纪元:2024年WhatsApp筛选器拉群人才引领行业巅峰
- WhatsApp营销软件,ws拉群/ws代拉/ws代发/ws协议号/ws业务咨询大轩
推荐
- 如何经营一家好企业,需要具备什么要素特点 我们大多数人刚开始创办一家企业都遇到经营 科技
- 智慧驱动 共创未来| 东芝硬盘创新数据存储技术 为期三天的第五届中国(昆明)南亚社会公共安 科技
- 苹果罕见大降价,华为的压力给到了? 1、苹果官网罕见大降价冲上热搜。原因是苹 科技
- 全力打造中国“创业之都”名片,第十届中国创业者大会将在郑州召开 北京创业科创科技中心主办的第十届中国创业 科技
- 疫情期间 这个品牌实现了疯狂扩张 记得第一次喝瑞幸,还是2017年底去北京出差的 科技
- B站更新决策机构名单:共有 29 名掌权管理者,包括陈睿、徐逸、李旎、樊欣等人 1 月 15 日消息,据界面新闻,B站上周发布内部 科技
- 丰田章男称未来依然需要内燃机 已经启动电动机新项目 尽管电动车在全球范围内持续崛起,但丰田章男 科技
- 创意驱动增长,Adobe护城河够深吗? Adobe通过其Creative Cloud订阅捆绑包具有 科技
- 升级的脉脉,正在以招聘业务铺开商业化版图 长久以来,求职信息流不对称、单向的信息传递 科技
- 老杨第一次再度抓握住一瓶水,他由此产生了新的憧憬 瘫痪十四年后,老杨第一次再度抓握住一瓶水,他 科技