CEG5304代做、代写Java/c++编程语言
Project #2 for CEG5304: Generating Images through Prompting and Diffusion-based Models.
Spring (Semester 2), AY 2023-2024
In this exploratory project, you are to explore how to generate (realistic) images via diffusion-based models (such as DALLE and Stable Diffusion) through prompting, in particular hard prompting. To recall and recap the concepts of prompting, prompt engineering, LLVM (Large Language Vision Models), and LMM (Large Multi-modal Models), please refer to the slides on Week 5 (“Lect5-DL_prompt.pdf”).
Before beginning this project, please read the following instructions carefully, failure to comply with the instructions may be penalized:
1.This project does not involve compulsory coding, complete your project with this given Word document file by filling in the “TO FILL” spaces. Save the completed file as a PDF file for submission. Please do NOT modify anything (including this instruction) in your submission file.
2.The marking of this project is based on how detailed the description and discussion are over the given questions. To score, please make sure your descriptions and discussions are readable, and adequate visualizations are provided.
3.The marking of this project is NOT based on any evaluation criteria (e.g., PSNR) over the generated image. Generating a good image does NOT guarantee a high score.
4.You may use ChatGPT/Claude or any online LLM services for polishing. However, purely using these services for question answering is prohibited (and is actually very obvious). If it is suspected that you generate your answers holistically with these online services, your assignment may be considered as committing plagiarism.
5.Submit your completed PDF on Canvas before the deadline: 1759 SGT on 20 April 2024 (updated from the slides). Please note that the deadlines are strict and late submission will be deducted 10 points (out of 100) for every 24 hours.
6.The report must be done individually. You may discuss with your peers, but NO plagiarism is allowed. The University, College, Department, and the teaching team take plagiarism very seriously. An originality report may be generated from iThenticate when necessary. A zero mark will be given to anyone found plagiarizing and a formal report will be handed to the Department/College for further investigation.
Task 1: generating an image with Stable Diffusion (via Huggingface Spaces) and compare it with the objective real image. (60%)
In this task, you are to generate an image with the Stable Diffusion model in Huggingface Spaces. The link is provided here: CLICK ME. You can play with the different prompts and negative prompts (prompts that instructs the model NOT to generate something). Your objective is to generate an image that looks like the following image:
1a) First, select a rather coarse text prompt. A coarse text prompt may not include a lot of details but should be a good starting prompt to generate images towards our objective. An example could be “A Singaporean university campus with a courtyard.”. Display your generated image and its corresponding text prompt (as well as the negative prompt, if applicable) below: (10%)
TO FILL
TO FILL
1b) Describe, in detail, how the generated image is compared to the objective image. You may include the discussion such as the components in the objective image that is missing from the generated image, or anything generated that does not make sense in the real world. (20%)
TO FILL
TO FILL
Next, you are to improve the generated image with prompt engineering. Note that it is highly likely that you may still be unable to obtain the objective image. A good reference material for prompt engineering can be found here: PROMPT ENGINEERING.
1c) Describe in detail how you improve your generated image. The description should include display of the generated images and their corresponding prompts, and detailed reasoning over the change in prompts. If the final improved image is generated with several iterations of prompt improvement, you should show each step in detail. I.e., you should display the result of each iteration of prompt change and discuss the result of each prompt change. You should also compare your improved image with both the first image you generated above, as well as the objective image. (30%)
TO FILL
TO FILL
TO FILL
Task 2: generating images with another diffusion-based model, DALL-E (mini-DALL-E, via Huggingface Spaces). (40%)
Stable Diffusion is not the only diffusion-based model that has the capability to generate good quality images. DALL-E is an alternative to Stable Diffusion. However, we are not to discuss the differences over these two models technically, but the differences over the generated images qualitatively (in a subjective manner). The link to generating with mini-DALL-E is provided here: MINI-DALL-E.
2a) You should first use the same prompt as you used in Task 1a and generate the image with mini-DALL-E. Display the generated image and compare, in detail, the new generated image with that generated by Stable Diffusion. (10%)
TO FILL
TO FILL
2b) Similar to what we performed for Stable Diffusion; you are to again improve the generated image with prompt engineering. Describe in detail how you improve your generated image. Similarly, if the final improved image is generated with several iterations of prompt improvement, you should show each step in detail. The description should include display of the generated images and their corresponding prompts, and detailed reasoning over the change in prompts. You should compare your improved image with both the first image you generated above, as well as the objective image.
In addition, you should also describe how the improvement is similar to or different from the previous improvement process with Stable Diffusion. (10%)
TO FILL
TO FILL
2c) From the generation process in Task 1 and Task 2, discuss the capabilities and limitations over image generation with off-the-shelf diffusion-based models and prompt engineering. You could further elaborate on possible alternatives or improvements that could generate images that are more realistic or similar to the objective 请加QQ:99515681 邮箱:99515681@qq.com WX:codinghelp
- 全力打造中国“创业之都”名片,第十届中国创业者大会将在郑州召开
- COMP 330代做、Python设计程序代写
- WhatsApp拉群奇妙体验:外贸小白的好奇心是如何被这工具激发的?
- 布局算力新基建,九章云极DataCanvas公司赋能AI产业高质量发展
- 群发不再是阻碍:海外营销高手亲授,WhatsApp拉群工具助您轻松应对风控挑战
- 外贸疑问时刻 WhatsApp拉群营销工具为何让我充满了好奇
- 数字领袖 专家亲述 WhatsApp拉群营销工具如何激发好奇心 助业务成功腾飞
- WhatsApp营销软件,ws群发/ws频道号/ws协议号/ws拉群/ws业务咨询大轩
- 谷器数据荣膺“协作之星”荣誉奖项
- 节能科技:塑造绿色未来,引领低碳生活
- Ins群发营销软件,Instagram打粉工具,让你的营销无往不利!
- 中国生态农业:守护绿色家园,共筑美好未来
- 外贸初探 WhatsApp拉群营销工具点亮了我在国际市场的创业之路
- 制冷设备:打造舒适环境的秘密武器
- WhatsApp营销软件/ws协议号/ws群发工具/ws自动注册
- 怎么找到靠谱的Line协议号卖家全球通证:博主推荐,LINE营销工具如何引领我业务走向国际巅峰
- Telegram协议号注册器,WS智能推广工具,带您走向网络通信的新高度
- 胡蜂养殖场:探秘自然之韵,品味甜蜜人生
- 2024深圳国际自有品牌展首创零供日,汇聚全球资源
- 中囤律商信用管理有限公司:引领企业信用修复与ISO体系认证的新篇章
- 时空商业的奇幻掌故:全球app云筛在全球用户沟通中的关键作用
- WhatsApp群发软件,ws怎么拉群引流/ws协议号/ws代拉群咨询大轩
- 外贸成长史 WhatsApp拉群工具 记录我从小白到外贸大咖的蜕变
- EB5项目考察之行:世贸通移民集团因何受美国国会议员接见
- 《热辣滚烫》掀起健康减脂潮流,植物基食品迎来“第二春”?
- PQE Group:收入超过1亿欧元
- WhatsApp拉群工具助力外贸新手 如何通过创意亮点蜕变成为营销领域的精英
- WhatsApp拉群营销新法宝,你尝试过了吗?欢迎大家一起探讨使用心得
- Telegram怎么批量私信?电报群发拉群营销软件上线!
- Instagram官方引流打粉软件,ins独家爆粉机器人爆款上线!
推荐
- 疫情期间 这个品牌实现了疯狂扩张 记得第一次喝瑞幸,还是2017年底去北京出差的 科技
- 创意驱动增长,Adobe护城河够深吗? Adobe通过其Creative Cloud订阅捆绑包具有 科技
- 智慧驱动 共创未来| 东芝硬盘创新数据存储技术 为期三天的第五届中国(昆明)南亚社会公共安 科技
- 如何经营一家好企业,需要具备什么要素特点 我们大多数人刚开始创办一家企业都遇到经营 科技
- B站更新决策机构名单:共有 29 名掌权管理者,包括陈睿、徐逸、李旎、樊欣等人 1 月 15 日消息,据界面新闻,B站上周发布内部 科技
- 苹果罕见大降价,华为的压力给到了? 1、苹果官网罕见大降价冲上热搜。原因是苹 科技
- 老杨第一次再度抓握住一瓶水,他由此产生了新的憧憬 瘫痪十四年后,老杨第一次再度抓握住一瓶水,他 科技
- 全力打造中国“创业之都”名片,第十届中国创业者大会将在郑州召开 北京创业科创科技中心主办的第十届中国创业 科技
- 丰田章男称未来依然需要内燃机 已经启动电动机新项目 尽管电动车在全球范围内持续崛起,但丰田章男 科技
- 升级的脉脉,正在以招聘业务铺开商业化版图 长久以来,求职信息流不对称、单向的信息传递 科技