Generated images not following your prompt?
Introducing 𝔻𝕣𝕖𝕒𝕞𝕊𝕪𝕟𝕔 from
@GoogleAI
: improving alignment + aesthetics of image generation models with feedback from VLMs!
✅ Model Agnostic
✅ Plug and Play
❌ RL
❌ Human Annotation
❌ Real Image
Honored to receive the 🥇BEST PAPER AWARD🥇 at CVPR 2024. Please consider using our collected fine-grained feedback!
Huge shout out to our work DreamSync, the key method we use to turn the fine-grained feedback into model improvements; details in my pinned tweet! 🚀
🌟Rich Human Feedback for Text-to-Image Generation selected as CVPR 2024 Best Paper Award Candidate (top 1%)🌟
Current text-to-image models are not perfect, but where exactly? They suffer from artifacts, misalignment, and poor aesthetics. We collect feedback on 18K images to capture all of them.
Can LLMs generate exactly 5 words? No
How about 5 sentences? No
How about 5 paragraphs? No
🤷🏻♀️
In , we evaluate the performance of LLMs on various controlled generation tasks, including numerical planning, story generation, paraphrase generation, etc. (1/n)
Today I defended my thesis and became Dr. Sun! 🌞
Thank you my committee members
@MaxMa1987
@VioletNPeng
@jonathanmay
@emilio__ferrara
and Dan O’Leary!
The slides of my presentation are here: .
Ph.D done but research never ends!
Fight on!
Thanks
@CSatUSC
for capturing one of the most important moments of my life!
Thanks to my family and my dearest advisor
@MaxMa1987
for making it happen!
#PhD
A team of collaborators from ALL different institutes? 5 female researchers + 1 high school student? I am excited that our fairness work "Pretty Princess vs. Successful Leader: Gender Roles in Greeting Card Messages" is conditionally accepted by
#CHI2022
! Stay tuned for details!
After being four-year LinkedIn-less, I’m finally back! Let’s connect and chat if you:
- are hiring and have an opening that I might be a fit for!
- are graduating, let’s go through the job searching together!
- know me or my work!
- just want to know me!
Wouldn't it be a 🌩️DISASTER if evaluation metrics always rate American English 10 times better than Indian English?
⚠️We (🔗) study dialect robustness systematically, find that current evaluation metrics are NOT robust to dialects🤯, and propose NANO🧵
Can we paraphrase sentences into desirable syntactic structures? How do we select syntactic parses that can properly guide paraphrase generation? 🤔
Our
#EMNLP2021
paper AESOP (w/
@MaxMa1987
@VioletNPeng
) proposes an adaptive way to retrieve compatible parses! 😎(1/6)
I’m working on
@eccvconf
rebuttal, and here’s one of the reviews:
“The reliance on training data may raise concerns about the model's generalizability to unseen prompts and scenarios.”
How should I rebut this? 🥲 I’m so speechless right now…
While
#Wikipedia
has been a great resource for knowledge, implicit biases can be subtle and detrimental. In our new
#ACL2021
paper (w/
@VioletNPeng
), we found that
#Wikipedia
pages intermingle professional career events with personal events in a systematically biased way.
1/5
🤶 Pretty Princess vs. Successful Leader?
Have you ever sent someone greeting cards? People write greeting card messages out of goodwill, but gender stereotypes in these messages may be reinforced without being noticed! Check out our
#chi2022
work for a systematic analysis! (1/n)
I will be at ACL next week to present this work! Look forward to connecting with folks who work on evaluation, data and beyond! HMU if any of these sounds interesting to you! DMs are open
I enjoyed the interview with Amazon a lot! It is not only a summary of my experience in natural language generation, but also a deep conversation about how my work connects and contributes to the community! Read to learn more about me, my Amazon internship, and more! 👇
Can AI help an aspiring author write a novel? Could machines learn how to make jokes? Inspired by these questions, Jiao Sun has been exploring the potential of AI-generated text. Now, as an Amazon ML Fellow, she's hoping to develop her research further.
#ConvAI
#NLProc
Sebastian was my internship mentor for 6 months. He taught me everything: technical skills, how to write a better paper, and how to collaborate with others more efficiently! If you want a lifelong mentor and to do great NLP research, I don’t see any reason why you wouldn’t apply!
My group is hiring interns for summer 2023. If you are a current PhD student and interested, please email me.
Info on internship topics:
There are also multiple open full-time roles in AI Engineering - feel free to reach out :)
Are you excited about pun generation? In
#EMNLP2022
, we have two works accepted in the main conference:
1️⃣ Context-Situated Pun Generation 👉 a brand-new task!
2️⃣ ExPUNations: Augmenting Puns with Keywords and Explanations 👉 a new dataset!
Learn more! 🧵👇
My awesome co-first author Deqing is looking for a research internship opportunity this summer; he’s one of the fastest-moving researchers I’ve seen in years!
We would appreciate it if you could send him a DM if you are recruiting interns to work on LLMs/large vision models!
🚨New paper alert🚨
With 𝔻𝕣𝕖𝕒𝕞𝕊𝕪𝕟𝕔, large language models (LLMs), vision-language models (VLMs), and text-to-image (T2I) models 𝕊𝕪𝕟𝕔 together!
They interactively and iteratively improve alignments and aesthetics of T2I models.
No RL needed. No human annotation needed.
Wow, thanks for the nice words! I think EVERY modeling work from the creative generation community should really think about having context as the constraint for generation!
When I had the idea of Pun Generation one year ago, I told myself it was not going to be possible. Until I saw this in
#EMNLP2022
from the incredible author
@sunjiao123sun_
. So exciting to see creative language generation paper in our community!
What does it mean for a generative AI model for code to be explainable? My internship work at IBM Research investigated the XAI needs under 3 scenarios: code translation, code autocompletion, and natural language to code. To appear at
#IUI2022
#HCI
😏 (1/n)
Thanks
@QVeraLiao
for the warm welcome! A super late announcement: I will be doing a research internship
@IBMResearch
on code generation! The great combination of my beloved text generation and Human-AI collaboration! Saying I’m excited would be a massive understatement! 💪💯
We ⚠️Investigate the Benefits of Free-form Rationales in our
#EMNLP2022
findings work, from both the human and the model perspectives. For humans, do rationales aid human interpretability? For models, do rationales boost the model performance? (0/n)
Would NLU models trained on EN-US generalize well to EN-IN (Indian English)/ EN-GB (British English)? I am thinking about exploring the transferability of models between dialects. Does anyone here know some good datasets for this task? 🙏
LLMs just cannot count and generate exactly the number of words that we ask for! With 7 being the magic number at which models start to struggle! (3/n)
Tu has been my amazing Google internship mate, close friend and life mentor. Can’t wait to see what he will achieve at his Google & VT adventure! All the best Tu!
I successfully defended my Ph.D. thesis. A special thank you to the members of my thesis committee: my wonderful advisor
@MohitIyyer
,
@MajiSubhransu
,
@HamedZamani
,
@lmthang
, and
@colinraffel
for their insightful feedback and advice on my research and career plans.
Honored to be part of the effort. Check out the magic that LIMA can achieve with only 1,000 prompts! Our human eval resonates with the GPT-4 eval, showing that humans prefer LIMA over, or on par with, other LLMs!
The key recipes of DreamSync are:
1. Diverse text prompts from LLMs
2. VQA feedback (TIFA score) for alignment and VILA feedback for aesthetics
3. Rejection Sampling with feedback
4. LoRA Fine-tuning
5. Multiple Iterations
(2/n)
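The recipe above can be sketched as a simple training loop. This is a minimal toy illustration with hypothetical helper names (`model.generate`, `tifa_score`, `vila_score`, `lora_finetune`), not DreamSync's actual code:

```python
def dreamsync_iteration(model, prompts, tifa_score, vila_score,
                        n_samples=8, tifa_thresh=0.9, vila_thresh=0.7):
    """One DreamSync-style round: sample, filter with VLM feedback, finetune.
    `model`, `tifa_score`, and `vila_score` are stand-ins for a T2I model,
    a VQA-based alignment scorer, and an aesthetics scorer."""
    accepted = []
    for prompt in prompts:
        # 1. Diverse prompts come from an LLM; here they are just inputs.
        # 2-3. Rejection sampling: score each sampled image with VQA
        #      (alignment) and VILA (aesthetics); keep those passing both.
        for _ in range(n_samples):
            image = model.generate(prompt)
            if tifa_score(image, prompt) >= tifa_thresh and vila_score(image) >= vila_thresh:
                accepted.append((prompt, image))
                break  # one accepted image per prompt is enough
    # 4. LoRA fine-tune on the self-generated, feedback-filtered pairs.
    return model.lora_finetune(accepted)

# 5. Multiple iterations: call dreamsync_iteration repeatedly,
#    each round starting from the improved model of the previous one.
```

The key point is that all supervision is self-generated and VLM-filtered: no RL, no human annotation, no real images.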
I'm in
#gradcohort2021
organized by amazing
@CRA_WP
! I've been enjoying the event a lot as it provides a platform for us female PhD students to connect and support each other! If you are here as well, feel free to drop me an email and we should talk!
I’m excited to see awesome things
#chatGPT
can do, but we need to make sure it’s not producing gobbledygook that merely seems right; that is misleading and can be harmful for knowledge queries. What is needed to explain generative models? Re-sharing our work:
Thanks for featuring my work with
@QVeraLiao
and all other colleagues at IBM Research. There has been an increasing effort around generative AI, and our work outlines how explainability would benefit users of these models!
Generative AI is taking the industry by storm and has become a niche of its own.
How can we make Generative AI Models Explainable?🤔
This paper by
@sunjiao123sun_
attempts to make Code-based GenAI Models explainable, let's break it down. 🧵
Excited to share our self-labeled counterfactual paper
@emnlpmeeting
#EMNLP2023
with
@ameya_godbole1
and
@robinomial
: we develop an automated procedure that generates hard negative examples (e.g., subtle unanswerable questions) from positive examples (e.g. answerable examples).
In total, we include five controlled generation tasks and show a spectrum of LLM abilities.
They are good at: constrained content generation (e.g., sentiment), story generation, and rationale generation!
Bad at: numerical planning and paraphrase generation! (4/n)
Congratulations! If you are interested in decoding methods for generation, please check out the paper: . The look-back decoding method automatically removes potential failures, repetitions, and topic drift during decoding!
🌟Thrilled to share that our paper "Look-back Decoding for Open-Ended Text Generation" won the Outstanding Paper Award at EMNLP 2023! Immense gratitude to the anonymous reviewers and to my incredible collaborators
@violet_zct
,
@real_asli
and
@MaxMa1987
.
#EMNLP2023
Excited to share I’ve joined
@Google
to lead product for AI Studio and support the Gemini API.
Lots of hard work ahead, but we are going to make Google the best home for developers building with AI.
I’m not going to settle for anything less.
@ReviewAcl
so will the April 15th review cycle be 4 weeks or 6 weeks? It is important, as many of us want the reviews back before EMNLP’s May 24 decision deadline if submitting to softconf. Btw, not a big fan of the “surprise” announcement 📣🥲
Our work on "Intriguing Properties of Compression on Multilingual Models" has been accepted to EMNLP 2022.
A collaboration led by Kelechi Ogueji w
@orevaahia
@lekeonilude
,
@sebgehr
,
@KreutzerJulia
. 🎉🔥
Great news to hear at the end of a long two weeks of travel.
The deadline is around the corner, please consider voting for Kai-Wei!
Please search for “sigdat elections” in your email inbox; it should take less than two minutes to vote!
Your support is greatly appreciated! ❤️
I am honored to be nominated by SIGDAT (the org that oversees EMNLP) to run for VP-elect with other awesome candidates who share the goal of improving our community. Please check your email to vote by 3/24.🗳️ See details:
It’s interesting to see how regional stereotypes get reflected in LLMs just by adding a country tag in the prompts! Awesome work led by
@esindurmusnlp
!
We develop a method to test global opinions represented in language models. We find the opinions represented by the models are most similar to those of the participants in USA, Canada, and some European countries. We also show the responses are steerable in separate experiments.
I sadly cannot make it to EMNLP, but please talk to
@yufei_t
about our work, especially the numerical planning part!
A lot of people have reached out about the code release; we are sorry for the delay and are working on it. The first release of our inputs and outputs will come very soon! :)
As my EMNLP trip comes closer, I wonder if there is a list of people who will be attending in person, so that I don’t need to stalk everyone’s Twitter?
@emnlpmeeting
if not, I’m happy to start one where people who want to connect can put down their names and websites 👩🏻💻
@mark_riedl
Well, I really want to self-recommend my two pun generation papers that are going to appear at EMNLP 2022, but I’m pretty sure they are not the “best” 😑 How about checking out AmbiPun first! By
@yufei_t
Among all the tasks, the Numerical Planning Benchmark (NPB) is the most intuitive: LLMs are asked to generate text matching exact numerical constraints, such as a count of words or syllables. The motivation comes from real-world scenarios such as creative writing. (2/n)
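To make the task concrete, here is a toy checker for the word-count variant (just an illustration, not the benchmark's official evaluation code):

```python
def meets_word_count(text: str, target: int) -> bool:
    """Check whether a generation contains exactly `target` words
    (whitespace tokenization, as a rough approximation)."""
    return len(text.split()) == target

# An LLM asked for "exactly 5 words" often fails this simple check:
assert meets_word_count("The quick brown fox jumps", 5)
assert not meets_word_count("The quick brown fox jumps high", 5)
```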
Thank you so much, Nedjma, for liking our work and for such a wonderful summarization! 💯 We hope you enjoyed our talk, and we would love to spark more discussion about event fairness in the community!
Looking for a high-quality QA dataset for event-centric reasoning? You definitely don’t want to miss out ESTER with **FIVE** event relation types! We are looking forward to seeing everyone’s great efforts on solving this challenging task! 💪💪
(1/5) Introducing our
#EMNLP21
paper “ESTER: A Machine Reading Comprehension Dataset for Event Semantic Relation Reasoning.” We invite everyone interested in event-centric reasoning to test your models on ESTER and submit results to our leaderboard:
Finally, we show that the predicted rich human feedback can be leveraged to improve image generation quality. Following the same recipe as in DreamSync, we use the rich human feedback to select high-quality training data to finetune and improve the generative models! (4/n)
You should catch me at the conference if you are attending in person! 👇
1️⃣ Context-Situated Pun Generation (Dec 9th 16:00-17:30 @ Atrium)
2️⃣ ExPUNations: Augmenting Puns with Keywords and Explanations (Dec 11th 15:30-17:00 @ Atrium)
Look forward to seeing many of you there!
Thanks Vera! Please swing by my talk; I look forward to talking to folks interested in code generation + explainability! It will happen on Wednesday, March 23rd, around 9:20am EDT. 🤓
Trying to attend as many
#IUI2022
sessions as I can this week. Looking forward to catching up!
If you are at IUI, check out the XAI session on Wednesday and
@sunjiao123sun_
's talk on "Investigating Explainability of Generative Models for Code through Scenario-based Design"😇
Awesome collaboration with our student intern lead Yowei Liang from UCSD, Junfeng He, Gang Li, Peizhao, Arseniy,
@N_Carolan
and all the other Google folks who are not on X at all 🤣. Feedback and discussions are absolutely welcome! (n/n)
A bit surprised, but this is important for folks who are having a hard time deciding between ACL and EACL.
Also, the EACL anonymity deadline is October 13th; it sounds like a good combo of arXiv + EACL + ACL.
[1/3] Cross-submission policy with ACL 2023:
As the
#EACL2023
notification deadline and
#ACL2023
submission deadline are unfortunately on the same day, you may submit your paper to ACL 2023 while it is still under review at EACL 2023. Keep reading...
First, where did we get the prompts for training? We utilize the LLM’s creativity (PaLM 2 in our case)! Check out the qualitative examples for a glimpse of the diverse prompts in our training, which sets a solid foundation for DreamSync’s performance. (3/n)
Congrats on the fine work
@yufei_t
! Actually, AESOP, my EMNLP work on paraphrasing, contributes by converting the generated hyperboles into more natural expressions! This is a great use case showing how much paraphrasing can help! Please stay tuned for my new post about AESOP!
Is generating hyperboles easy? Our machine says yes!
Check our new
#EMNLP2021
Findings paper "HypoGen: Hyperbole Generation with Commonsense and Counterfactual Knowledge" with Arvind and
@VioletNPeng
!🧾
Code and data coming soon!
We also evaluate DreamSync on two benchmarks, covering both text faithfulness and visual appeal. DreamSync performs the best among all the methods for textual faithfulness! (5/n)
@mark_riedl
@yufei_t
Thanks Mark! This gives me motivation to put them on arXiv first; I will post them here once they are live on arXiv. Here is the link for the AmbiPun paper:
From the annotation example, you can see that we not only 1) mark the image regions that are misaligned or implausible, but also 2) provide which words in the text prompts are misrepresented or missing! (2/n)
1/N
Tired of listening to your multilingual TTS models?
SQuId 🦑 is an automatic metric for multilingual speech synthesis: give it a waveform, it predicts how natural it sounds. To develop the model we gathered 1.9 Million listening tests in 65 locales.
With the collected data, we train a multimodal transformer to predict the rich feedback (plausibility/alignment/aesthetics scores) automatically. Our model greatly outperforms CLIP (with and without finetuning) in terms of correlation coefficients on our test set. (3/n)
Join us for the 12:30-12:30 AST poster session on 11/8!
@sunjiao123sun_
will present our work on adaptive syntactically controlled paraphrase generation. Joint work w/
@MaxMa1987
. She had a more interesting introduction 👇👇👇
In summary, yes, rationales can help with both human interpretability and model performance, but with many caveats that people should mind before drawing any conclusions! I will present this poster tomorrow
#BlackboxNLP
(Dec 8th) 11:00-12:30 at Mezzanine and Hall!
As an iterative approach, we also see the progressive improvement after each iteration quantitatively, both for text faithfulness and aesthetics. (6/n)
Ideally, dialects that share the same semantics should get exactly the same score! But this is too strict and is easily violated.
We introduce semantic perturbations and define relaxed dialect robustness: dialects should score higher than semantic perturbations! (1/n)
We investigate XAI needs for generative AI models for code through scenario-based design. More specifically, we conducted 9 workshops with 43 software engineers, using **real examples** from state-of-the-art generative AI models to elicit users' explainability needs! (2/n)
This work is conducted with my amazing mentor
@QVeraLiao
and
@mayankagarwal__
, together with expert
@michael_muller
, Stephanie Houde,
@kr_t
and fabulous manager Justin Weisz (
@gratefulspam
(🧐)) ! Please check out our paper for more details, and HMU if you want to discuss! ❤️
@WikiResearch
@USC
@WikiWomenInRed
Thanks for tagging! We hope our work brings awareness of potential event gender biases in knowledge sources (e.g., Wikipedia! I personally use it everyday 🥸), and urges Wikipedia contributors to be cautious when contributing to the pages! Check out my pinned Tweet for more!
"Men Are Elected, Women Are Married: Events Gender Bias on
#Wikipedia
": an event-centric study of gender biases on a large English Wikipedia corpus, showing that personal-life-related events are more likely to appear for females than for males.
(Sun et al, 2021)
We ask two questions:
1️⃣ HOW MUCH do dialect rewrites improve the metric value over semantic perturbations?
2️⃣ HOW OFTEN do dialect rewrites score higher than semantic perturbations?
We find that existing metrics (BLEURT, Prism, YiSi, BLEU, chrF) struggle at both. (2/n)
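The two questions above can be answered from per-sentence metric scores for dialect rewrites vs. semantic perturbations. A hypothetical sketch (not NANO's actual evaluation code):

```python
def dialect_robustness(dialect_scores, perturb_scores):
    """Given one metric score per sentence for its dialect rewrite and
    its semantic perturbation, return:
      - the mean margin (HOW MUCH dialect rewrites beat perturbations)
      - the success rate (HOW OFTEN the dialect rewrite scores higher)."""
    deltas = [d - p for d, p in zip(dialect_scores, perturb_scores)]
    margin = sum(deltas) / len(deltas)                   # HOW MUCH
    success = sum(d > 0 for d in deltas) / len(deltas)   # HOW OFTEN
    return margin, success
```

A robust metric should have a clearly positive margin and a success rate near 1; the finding is that existing metrics often fail both.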
In addition, our experiments show that NANO also helps improve metric performance on the standard metric benchmarks! You should use our metric if you are evaluating dialectal texts and want a fairer judgment! (6/n)
According to the value of {dialect} - {semantic perturb}, NANO helps improve the dialect robustness across different model sizes and languages (English, Portuguese, and Mandarin Chinese!) The success rates of {dialect} > {semantic perturb} also indicate that NANO helps! (5/n)
ExPUNations: Augmenting Puns with Keywords and Explanations 🧵
Humor understanding and generation are challenging even for humans! E.g., getting the funniness of the pun "the sushi said to the bee, wasabi!" requires the commonsense knowledge that wasabi often goes with sushi! (0/2)
@BlancheMinerva
You can probably refer to what we did in our work. We took the mC4 corpus, got the region information from the URL (.in), combined it with the output of a language-identification model (English), and used those texts as en-IN, aka Inglish. This is a rough approximation, but it benefits from scale.
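That filtering idea can be sketched in a few lines. This is only an illustration of the described approximation; `detect_lang` stands in for any language-ID model (hypothetical here):

```python
from urllib.parse import urlparse

def tag_dialect(url: str, text: str, detect_lang):
    """Approximate dialect tagging: region from the URL's top-level
    domain, language from a language-ID model run on the text."""
    tld = urlparse(url).netloc.rsplit(".", 1)[-1]
    if tld == "in" and detect_lang(text) == "en":
        return "en-IN"  # "Inglish": English text from an Indian domain
    return None
```

As noted above, this is coarse (e.g., Indian English on .com domains is missed), but it scales to web-sized corpora.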
We include 10 languages, 95 language variants in pretraining. Then, we adapt the metric to different use cases including within-language assessment and quality estimation with or without references. (4/n)
We find that people tend to talk about achievement and career for males but appearance and domestic topics for females. Using WEAT scores, we find that AI-generated (GPT-2) greeting card messages further amplify such stereotypes!! 🥲 Check out the techniques below: (2/n)
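For reference, the WEAT effect size used above can be computed from word embeddings as follows (a from-scratch toy sketch of the standard WEAT formulation on plain Python vectors, not the paper's code):

```python
import math

def _cos(u, v):
    dot = sum(a * b for a, b in zip(u, v))
    return dot / (math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v)))

def _assoc(w, A, B):
    # Mean similarity to attribute set A minus mean similarity to B.
    return sum(_cos(w, a) for a in A) / len(A) - sum(_cos(w, b) for b in B) / len(B)

def weat_effect_size(X, Y, A, B):
    """WEAT effect size: differential association of target word sets
    X, Y with attribute sets A, B, in units of the pooled std dev."""
    sx = [_assoc(x, A, B) for x in X]
    sy = [_assoc(y, A, B) for y in Y]
    all_s = sx + sy
    mean = sum(all_s) / len(all_s)
    std = math.sqrt(sum((s - mean) ** 2 for s in all_s) / (len(all_s) - 1))
    return (sum(sx) / len(sx) - sum(sy) / len(sy)) / std
```

A large positive effect size means X (e.g., male terms) associates with A (e.g., career words) more than Y does, which is the kind of amplification measured in the greeting card messages.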
@BlancheMinerva
It depends on whether you want a very accurate or a coarse approximation of Inglish. For the analysis part of our Inglish dataset, we use the dataset from , and I think this is probably the best you can refer to! If you just want an approximation, (to be continued)
Although I'm still in the mood of a shattered Hawaii dream, I want to share a pre-print of our accepted
#CHI2020
paper "FDHelper: Assist Unsupervised Fraud Detection Experts with Interactive Feature Selection and Evaluation". Find out more here: . 🧡
@JiaosunT
To facilitate this new setup, we collect a corpus that contains 4,551 tuples of context keywords and associated pun pairs, labeled with whether they are compatible for composing a pun, and a human-written pun for each compatible tuple. (1/3)