David Profile Banner
David Profile
David

@dzhng

5,246
Followers
568
Following
246
Media
1,940
Statuses

founder @aomniapp • prev: @amity_hq @nasajpl

San Francisco, CA
Joined June 2011
Don't wanna be here? Send us removal request.
Pinned Tweet
@dzhng
David
1 year
Just launched: an agent specifically designed for research. Using a modified babyagi architecture by @yoheinakajima & AutoGPT For example: - podcast script from latest news - market research report - new github repos trending on hacker news Live now on
71
186
2K
@dzhng
David
10 months
Sam Altman is out at OpenAI, this seem to be the most convincing theory so far.
Tweet media one
358
1K
10K
@dzhng
David
10 months
Microsoft essentially acquired the best parts of OpenAI for $0 - 2 of the original founders (Sam & Greg), now unshackled & probably with a lot more skin in the game. - A significant amount of OpenAI staff will follow Sam & Greg to the new org, some of the best people in the
191
721
7K
@dzhng
David
10 months
Forever grateful to @sama for responding to a random email, at 12:32 AM, to give me some OpenAI credits so I don't bankrupt myself. 🫡
Tweet media one
21
52
2K
@dzhng
David
29 days
Hot tip - using gpt-4-mini as a reranker gives you better results, and now with strict mode it's just as reliable as any other reranker model.
Tweet media one
23
34
601
@dzhng
David
5 months
Introducing `deep-seek` - an open source research agent designed as an internet scale retrieval engine. It's a new approach to the current wave of answer engines. Instead of giving you one answer, deep-seek will retrieve an extremely comprehensive list of enriched results.
43
94
574
@dzhng
David
1 year
Just finished some benchmarks, I can confirm that Azure's GPT-3.5 endpoint is at least 3x faster than OpenAI's endpoint. I can't believe I'm saying this, but it's time to switch to Azure. Just updated my oss prompt eng & guardrails lib to support Azure:
Tweet media one
29
64
596
@dzhng
David
10 months
@FreddieRaynolds
FreddieRaynolds
10 months
Anon Reddit account created today shared this about Sam/ @OpenAI situation. Plausible? Posted 5 minutes ago with 0 upvotes. So boring it’s almost believable.
Tweet media one
278
475
4K
11
26
459
@dzhng
David
26 days
Hot tip: when using llms to generate structured outputs with libs like instructor, ai sdk, or openai's strict mode, the order of the properties passed into the schema really matters. Remember that these autoregressive models can only generate one token at a time, and use the
Tweet media one
19
46
440
@dzhng
David
1 year
I’ve shifted 80% of my LLM spend to @AnthropicAI Claude 2 at this point - it strikes the perfect balance between performance / cost / throughput. AND, because it’s a completion API, not a chat API, it’s a lot more steerable via prompt prefixes. Completion APIs are under-indexed.
25
24
304
@dzhng
David
3 months
@AnthropicAI I see what you did here 👀
Tweet media one
6
10
298
@dzhng
David
10 months
Context:
@satyanadella
Satya Nadella
10 months
@sama I’m super excited to have you join as CEO of this new group, Sam, setting a new pace for innovation. We’ve learned a lot over the years about how to give founders and innovators space to build independent identities and cultures within Microsoft, including GitHub, Mojang Studios,
1K
3K
32K
1
4
234
@dzhng
David
1 year
My new prompt engineering technique: chain-of-whys 😂 Results are actually surprising. Tested on GPT-3.5/4 and Claude-1/instant.
Tweet media one
14
16
207
@dzhng
David
4 months
There are a lot of auto browsing agents these days, but to productionize it, my guess is they are doing something different. My guess: - There's an index of all possible actions w/ descriptions for RAG retrieval. - All interactive & navigation elements are annotated for agent,
@tryramp
Ramp
4 months
Introducing Ramp Tour Guide: an AI Agent that can show you how to do anything on Ramp! Today, we'd like to share a sneak peek of Ramp's near future. As Ramp grows in functionality, we want to make all of it easily accessible to all of our customers. To do that, we're demoing a
34
57
596
11
9
202
@dzhng
David
4 months
Doing research on people is SUCH a pain, whether it is for a sales prospect, a podcast guest, or a potential hire. There are so many different directions to go and a ton of noise in search results. Introducing - a specialized research agent tuned just for
Tweet media one
18
10
169
@dzhng
David
10 months
Another great perspective:
@DrJimFan
Jim Fan
10 months
This is a master 4D chess move. WOW. 1. No new corporate structure. MSFT is literally one of the oldest for-profit tech companies out there, with a mature legal structure. Whether it's good for AGI is up for debate. 2. MSFT always wants to own the GPT weights. Now the moment has
236
785
7K
1
3
129
@dzhng
David
1 year
Whenever I meet developers building on top of LangChain
Tweet media one
7
1
131
@dzhng
David
10 months
@davecraige Or mischaracterizing the seriousness. This would definitely be a cause for firing if the board is conservative enough.
0
0
104
@dzhng
David
10 months
These 2 seem to be related. Essentially the "move fast" mentality vs the current AI safety environment.
@karaswisher
Kara Swisher
10 months
Scoop: There are about to be a lot more major departures of top folks at @OpenAI tonight and I assume Altman will make a statement tonight. But, as I understand it, it was a “misalignment” of the profit versus nonprofit adherents at the company. The developer day was an issue.
5
1K
6K
4
2
104
@dzhng
David
4 months
Agent auth is going to be a really tricky problem to solve. On one hand, agents *should* have access to your user accounts so it can perform actions in your behalf, and work with any product seamlessly (even ones without an api). I think the virtual machine solution is actually
@aaronwhite
Aaron White (Singularity.vc)
4 months
Is authing the Rabbit R1 against any of your accounts actually secure? I'm not so sure
26
59
646
28
4
106
@dzhng
David
2 months
We just shipped a people prospecting feature that lets our users directly search through our database of 400M people records to find the exact individual they want to reach out to. Now users can research an account, use our prospector to find the exact people to target, use our
4
8
103
@dzhng
David
1 year
For anyone who is building multi-step AI agents (e.g AutoGPT type systems), I highly recommend building it on top of a job queue orchestration framework like @inngest , the traceability these things provide out of the box is super useful, plus you get timeouts & retries for free.
Tweet media one
4
8
85
@dzhng
David
2 months
All this intelligence for $3 per million (output) token. That’s 5x cheaper than its closest closed source alternatives (gpt-4o and sonnet-3.5). I’ve spent zero effort optimizing for costs so far. Just build assuming intelligence will be free and the market will make it happen.
Tweet media one
6
8
80
@dzhng
David
1 year
@immad AI sidekick for outbound SDRs that learns your product and helps you with prospecting and research. You give it product docs, and it will help you home in on your ICP and craft a personalized outreach plan with each prospect. Goal is quality prospects with problem-solution fit.
6
3
78
@dzhng
David
10 months
@mr_mading Most likely not imo, the Occam’s razor explanation is it’s just incompetence and/or human conflicts.
3
1
74
@dzhng
David
1 year
Can’t wait for all the AI startups to pivot to AR startups
Tweet media one
7
4
74
@dzhng
David
5 months
@SullyOmarr explains why openai is so keen to keep the brand
1
0
61
@dzhng
David
2 months
We built @aomniapp to reimagine what sales can be, unburdened by what has been. With our latest update, we're one step closer to our master plan of giving you all the knowledge you need to better understand your customers.
8
7
58
@dzhng
David
10 months
Looks like a board coup, caused by the conflict between acceleration-ists & safety-ists. Basically incentive alignment issue with having a non-profit govern a for-profit. File this under one of these management practices that sounds good in theory but never work in practice.
@karaswisher
Kara Swisher
10 months
Looks like I was correct in my scoopage:
Tweet media one
2
113
1K
1
12
57
@dzhng
David
3 months
Been using Claude 3.5 for coding day-to-day and wow, I think this may happen sooner than people imagine. Software is dead. And we have killed him.
@cpaik
Chris Paik
3 months
The End of Software
386
461
3K
4
3
55
@dzhng
David
1 year
Some interesting use cases from users: 1. Create lesson plan on SVB banking crisis: 2. Market research on expense tracking apps in the UAE: 3. Podcast from HackerNews: 4. HN Github:
1
9
49
@dzhng
David
5 months
Here's the github repo: There're also more examples in the deployed version: This is a really early experiment, a lot of the results will suck! But I think it's an interesting concept that should be explored further. Enjoy!
3
5
52
@dzhng
David
10 months
@penngalusa It's ultimately sama's call for what the comm strategy is if an incident like this happens. Maybe there's a conflict here.
3
0
50
@dzhng
David
10 months
Latest update. The original scenario is definitely oversimplified.
@dzhng
David
10 months
Looks like a board coup, caused by the conflict between acceleration-ists & safety-ists. Basically incentive alignment issue with having a non-profit govern a for-profit. File this under one of these management practices that sounds good in theory but never work in practice.
1
12
57
1
3
49
@dzhng
David
5 months
First use of the @rabbit_hmi r1 - raw and unfiltered. Overall really impressed, it’s a little buggy, but the AI engineering happening behind the scenes is really impressive, def in line with SOTA performance. Great work @jessechenglyu & team.
5
6
49
@dzhng
David
1 year
So many AI apps today adds chatgpt to existing product surface area, when the real opportunity is to reimagine the product category from the ground up
3
4
48
@dzhng
David
1 year
Do you know you can coerce OpenAI's new models to always return structured JSON via functions? Add a `print` fn, then force the llm to always call this via `function_call`. Add Zod schema parsing & typing for amazing DX. I built zod-gpt to do just this:
4
6
45
@dzhng
David
10 months
@arthur_hyper88 It's one potential outcome, but unlikely imo considering people who's "in the know" like Eric Schmidt are already offering support. More likely to be some philosophical difference than ethical issue.
6
0
47
@dzhng
David
1 year
My current approach to building agents has slowly converged on: many composable & testable agents chained together > one large agent with many tools
6
0
43
@dzhng
David
1 year
Just ran some benchmarks for the new OpenAI endpoints, the new 0613 models are FAST. In fact, the new GPT-4 model is almost the SAME speed as the old GPT-3.5 model! Even if you are not using functions, there's no reason not to switch.
Tweet media one
Tweet media two
6
8
44
@dzhng
David
5 months
I've been seeing a lot of "virtual employee" AI products lately, but IMO that framing is fundamentally flawed and actually a bit limiting. 1. Saying that you are building virtual employees will probably get people's attention, but it'll also set the user's expectations way too
11
5
44
@dzhng
David
11 months
Interesting observation - being able to get predictable outputs from LLMs often requires you to shift your mindset of how to build software. LLMs have their own preferences, and would prefer to return data in a format that aligns with their preference. Instead of fighting that,
Tweet media one
9
2
43
@dzhng
David
10 months
Regardless of what you think about the concept, you gotta admit that @Humane is an incredibly well executed product. It’s so rare to see this level of polish from any product, but especially rare from a startup.
@samsheffer
Sam Sheffer
10 months
Introducing the @Humane Ai Pin Complete System You get: Ai Pin 2 Battery Boosters Charge Case Charge Pad USB-C Adapter + Cable Starting at $699. Order yours on 11/16 at 10AM PT at
68
34
519
5
3
42
@dzhng
David
1 year
10k users on @aomniapp . Too many AI products just launch a waitlist. We actually shipped & are iterating every day.
Tweet media one
11
3
37
@dzhng
David
10 months
@MelindaBChu1 If this is true (it's 100% speculative), the issue would be more about crisis management & overall strategy of pushing team to take shortcuts, not the actual technical issue.
1
0
39
@dzhng
David
1 year
Not to overhype the new OpenAI API's too much, it looks like it can still hallucinate invalid JSON & parameters. I thought there would be some sort of built in API lv guardrails to auto enforce JSON shape & parameters. You'll still need to implement application side validation.
Tweet media one
4
7
37
@dzhng
David
1 year
Comparing aomni to ChatGPT Browsing - Aomni blows it away. ChatGPT even with browsing enabled cannot handle complex multi step queries.
Tweet media one
Tweet media two
1
6
39
@dzhng
David
1 year
@sama Every LLM ops startup
Tweet media one
0
0
34
@dzhng
David
2 years
@lucy_guo It’s amazing with the right people. 95% of people wanting remote are seeking a lifestyle company, where they optimize for less work, not more. If you can find the 5% who are more productive remote (b/c convenience / setup), then it can be as good as office, even for 0-1 cos.
3
0
35
@dzhng
David
1 year
Just noticed that we broke 50k users 👀
Tweet media one
4
3
31
@dzhng
David
1 year
When you give it an objective, the system will automatically break it down and completes it. By tuning the system specifically for information retrieval tasks, aomni is able to be a lot more reliable than the more generalized AutoGPT systems.
Tweet media one
2
3
31
@dzhng
David
1 year
I’m really excited about this, we’ve been busy working with our business customers to bring AI agents to the enterprise, and now’s finally ready. Our product is now multiplayer-enabled, allowing teams to collaborate on training and using the agent. Plus, we’ve incorporated
@aomniapp
aomni
1 year
🚢🚢🚢 Big product update! We just shipped teams features that make Aomni one of the very first truly enterprise-ready AI agents on the market
Tweet media one
1
0
6
5
2
34
@dzhng
David
1 year
The new version of aomni ( @aomniapp ) will have massive improvements in critical thinking & default to long form content. When given a high level objective, it is able to break that down into specific questions and analyze it like a human. Been working on this for a while, should
Tweet media one
Tweet media two
5
7
31
@dzhng
David
5 months
Under the hood, it's a multi-step agent that breaks down the initial user query and creates & executes a research plan (it uses @ExaAILabs 's search engine for both keyword & neural search). The entities extracted is then enriched one at a time to ensure comprehensiveness.
Tweet media one
1
2
33
@dzhng
David
1 year
Here's another example with waterproof shoes. The AI searched through a bunch of sources, and was able to cluster and present back report exactly how I specified:
Tweet media one
1
2
31
@dzhng
David
1 year
2
0
32
@dzhng
David
1 year
Just shipped Aomni Pro - unlimited queries on @aomniapp for $49 a month.
Tweet media one
9
3
29
@dzhng
David
6 months
The new oil
Tweet media one
2
2
30
@dzhng
David
10 months
startup vs academia
Tweet media one
2
0
27
@dzhng
David
1 year
1/ We've been busy building Aomni into the ultimate sales sidekick, and we're finally at a point where we can start raising the curtain and show everyone where we're going. It starts with the premise - AI + Sales is a crowded category, but we've found all existing tools lacking
@aomniapp
aomni
1 year
Aomni got a big upgrade. We’re thrilled to announce our B2B Account Intelligence Sidekick, an AI agent built specifically to support sales professionals with automated account research and planning. 1/5
4
2
12
1
4
28
@dzhng
David
10 months
@kavirkaycee @sama 😂😂😂 I’m sorry
0
0
27
@dzhng
David
1 year
So @AnthropicAI 's Claude+ can solve the circular gear rotation problem, but need a bit more pushing from the human. My very early take is the reasoning ability seems to be between gpt-3.5 and gpt-4. BUT given the 100k context window AND much faster speed (even with the
Tweet media one
2
0
26
@dzhng
David
10 months
@transitive_bs interesting, well he *was* at the apec summit 👀
1
0
27
@dzhng
David
3 months
@sammcallister @AnthropicAI I'm sorry, but the trolling has to be done 🫡
1
0
26
@dzhng
David
7 months
@venturetwins Can’t tell if this is parody or for real
2
0
26
@dzhng
David
1 year
We've reached another all time high today 🤯 Our email provider is now rate limiting us. Currently working with them to increase the limit, eta ~24 hrs. In the mean time, some sign up / login links may not be sent out. Sorry! Please try again later.
Tweet media one
10
2
26
@dzhng
David
10 months
This may be one of the best corporate saves of all time.
@satyanadella
Satya Nadella
10 months
We remain committed to our partnership with OpenAI and have confidence in our product roadmap, our ability to continue to innovate with everything we announced at Microsoft Ignite, and in continuing to support our customers and partners. We look forward to getting to know Emmett
5K
15K
92K
4
3
26
@dzhng
David
1 year
Every day more people around the world discover AI agents. People in the bay area talks about it as if it's ubiquitous but that couldn't be further from the truth.
Tweet media one
2
2
26
@dzhng
David
4 months
Whew
Tweet media one
3
0
26
@dzhng
David
10 months
Just finetuned gpt-3.5-1106 w/ a modified gpt-4 chain-of-density implementation, using @aomniapp 's internal market research dataset. It's SO good. Better summaries than gpt-4 at 20x less cost. Results below vs gpt-4. Will be amazing for RAG. Try it out:
Tweet media one
Tweet media two
4
0
25
@dzhng
David
1 year
Updated benchmark results with new OpenAI updates: ~30% improvement on GPT-3.5. Definitely a big improvement, Azure is still the king of speed tho at ~2x faster. But just based on speed of improvement it seems like there were / still are a lot of low hanging fruits to optimize.
Tweet media one
@jeffintime
Jeff Harris
1 year
big speed boost to the GPT 3.5 Turbo API just landed 🏁
15
10
110
2
4
23
@dzhng
David
1 month
B2B sales in 2024
Tweet media one
2
0
22
@dzhng
David
1 year
Browsing is such a key part of making AI agents useful, but it’s got a ton of implementation / scalability quirks. This seems to be a huge bottleneck for scaling the reliability / usefulness of agents. Is there any interest for a LLM browsing API for content extraction &
6
5
22
@dzhng
David
1 year
Of all the founder communities that I've met in SF, this one definitely have the highest talent density. Highly recommend esp if you're building AI products!
@davefontenot
Dave Font
1 year
Founders have been asking us when the next HF0 batch is. There’s more interest than ever. And we just decided to launch another batch this year. - 10 teams - $500k uncapped - The best place in the world to build Apply now: (1/5)
50
125
593
1
4
20
@dzhng
David
1 year
100 concurrent users on @aomniapp / now. There are now 20 AI agents working in our virtual office for you all 🤖
Tweet media one
7
5
20
@dzhng
David
1 year
Doing lots of tests between Claude-2 and GPT-4, my initial observation is that Claude-2 actually seems to be following a given JSON schema's description a lot better (like the one in the screenshot). GPT-4 sometimes get a bit too creative, even at temperature = 0.
Tweet media one
2
0
21
@dzhng
David
1 year
Demoing @aomniapp at #SFAIMeetup . Thanks for hosting @OkGoDoIt !
Tweet media one
2
2
20
@dzhng
David
1 year
Surprisingly, smaller models performed much better. Seems like if the goal is to have open ended discussions, it's better to stick to smaller models b/c less RLHF (?). The larger models seems to be too aligned to do Q&A & have all conversational abilities tuned out of them.
1
1
21
@dzhng
David
11 months
The harder I push LLMs to give better & more accurate outputs, the more I realize that the actual words you use in prompts don't really matter. The shape of the output & the way you guide the model's chain-of-thought matters way more.
4
0
20
@dzhng
David
1 year
This is very well said and I see a lot of similarities in the AI (agents) ecosystem & the crypto ecosystem. If you say you have deep conviction in AI but will only build / invest in infrastructure companies, then you're really not being intellectually honest.
@dabit3
nader dabit
1 year
my main takeaway from @ethcc is that everyone is building their own slightly different infrastructure protocol, while the vast minority are actually building any apps to run on all of this infrastructure. the sceptic in me realizes most of these people just want to get rich but
181
268
2K
5
0
20
@dzhng
David
4 months
Let’s get more devices that goes way beyond sRGB and saturates all camera sensors pls. cc @rabbit_hmi
Tweet media one
0
1
20
@dzhng
David
5 years
I have a theory - the total amount of death in China from diseases will actually go DOWN in 2020 - because the lives saved from better air quality will be more than the death caused from Coronavirus.
Tweet media one
2
4
19
@dzhng
David
1 year
@hthieblot Yea agreed. But it's so hard to make the unit economics for gpt-4 to work that I moved most of my product's logic to 3.5 at this point 🤷‍♂️
2
0
19
@dzhng
David
1 year
Released a big internal update to @aomniapp that lays a lot of the groundwork for the next few months. As a user, the main change is browsing should be 20-30% more reliable now due to switching to puppeteer. Pls let me know if you notice any big difference!
3
1
18
@dzhng
David
1 year
The world rewards outliers. Double down on something where you are deliberately delusional, ignore the noise, and the rest takes care of itself.
1
0
19
@dzhng
David
1 year
The steerability of @OpenAI 's new 0613 models are amazing. Even if you force the model to call a function despite giving it a unrelated user prompt, it'll still keep the same JSON shape, and tries its best to map the user's prompt to the correct keys.
@dzhng
David
1 year
@jamesbbaker4 @jaredpalmer Good question - it'll hallucinate some non-sensical data to try its best to map it to the user's prompt. Here's an example. I asked a question in one domain, but give it a function meant for a completely different domain. It essentially ignored the description in the JSON
Tweet media one
Tweet media two
1
1
1
3
2
19
@dzhng
David
1 year
Just deployed a new version of @aomniapp - a lot of optimizations in this version, the agent should now be 5x faster(!) without sacrificing quality. It's fast enough that it's almost not an async experience anymore. Will add response streaming soon for even more interactivity.
2
1
19
@dzhng
David
1 year
Long form content creation is coming to @aomniapp - the first step is ensuring the agent can actually consume large amounts of data in order to build a useful knowledge graph. We just shipped a new browsing engine that gets us much closer to that goal. Instead of consuming a web
Tweet media one
2
2
19
@dzhng
David
6 months
Testing out a new research agent architecture optimized around mass data retrieval, going to open source tomorrow. It's a new take that out performs anything else when it comes to data retrieval. Here's 10% of the results for the search query "Top AI agent startups".
Tweet media one
7
2
18
@dzhng
David
1 year
Just shipped a small feature that'll automatically notify users when their @aomniapp query is done. Agents should be designed to run in the background by default. We have a bunch of other things in the works that really leans into this concept, but this is a small first step.
2
1
18
@dzhng
David
11 months
Collecting all the AI infinity stones
Tweet media one
3
1
18
@dzhng
David
1 year
Just rolled out the next version of Aomni. This version is tuned to give more comprehensive reports from more diverse sources. Here is a good example of how the agent is able to take a high level question, break it down and create information dense report
Tweet media one
2
1
18
@dzhng
David
1 month
At @aomniapp we have 4 developers and 19 different technology providers of different shapes & sizes (e.g. supabase, vercel, openai, posthog... etc). On one hand it's amazing how key infrastructure is getting unbundled & productized, allowing us to iterate faster than ever. On
1
0
18
@dzhng
David
1 year
2000 members on @aomniapp discord now 👀 Who would have thought AI agents are a big deal
Tweet media one
2
2
18
@dzhng
David
1 year
Just leaked - Apple Vision Air
Tweet media one
1
3
18
@dzhng
David
1 year
Claude instant started really shallow and generic, but got successively much deeper. Here is the final result.
Tweet media one
2
0
18
@dzhng
David
10 months
If these 2 comparisons by @GregKamradt are done & evaluated over the same methods, it seems like gpt-4-turbo is significantly better at retrieval compared to the new claude-2.1 model, at least for single fact “needle in the haystack” type of use cases.
Tweet media one
Tweet media two
2
3
17
@dzhng
David
11 months
Trying something new - if anyone is working on a new startup in either AI codegen or LLM evals space, I’d love to be your first customer & will pay. The catch is - we’ll give you access to our codebase, and you build & integrate it & make sure it works with our dev flows. I
3
0
17
@dzhng
David
1 year
@nabeelqu Cause or effect?
Tweet media one
2
0
16
@dzhng
David
1 year
I haven't found any typescript library for chat completion that supports Azure and OpenAI hosted models, PLUS also works on edge, node, and browser environments. SO I made one: Also comes with useful logic like auto token checking & retries as a bonus.
2
1
17