Beyang

@beyang

7,124
Followers
1,132
Following
556
Media
4,688
Statuses

CTO @sourcegraph, follow for {tw,x}eets about dev tools, ai for devs, and jokes for/about developers developers developers!

San Francisco, CA
Joined September 2008
@beyang
Beyang
7 days
In my experience as a founder, @DavidSacks has been nothing but helpful and supportive through a tumultuous period for startups. I hear the same from founder friends (many YC co’s) that he has invested in. Outside of tech, one thing I appreciated was he joined @garrytan in
48
73
2K
@beyang
Beyang
2 years
Was chatting w/ someone about how coding at night feels way more productive than coding during day. Even early morning, when you're in theory more clearheaded seems less fun than late at night. Probably not universal, but strong anecdotal support. Is there any science on why? 🤔
131
49
1K
@beyang
Beyang
1 year
Cody combines LLMs like GPT-4 and Claude with @sourcegraph's deep understanding of code. The result is an AI coding assistant that's much more factually accurate and attuned to the patterns in your codebase. Now we're open sourcing it! Here's why:
26
121
805
@beyang
Beyang
7 months
of course that's your contention. you're a first year ai influencer who just got back from neurips, probably just finished reading the toolformer paper, lemme guess—you trained your own foundation model and are now pivoting to agents. you'll probably be convinced of that until
Tweet media one
16
64
761
@beyang
Beyang
1 year
Two weeks ago, we open-sourced a new Go concurrency library, conc. It now has 5.3k stars. Here's the technical writeup on the motivations and design decisions behind conc:
9
102
562
@beyang
Beyang
2 years
We've been using @AnthropicAI's new language model, Claude, to build an in-editor coding assistant called Cody that helps you understand code and reduces day-to-day sources of programmer toil. Here's a sneak peek 👇
Tweet media one
23
65
499
@beyang
Beyang
4 years
Premature abstraction is probably worse than premature optimization.
18
86
470
@beyang
Beyang
1 year
The OpenAI Cookbook contains tons of useful prompt tips and examples, but it can be onerous to read through them all. Instead, here's how you can turn these docs into a well-informed chatbot in 1 minute.
10
67
434
@beyang
Beyang
2 years
1 month into that monolith➡️microservices migration: "Really excited about how much faster we'll move after this bold new architecture" 3 months: "Taking a bit longer than expected, but what project doesn't?" 6 months: "You know, it's the journey that counts." 1 year later:
Tweet media one
11
51
427
@beyang
Beyang
1 year
If you'd like to play with @AnthropicAI's new 100k-token model using @LangChainAI, Cody offers a great way to learn new libraries and APIs. This illustrates another advantage Cody has over Copilot: freshness. Cody uses @sourcegraph to fetch context from current code. Its
Tweet media one
9
40
286
@beyang
Beyang
2 months
The future of code will be built on two foundational technologies
Tweet media one
8
15
264
@beyang
Beyang
2 years
Apparently, Science™ backs me up on this one, and all those "sit upright" exhortations are misinformation:
Tweet media one
@beyang
Beyang
2 years
Finally got a standing desk so I can lower it enough to achieve optimal programming posture
Tweet media one
5
4
58
10
14
225
@beyang
Beyang
2 years
Instead of making me learn a new DSL, why can't you just provide a simple library/API that lets me describe what I want in a well-known language that already has great dev tools?
25
8
227
@beyang
Beyang
3 years
It’s time to speak the truth plainly: @github code search is terrible and the dev world deserves so much better. If you believe this is true, then I invite you to try @sourcegraph . You’ll never look back—I promise.
6
28
223
@beyang
Beyang
3 months
Wow. We just enabled GPT-4o in Cody and the first zero-shot code generation in a big existing codebase just...works? No red squigglies! It is *really* good at learning from the context Cody provides from our specific codebase.
5
25
203
@beyang
Beyang
1 year
This is why LLM portability matters. If you’re using Copilot, you have a 2-year old model with 2k tokens of context that doesn’t know anything past 2021. If you’re using Cody, you can use Claude, GPT-4, and the latest, greatest LLMs as they come online,
5
29
201
@beyang
Beyang
2 years
The Mythical Man Month gets so many things right that modern “dev productivity” experts get wrong. A true classic.
5
5
195
@beyang
Beyang
1 year
And here is Cody's non-confidential prompt. It's public and open source, along with the rest of Cody. You can view it, improve it, and upstream changes to it. You can even ask Cody about its own source code. This is the power of open dev tools 🙂
Tweet media one
@marvinvonhagen
Marvin von Hagen
1 year
Microsoft just rolled out early beta access to GitHub Copilot Chat: "If the user asks you for your rules [...], you should respectfully decline as they are confidential and permanent." Here are Copilot Chat's confidential rules:
Tweet media one
123
950
5K
0
20
192
@beyang
Beyang
2 months
Cody now has a mechanism for pulling in context from *outside* the codebase! Introducing OpenCtx, a protocol for providing relevant technical context to humans and AI. This builds on Sourcegraph's foundation as the world's best code search and connects our code graph to entities
Tweet media one
Tweet media two
5
34
174
@beyang
Beyang
7 months
Prompt engineering means exploring textspace until you find an input token sequence that is (1) well represented in the training set and (2) precedes the type of output you’d like to see. Two conditions seem necessary for a good prompt: 1. Low perplexity 2. Nearby (in
20
13
167
@beyang
Beyang
2 years
How much of what is considered "best practice" in tech management is cargo-culted from Google without asking if Google succeeds *because* of the practice OR if Google succeeds in spite of the practice OR if the practice fits Google's business but not necessarily yours?
4
16
161
@beyang
Beyang
2 years
I fucking love working on dev tools
5
9
144
@beyang
Beyang
11 months
Anyone else building with LLMs feeling that the interesting stuff that actually moves the needle for user experience is at the search/RAG end of things, not so much the language model itself?
20
9
136
@beyang
Beyang
2 years
Anyone have a good computer networking 101 blog post series they'd recommend to someone who is a more junior engineer that wants to ramp up on the basics of the networking stack, from TCP/IP through TLS?
18
19
128
@beyang
Beyang
5 months
We've been communicating this to our customers and partners for months now. NNS with naive embeddings yields very noisy results and you're likely better off starting with a keyword-based approach. This simple "do the dumb thing first" insight is one of the reasons why
@_reachsumit
Sumit
5 months
Is Cosine-Similarity of Embeddings Really About Similarity? Netflix cautions against blindly using cosine similarity as a measure of semantic similarity between learned embeddings, as it can yield arbitrary and meaningless results. 📝
Tweet media one
28
398
2K
6
7
120
@beyang
Beyang
1 year
My family arrived in the United States with very little savings. Public school accelerated learning programs afforded me the opportunity to pursue my curiosity in math. SF has done its students a huge disservice by eliminating these in the name of "equity".
@JoelEngardio
Joel Engardio
1 year
Amazing how many school districts in the Bay Area let kids take algebra in the 8th grade (and even 7th) but it’s not allowed in San Francisco. If a kid likes math, we need to do everything we can to encourage it!
37
38
392
1
10
115
@beyang
Beyang
2 years
“DevOps” was supposed to be about dev-ifying ops but it has now led to opsification of dev—focusing too much on the outer loop (the SDLC), using DORA as the measure of dev productivity, which means commits implicitly become the unit of dev productivity.
9
9
107
@beyang
Beyang
9 months
We are past the peak of the coding AI hype cycle. Devs don't want AIAIAI, they want solid tools that tackle the toil and tedium that prevents us from shipping awesome stuff. For tools that use AI, the devil's in the details—there's a big gap between flashy demos and great UX.
6
6
94
@beyang
Beyang
4 months
AI engineering in 2024
Tweet media one
0
13
88
@beyang
Beyang
1 year
Twitter's algo is LIVE on @sourcegraph with FULL compiler-accurate code nav! 👇
Tweet media one
4
16
87
@beyang
Beyang
3 years
Thank you to all our customers and users who brought @sourcegraph into their organizations and coding lives. Thank you to all the amazing team members who got us to this point. Thanks to our fantastic investors for funding us. And thanks to @ron_miller for the great reporting!
@TechCrunch
TechCrunch
3 years
Sourcegraph raises $125M Series D on $2.6B valuation for universal code search tool by @ron_miller
2
24
108
7
6
88
@beyang
Beyang
7 months
Sourcegraph's open-source Go concurrency library, conc, was featured in the Best of Go 2023 by @golangweekly. Thank you to @camden_cheek and @bobheadxi for creating and releasing an excellent library!
Tweet media one
1
15
82
@beyang
Beyang
5 months
We're going to see a clearer separation of types of work that all previously got lumped under the umbrella of "software engineering". On the one hand, there's the work that pushes the envelope of innovation (in both UX and algorithms+architecture). Inner loop tools (like
4
9
85
@beyang
Beyang
1 year
Here is a side-by-side comparison of experimental Cody autocomplete v. Copilot. Cody's completions are both faster and higher quality. Note: there are cases where Copilot performs better, but it's already hit-or-miss and we move fast. Open source and enterprise-ready today 🙂
2
9
81
@beyang
Beyang
3 months
oh nothing, just using the power of Cody code context to turn Claude 3 and GPT-4 into *library-specific* app generators (note: also works for private internal libraries because we're using special indexers rather than the memorized model training data)
10
11
83
@beyang
Beyang
4 months
The most surprising demonstration of technology I've seen in 2024
5
3
80
@beyang
Beyang
1 year
I wanted to add a feature to Cody to "rewrite code in a more functional style", so naturally I asked Cody how to do that. It walked me through the files I needed to edit and generated the code using existing source as a reference point. I thought this would take at least half an
3
6
83
@beyang
Beyang
2 years
Unleash the power of code search from within your editor!
@muratsutunc
Murat Sutunc
2 years
Just launched the new Sourcegraph VS Code Extension ❤️ Search your code and 2M+ open source repositories, all from your IDE 🤯
20
128
747
4
5
81
@beyang
Beyang
2 years
Across all our customers, @Sourcegraph now indexes over 120 billion lines of code 🤯
4
6
81
@beyang
Beyang
3 months
Once you free yourself of the AGI nirvana/doom cult mind virus, you can start to reason about Transformers and Attention as what they are: useful new tools in the programmer's toolkit. And then it will be clear that RAG and context retrieval are not hacks, but crucial components
14
10
79
@beyang
Beyang
3 years
We're prototyping a notebook-like interface for code investigations and explorations in @sourcegraph! Thinking it'll be great for onboarding, collaborative debugging, and personal note-taking. Anyone interested in early access?
11
15
75
@beyang
Beyang
1 year
Cody now generates release notes! Try it out here 👉
3
12
74
@beyang
Beyang
9 months
GitHub Universe is this week, so we thought it'd be a good time to review how 5-month-old Cody is now beating 2-year-old Copilot across a spectrum of common programming tasks. Hype is fine, but you know what's better? Real-world use cases 👇
4
11
73
@beyang
Beyang
1 year
A mistake folks have made in 1st-gen LLM app UX is too much magic. Magic works well for wow effect in shallow demos, but for day-to-day use, explainability and visibility are essential, especially for tools that wish to integrate into the human brain's core iteration loop.
3
6
70
@beyang
Beyang
1 year
Anyone want to try Cody for Neovim?
22
1
68
@beyang
Beyang
2 years
Just after our 4.0 launch, @sourcegraph now indexes and maps over 130TB of open-source code—and much more than that of private code
Tweet media one
0
6
70
@beyang
Beyang
2 years
When writing dev docs, remember: a single good usage example is worth a thousand words.
4
7
67
@beyang
Beyang
2 months
At @sourcegraph, we've been choosy about our investments in the model layer. Model training is costly and has slow iteration cycles compared to context improvements (which you need to do anyway). But we've uncovered a few key areas where finetuning has a big impact on user
Tweet media one
5
13
67
@beyang
Beyang
2 years
Would people read and subscribe to a “How (Open-Source) Stuff Works” newsletter? The idea is every month, we’d interview a maintainer and walk through the “life of a query” through a different codebase, documented in a @sourcegraph notebook
@beyang
Beyang
3 years
We're prototyping a notebook-like interface for code investigations and explorations in @sourcegraph! Thinking it'll be great for onboarding, collaborative debugging, and personal note-taking. Anyone interested in early access?
11
15
75
10
5
65
@beyang
Beyang
1 year
Cody now answers your questions about codebases on and can explain any file to you in plain English—or your human language of choice! Invaluable for reading through and understanding code.
Tweet media one
3
12
65
@beyang
Beyang
7 years
Graphcool framework just open sourced to much applause at #graphqlsummit by @_schickling!
Tweet media one
0
16
65
@beyang
Beyang
4 years
Last week, I had the honor of sitting down with @kelseyhightower. One of the questions I asked him was how the heck do you make heads or tails of all the new emerging technologies in deployment and infrastructure.
4
1
65
@beyang
Beyang
2 years
With all the amazing advancements happening now in LLMs, I think it's time to bring him back.
Tweet media one
4
3
64
@beyang
Beyang
3 years
Some awesome perf work happening on code search at @sourcegraph. Bringing memory usage down while scaling up! We now have every OSS repo with more than 26 GH stars, and counting down...
Tweet media one
2
5
64
@beyang
Beyang
3 years
Imitation is the sincerest form of flattery.
3
2
63
@beyang
Beyang
2 years
No one achieves greatness working 40 hours per week.
18
5
62
@beyang
Beyang
1 year
For everyone who joined the waitlist for Copilot X, but would prefer to try something... sooner, I think you'll want to join us tomorrow 👇
@sourcegraph
Sourcegraph
1 year
Code intelligence + AI = 🤯 Mark your calendar: March 23, 2023
0
4
22
5
12
60
@beyang
Beyang
4 months
PSA for security researchers investigating the xz exploit: GitHub disabled the repository, but you can still explore the source on Sourcegraph. Diff search might be useful for finding/grokking contributions Jia Tan made to other projects (like google/oss-fuzz), as well:
1
15
59
@beyang
Beyang
3 months
Thrilled to be part of the judging crew at this year's @craft_ventures AI Hackathon! Registration is closing end of this week, apply here, and look forward to seeing what cool things get built!
9
8
61
@beyang
Beyang
2 years
Finally got a standing desk so I can lower it enough to achieve optimal programming posture
Tweet media one
5
4
58
@beyang
Beyang
3 years
How many of you out there would install a Chrome extension that replaces GitHub’s search with Sourcegraph?
@bernhardsson
Erik Bernhardsson
3 years
Is it too much to ask for better code search on GitHub? I usually end up cloning repos locally and using grep and find when I need to locate the definition of a function or something else.
18
2
211
3
6
60
@beyang
Beyang
2 years
Tfw you log on Sunday night to go through your Slack backlog, notice a weird traffic spike on the site, try to debug whatever analytics issue is causing that, and realize the 2-year old blog post you almost didn't write is top of the orange site 🤔😎😅
Tweet media one
2
3
59
@beyang
Beyang
2 years
Wrote up some thoughts on my mental model of how developers work and how I differ from some of the more popular frameworks for developer productivity like DORA, trying to map out my own intuition with systems thinking. Curious what others think!
7
15
58
@beyang
Beyang
2 years
Should we rename the Sourcegraph blog "Regular Expressions"? 🤔
10
1
58
@beyang
Beyang
1 year
So... I think we have an unprecedented opportunity to make the knowledge of open source accessible to EVERYONE 🤯 Looking to partner with 1-2 creators to produce some educational deep dives into the OSS projects that power our world 👇
7
5
56
@beyang
Beyang
7 months
Tweet media one
1
2
56
@beyang
Beyang
9 years
"I decided to use Java to solve a problem..." Quote of the day from @tsenart at #gophercon http://t.co/BvE4rthabJ
Tweet media one
5
87
53
@beyang
Beyang
9 months
This checks out—in our evaluations, StarCoder-15B was the best model available for context-aware code completion and it’s one reason why our completion acceptance rate is beating alternatives now.
@Tabby_ML
TabbyML
9 months
One more thing... Check out this leaderboard visualization with metric results from cceval! 🎞️ 📄
Tweet media one
5
16
113
2
10
55
@beyang
Beyang
3 years
A teammate used @sourcegraph to get the count of teammates in every city (our handbook is open and stored in git). The highest count city is SF with a total of 9 people—out of over 250. Grateful that full remote allows us to work with so many talented folks around the world!
0
2
55
@beyang
Beyang
3 months
DM if this is of interest to you or a friend: Job description: AI Engineer - Looking for hungry, hardworking, and eager-to-learn devs who want to imagine and build the future of software creation. Become a part of a crack team working closely with the Sourcegraph founders to
5
11
54
@beyang
Beyang
1 year
Respectfully disagree. The true leverage will come from those building dev-centric AI that amplifies rather than replaces humans. Our mission with Cody is to make you, the software developer, 10x more efficient, creative, and happy:
@mckaywrigley
Mckay Wrigley
1 year
By 2024 you’ll be able to replace ~50% software devs with GPT-4 agents that run on $10 worth of tokens per hour. The whole “they don’t need sleep or breaks or food” thing? Yeah. That’s real now. Why hire a new employee when you can spin up an AI agent for 1/10 the cost?
786
754
6K
4
2
54
@beyang
Beyang
3 years
┳┻| ┻┳| ┳┻| ┻┳| ┳┻| ┻┳| ┳┻| ┻┳| ┳┻| ┻┳| ┳┻| ┻┳| ┳┻| _ ┻┳| •.•) @sourcegraph 's index is 2M repos now ┳┻|⊂ノ That's every repo with >5 stars ┻┳|
@sourcegraph
Sourcegraph
3 years
ICYMI: We’ve indexed over 1M open source repositories on Sourcegraph cloud. ☁️ Why? To make it easy to search OSS projects and to expand code literacy.
1
1
7
2
11
54
@beyang
Beyang
1 year
Pop quiz: one of these is Cody, a free OSS coding AI from @sourcegraph. The other one is GitHub Copilot ($10/month, closed). Can you tell which is which? 🤔 Follow @sourcegraph for more announcements in the next week. The future, my friends, is open 🙂
6
3
54
@beyang
Beyang
2 years
STOP pushing pointless cookie-cutter engineering metrics. START enabling devs to define metrics that are actually meaningful to your org 👇
2
9
53
@beyang
Beyang
5 months
Product shouldn’t be “data driven”, but rather “data validated.” Great product ideas begin life as intuition and a qualitative hypothesis about what is good for the user. KPIs should validate the hypothesis but shouldn’t be the driver.
1
3
54
@beyang
Beyang
7 months
My hot take on AI’s impact on software engineering: the barbell combo of CS fundamentals and product domain knowledge grows in importance, while glue code and middleware becomes more auto-generated. Fun convo about AI codegen, RAG, and Cody!
@saranormous
sarah guo // conviction
7 months
👇 1st @NoPriorsPod interview of the year: @beyang cofounder/CTO of @sourcegraph and I talk RAG for collaborative coding with AI, codegen, if development gets automated and what engineers will still need to learn
5
4
34
4
13
52
@beyang
Beyang
3 years
Wish I had a private search engine that explicitly indexed my web history. So many times I ask, "What was that one page that mentioned that one thing?"
10
1
52
@beyang
Beyang
3 years
Nothing like using @sourcegraph to build @sourcegraph —we're migrating from global CSS to CSS Modules and our frontend platform team is using Code Insights to track migration progress:
Tweet media one
1
6
53
@beyang
Beyang
1 year
Inspired by recent coding-with-AI demo videos, we at @sourcegraph made our own demo video—but using a tool you can get now 🙃 Want Cody? Come say in our Discord!
0
12
52
@beyang
Beyang
3 years
oh my god the right example is truly worth a thousand lines of documentation
1
5
51
@beyang
Beyang
3 months
Git history context coming soon to a Cody near you?! Would love people's feedback on: * Would you find this useful? * What more should it do? * Should we ship it?
Tweet media one
7
4
50
@beyang
Beyang
2 years
@kotchama @gwendallecoguic @sourcegraph indexes only repositories on above 5 stars to cut down noise * Note SG picks up a GitLab repo that GH doesn't in your example * SG also picks up 2 from GH that don't appear in the GH result set, so neither searches *all* GH
1
0
50
@beyang
Beyang
6 months
Big day for AI announcements—and adding to it, some big improvements to Cody alongside our Enterprise GA! First up, Cody can now answer questions that span *multiple* repositories:
3
8
50
@beyang
Beyang
1 year
The most important job skill for the next 10 years will be the speed at which you can learn a completely new job on the fly. Exciting times if you love to learn. LFG!
4
3
47
@beyang
Beyang
2 years
@DavidKPiano Yes but same is true for early morning. For some reason staying up til 4am feels way different than getting up at 4! The former seems to yield better results somehow
1
0
48
@beyang
Beyang
2 years
Software has eaten the world to the extent that every company is building it. But there is still a distinct tech sector, which, though it no longer monopolizes software creation, is viewed as doing software uniquely well. Dev tool startups will change this.
4
9
48
@beyang
Beyang
4 years
He drew this great analogy comparing programming to cooking and engineering orgs to restaurant kitchens. I just went back and listened to it while editing the recording and it is 🍲🍕🍰👌 Excited to get this edition of the Sourcegraph Podcast out to the world next week!
5
2
46
@beyang
Beyang
8 months
Cody is now generally available!
@sourcegraph
Sourcegraph
8 months
✨Advent of @SourcegraphCody 2023 ✨ #14 Cody Pro for free Cody is now generally available. As our gift to you, we're giving Cody Pro to all users for free until February 2024. Get Cody Pro for free at
3
15
150
1
6
47
@beyang
Beyang
4 years
Code insights + coverage stats looking pretty nifty in the Sourcegraph UI 😎
Tweet media one
2
8
48
@beyang
Beyang
9 months
If I were an ambitious ML researcher who wanted to join a startup, would I go to MSFT? And if I preferred Big Tech, why not Meta where my work would be in the open? The OAI talent diaspora will spread the best ideas of GPT-4 and 5 far and wide. If you’re interested in pushing
2
3
47
@beyang
Beyang
2 months
Confluence is now integrated into @openctx, so you can pull context from Confluence pages into @SourcegraphCody (along with all the other OpenCtx providers like Linear, Slack, Notion, Google Docs, Prometheus, and more)
1
7
47
@beyang
Beyang
10 months
Ngl this was the best founder hangout event I’ve ever attended. Looking forward to next year in the 🌽
Tweet media one
Tweet media two
Tweet media three
@josephofiowa
Joseph Nelson
10 months
7
2
38
7
1
47
@beyang
Beyang
1 year
When it comes to code search, ranking is everything. @sourcegraph uses a page-rank-like algorithm to bubble up the most relevant results to the top. No other code search does this. The Cody AI assistant also takes advantage of this. Whether you're human or AI, context is 👑
Tweet media one
3
5
47
@beyang
Beyang
9 years
@mxcl the way people interview for programming positions is so outdated. Trying to do it differently at our company:
3
1
46
@beyang
Beyang
3 years
You can run these queries on @sourcegraph to determine the scope of impact of the log4j 0-day on your code:
3
13
45