zlacker

I tried it out yesterday, after reading the enthousiastic article at https://www.macstories.net/stories/clawdbot-showed-me-what-t...

Setting it up was easy enough, but just as I was about to start linking it to some test accounts, I noticed I already had blown through about $5 of Claude tokens in half an hour, and deleted the VPS immediately.

Then today I saw this follow up: https://mastodon.macstories.net/@viticci/115968901926545907 - the author blew through $560 of tokens in a weekend of playing with it.

If you want to run this full time to organise your mailbox and your agenda, it's probably cheaper to hire a real human personal assistant.

replies(11): >>lurkin+X3 >>wartyw+xc >>itissi+pw >>turnso+eQ >>quiets+Ba1 >>guluar+ab1 >>geek_s+Ul1 >>jaunty+yN1 >>ern_av+LN1 >>0xbadc+g12 >>columk+Tu4

>>lode+(OP)
part of me sympathizes, but part of me also rolls my eyes. Am i the only one that’s configuring limits on spend and also alerts? Takes 2 seconds to configure a “project” in OpenAI or Claude and to scope an api key appropriately.

Not doing so feels like asking for trouble.

replies(3): >>lode+V4 >>iamthe+f7 >>jmatha+ul

>>lurkin+X3
That's what I did, which is why I abandoned my experiment this quickly.

I'd find it hard to write such an article about how this is the next best thing since sliced bread without mentioning it spending so much money.

replies(2): >>lurkin+z5 >>adastr+Wd1

>>lode+V4
good on you! The anecdote of that person spending hundreds of dollar is scary.

>>lurkin+X3
not only that, but clawdbot/moltbot/openclaw/whatever they call themselves tomorrow/etc also tells you your token usage and how much you have left on your plan while you're using it (in the terminal/console). So this is pretty easily tracked...

>>lode+(OP)
Huge pyramids are built of relatively small blocks, kudos to everyone contributed.

replies(1): >>Sharli+3T

>>lurkin+X3
Are you all enabling auto reload for personal projects?

I load $20 at a time and wait for it to break and add more.

replies(1): >>fnordl+jU

>>lode+(OP)
I think one thing these things could benefit from is an optimization algorithm that creates prompts based on various costs. $$, and what prompts actually gives good results. But it's not an optimization algorithm in the sense gradient descent is, but more like Bandits and RL.

There has been some work around this practically being tried out using it for structured data outputs from LLMs https://docs.boundaryml.com/guide/baml-advanced/prompt-optim...

I won't claim I understand its implementation very well but it seems like the only approach to have a GOFAI style thing where the agent can ask for human help if it blows through a budget

>>lode+(OP)
Yeah, I looked at Clawdbot / OpenClaw at the beginning of the week (Monday), but the token use scared me off.

But I was inspired to use Claude Code to create my own personal assistant. It was shocking to see CC bang out an MVP in one Plan execution. I've been iterating it all week, but I've had it be careful with token usage. It defaults to Haiku (more than enough for things like email categorization), properly uses prompt caching, and has a focused set of tools to avoid bloating the context window. The cost is under $1 per check-in, which I'm okay with.

Now I get a morning and afternoon check-in about outstanding items, and my Inbox is clear. I can see this changing my relationship to email completely.

replies(1): >>azinma+tW

>>wartyw+xc
"Pyramid" is an interesting metaphor to use, given the connotations.

replies(1): >>pohl+I11

>>jmatha+ul
Can you get meaningful work done with CC at $20 at a time? I load $20 at a time onto the API for general chatting purposes and it lasts a few months at a time. I've always avoided trying CC because I got the impression people were burning $100+/mo, which is beyond my personal hobby budget.

replies(5): >>sanaro+411 >>browni+7c1 >>quiets+Xc1 >>jmatha+bp1 >>TheGRS+5E1

>>turnso+eQ
Post it!

replies(1): >>turnso+X31

>>fnordl+jU
/Not a software engineer perspective working on side projects

I guess if you're letting it vibe code huge chunks. I'm doing mostly handwritten code for my current project with a little bit of "I don't want to deal with this, Claude can handle it" and I've spent $1.26 this month for my 446 lines of code.

But yes I suppose at that rate, if Gastown or Beads or whatever is 300,000 lines of code (just to use a project known to be fully vibe coded with rough LOC reported), that would be over $800.

Don't let it vibe code hundreds of thousands of lines of code I guess.

>>Sharli+3T
Are you alluding to pyramid schemes or “Look on my Works, ye Mighty, and despair”?

replies(1): >>Sharli+z31

>>pohl+I11
I was thinking of the former, but the latter could certainly apply too.

replies(1): >>abusta+JL1

>>azinma+tW
A lot of the system prompt, skills and tools center around my specific needs (I manage separate IMAP and Gmail inboxes, use Granola, and have iCloud calendars). And there are some hard assumptions baked in (I want to have a morning & afternoon check-in). It probably wouldn't be useful as-is, but maybe as inspiration?

replies(3): >>browni+cb1 >>RickS+UD1 >>azinma+pf4

>>lode+(OP)
Just watch a few videos on Clawdbot. You'll invariably see some influencer's Anthropic key, and just use that. Wokka wokka!

>>lode+(OP)
you can use your claude max subscription

replies(3): >>swords+Fi1 >>preomm+m32 >>drewst+zo3

>>turnso+X31
I'd love to see even a filtered version of it. I've been doing very similar things with an "everything" database. That's been my own personal northstar.

BTW, OpenCode has free Kimi (I haven't hit a quota yet) right now and it's done pretty great things for me in the last 24 hours.

replies(1): >>turnso+1d1

>>fnordl+jU
I keep a master llm.md file and rotate between Claude Code (Pro), Antigravity Opus, Antigravity Flash, and OpenCode Kimi. I don't actually mind hitting limits.. though I'm least happy when Opus goes away.

My entire process is to build a generic llm.md file that all the tools can use and record to. I don't want to be tied completely to any one solution. You can get pretty far without spending a lot on tokens. I can run almost continually, and presently I'm the bottleneck anyway.

>>fnordl+jU
I'm successful with personal projects (reverse engineering USB devices, sledding spot finder, silly stuff) on the $20/mo Claude plan. I rarely use Opus except for planning larger things.

>>browni+cb1
Oh interesting—how do you find OpenCode vs CC? I'll check it out. And I'll try to get a version of this assistant in a form I could share publicly.

replies(1): >>browni+Qo1

>>lode+V4
People using it have subscriptions.

>>guluar+ab1
oh yeah let me just pull my 200$ monthly subscription out of my back pocket

replies(1): >>guluar+Om1

>>lode+(OP)
I had the same problem. Ask Clawdbot to optimize token usage. It cut my usage in half.

replies(1): >>testde+HE1

>>swords+Fi1
yeah it is only worth it if you are already paying otherwise it is not

>>turnso+1d1
They're neck and neck for me, in terms of PRDs, coding, and web searching. CC built the bulk of my current project, I did a lot of analysis of it with Antigravity (the interface is esp good for reviewing/commenting on long .md output files) and then, after building a simple roadmap of v2 features, OpenCode + Kimi was the most aggressive about running in a fairly autonomous manner and finishing the items on said roadmap. OC was also pretty hardcore about misinterpreting a limit I expressed earlier in one context as a limitation in another context -- which was fine, I'd rather say "no, really, you can go do that, I'm giving you permission and here's what I meant before" than find out it was too brazen.

It's a lot like managing two experienced mid- to sr- engineers each of whom have slightly different personalities and intro/extro verted personalities. CC has more personality but OC wants to race. They can both code, but for disparate tasks you might pick the personality and posture of one person over the other.

I find myself picking daily tasks based on which of the tools I'm in the mood to sit with. But across a few days I sit with all three.

>>fnordl+jU
For Claude Code, I now pay the $20/mo subscription for pro because I was spending more using it via API credits.

Even if I had to reload manually very often, I still would not enable auto reload. These APIs are crazy expensive and I'm not looking for a surprise bill.

>>turnso+X31
If it was oneshotted, I'd be curious to see the prompt

replies(1): >>turnso+Sg2

>>fnordl+jU
I was doing that initially, but I think the subscriptions are generally worth it for personal projects. $20/mo is good if you're like me and you can do this stuff maybe a couple nights a week, I haven't run into the limitations on that yet. The $100+ subscriptions are needed if you're doing it every day. YMMV

>>geek_s+Ul1
Just imagine what would happen if you asked again.

replies(1): >>deadba+rK1

>>testde+HE1
What if you asked the opposite?

>>Sharli+z31
I took it as "pyramid was built by slaves..." connotation

replies(1): >>Sharli+HM1

>>abusta+JL1
That's another good one, even though in reality they weren't.

replies(1): >>abusta+5Y1

>>lode+(OP)
Would have been $68 on DeepSeek, which is also imho very good.

I still have Opus review the shit out of & plan my work. But it doesn't need to be hands on keyboard doing the work.

>>lode+(OP)
Can't you just point it at a local ollama? It'd be slower, but free (except for your electricity bill).

>>Sharli+HM1
Huh, today I learned! Thanks

>>lode+(OP)
If you have an old M1 Macbook lying around, you use that to run a local model. Then it only costs whatever the electricity costs. May not be a frontier model, but local models are insanely good now compared to before. Some people are buying Mac Minis for this, but there's many kinds of old/cheap hardware that works. An old 1U/2U server some company's throwing out with a tech refresh, lots of old RAM, an old GPU off eBay, is pretty perfect. MacBook M1 Max or Mac Mini w/64GB RAM is much quieter, power efficient, compact. But even my ThinkPad T14s runs local models. Then you can start optimizing inference settings and get it to run nearly 2x faster.

(keep in mind with the cost savings: do an initial calculation of your cloud cost first with a low-cost cloud model, not the default ones, and then multiply times 1-2 years, compare that cost to the cost of a local machine + power bill. don't just buy hardware because you think it's cheaper; cloud models are generally cost effective)

replies(1): >>muwtyh+dq2

>>guluar+ab1
Isn't that explictly against the TOS? I feel like Anthropic brought out the ban hammer a few days ago for things like opencode because it wasn't using the apis but the max subscriptions that are pretty much only allowed through things like claude code.

>>RickS+UD1
I wouldn't say it was oneshotted, but it did produce a working MVP in one Plan execution. Meaning, I went back & forth a few times about requirements, it built a plan, and then CC spent just under 15 minutes writing the code. Once I got the credentials plugged in, the core integrations (Slack, gmail, IMAP, iCloud calendar) and agent loop did work. I can share the initial message if you're curious.

>>0xbadc+g12
> don't just buy hardware because you think it's cheaper

Surely there is also the benefit of data privacy and not having a private company creating yet another ad profile of me to sell later on?

>>guluar+ab1
No you can't, Anthropic keep blocking it

>>turnso+X31
I’d still love to see it

>>lode+(OP)
That's the sad thing. There are so many millions of talented under-employed people in the world that would gladly run errands or set up automations for you for $200-$1000 per month or whatever people are spending on this bot.

Developers trust lobsters more than humans.

The other wild thing is that many of these expensive automations that are being celebrated on X can already be done by voice using Siri, Google, or any MCP client.