For the past month or so I've been slowly having claude build something in the same ballpark. Basically something to nag you to take care of grown-up things so your boss/spouse/local municipality doesn't have to.
I was going to call it "Nagatha Christy", but the joke gets old after 48 hours. At the moment, its called "Jarbis" (old Simpsons reference).
For me, checklists are useful but I suck at creating them, maintaining them, etc. I want this thing to be able to look at my calendar/email/groupme and be able to say things like:
"Hey, you have 2 kid birthday parties this weekend and a soccer game - you're bringing snacks. You want me to update your shopping list?"
or
"The dentist office just sent out a reminder - you have an appointment on Thursday that's not on the calendar. It conflicts with your daily standup. You want me to create a task for you to resolve it?"
Its using: - AWS CDK - Telegram as primary chat interface - Trello/Jira/Something Custom - Integrations into GoogleCalendar and GMail - Ability to use Claude/OpenAI and different models
FWIW, if someone figures out how to create a reliable "secretary in a box" that I don't have to DIY but doesn't scream data-collection-watering-hole (facebook) I'd _happily_ pay $200 / mo for it. ;-)
Btw, I'm in the process of training my own small model so that I can run it on my cpu-only VPS and stop paying for API costs
I set $10 on fire the other day as I was running through some tests.
Like old school arcade games "Please insert more ${money} to keep playing...". Local, smaller, specialized (unix philosophy?) seems like the way to go so you don't bk yourself having AGI distill pintrest recipes to just recipes.
2. Access to my TODO list on Apple Notes and basically remind my ADHD brain that I ought to be doing something and not let it slip because it is uninteresting.
3. Have access to all models via API keys I configure and maintain a "research journal" of all the things I go to LLMs for - "research of bike that fits my needs" whatever and figure out if there needs to be a TODO about them and add if I say yes.
4. View my activity as a professional coach and nudge me into action "Hey you wanted to do this at work this year, but you haven't begun.. may be it is time you look at it Thursday at 3 PM?"
5. View my activity as a mental health coach and nudge me like "hey you're researching this, that and blah while X, Y and Z are pending. Want me to record the state of this research so you can get back to doing X, Y and Z?" or Just talk to me like a therapist would.
6. Be my spaghetti wall. When a new idea pops into my head, I send this secretary a message, and it ruminates over it like I would and matures that idea in a directory that I can review and obsess over later when there is time..
As you see, this is quite personal in nature, I dont want hosted LLMs to know me this deeply. It has to be a local model even if it is slow.
I wonder if the real unlock is moving the task forward in some way. “I know you were interested in X, and the research approach petered out, here and some new approaches we could try:”
“You’ve got two kids’ birthdays next week, shall I order some legos?”
I'm actually going to take it further and use clawd to check Jira, linear, slack, and Apple reminders and help me to unify and aggregate them - as I'll often remember and record a reminder on Siri - and kind of ping me about these and adjusting dates when they're overdue so nothing slips through too past due
Apple has a big opportunity with this.
It has a handful of core features:
- key obligations & insights are grok'd from emails and calendar events - these get turned into an ever-evolving always-up-to-date set of tasks; displayed on a web UX and sent to you in a personalized daily briefing - you can chat via telegram or email with the agent, and it can research/query your inbox or calendar/create or resolve tasks/email others/etc - if the AI identifies opportunities to be proactive (eg upcoming deadline or lack of RSVP on an event), it pings you with more context and you can give the green light for the agent to execute
Generally trying to identify finite list of busywork tasks that could be automated, and let users delegate the agent to execute them. Or, in the future (and with high enough confidence), let the agent just execute automatically.
Built the stack on Cloudflare (d1, Cloudflare Workers/Workfolows/queues, Vectorize), using gemini-3-flash as the model.
Would love any feedback: https://elani.ai.