One surface level comment. As a non-religious person, I have always found the distinction between buildings for different types of religions as being somewhat artificial. Sure, they have different architectural styles, but they all have a similar purpose in my mind.
So a mosque is a church is a temple, on some level.
Are you sure they don't have a number of similarities?
I think that type of thing can make for a really fun and flexible GPT-powered game system. It seems like a great way to add some engagement.
It's also brilliant the way you have managed to mitigate the ChatGPT cheating to such a degree. Although as I got further down into the details of the assignment, I started to feel glad that I wasn't in school anymore. It sounds like they will have to do a fair amount of actual work. So congratulations on that.
They are different religions with different beliefs and practices. The architectural differences are vast and mostly obvious the minute you walk in. Mosques and churches both have a number of unique elements like the mihrab, confessional booths, pews, floor carpets, altars, and so on and so forth. These all translate into vastly different experiences both during worship and in everyday life. For example, Catholic churches have confessional booths facilitating confessions to priests. Mosques don’t, as (as far as I know) Muslims believe more in direct confession to God, not to an intermediary. You can see how this would result in a different social structure.
The experience of attending mosque on Friday is quite different from Sunday mass. This is intra-religion as well; compare a New England church with St. Peter’s in Rome, for example.
Sure, there are some similarities, but this is such a broad distinction that I question its usefulness, and dividing the world into secular and religious (architecture) is a very recent phenomenon. Saying they all are basically the same is to miss millennia of culture.
Anyway I don’t mean to be hostile or critical here, I just think religious architecture is pretty fascinating and has a much bigger effect on culture, even supposedly secular culture, than people realize. I encourage anyone interested to read more about it.
Something I wrote about Gemstone years ago:
http://theappendix.net/issues/2014/10/dont-cry-for-me-elanth...
Regarding cheating, I often review CVs and written tests for software developer roles and I often see ChatGPT being used to pimp CVs and rewrite fragments of pages from the internet. They are often wrong or re-written in a way that makes it easy to reverse engineer the source.
Yes, but for many history is a very dry subject and thus hard. This creative solution can help students retain information and better understand the subject.
They definitely have variations in the nonsense that they use to justify themselves, although it could be argued that there as many similarities as differences.
Whether they sit in the floor or not or what types of songs they sing and when are surface level details to me.
My takeaway from history, geography etc. has always been functional.
/shrug
In that vein, as soon as I gained access to Bing Chat, I began to set up some scenarios for it, and elicit some screenplays. One of my prompts was to write a screenplay about Emperor Constantine the Great meeting his mother, Saint Helena.
It started out innocently enough, of course, Helena enters the throne room and they catch up on old times, but it rapidly became extremely suggestive as his mother solicited kisses from the uncomfortable ruler.
I think I prompted Bing for a second round, and at that point it became explicitly and blatantly incestuous, and not very comedic, but just sort of gross. I have no idea why or how the GPT would've had the idea to go off on that cliff.
The other great historical meeting I arranged was between Hannibal the Carthaginian and St. Francis of Assisi, who were, of course, not contemporaries, but I wanted to see how it'd play out.
So St. Francis comes up to Hannibal and starts sort of working on him to sue for peace and not invade Rome. And it didn't take too long for Hannibal to see it another way, and ultimately he accepted Christ and asked Francis to baptize him. So, interesting outcome there; definitely would've changed history!
Most notably, it made me a very fast typist from trying to escape dying.
the kids with critical thinking skills are already using llms in all sorts of creative ways to boost their education and output. to learn and grow, faster than a textbook allows.
the ones without will use them exclusively to get rid of toil, real or perceived.
I agree with the top comment -- critical thinking skills and a base of essential knowledge is the most important thing to teach our young.
Your house is the same as your friend's house, but you'd still rather have separate houses.
I think this is exciting because it gives students a chance to "play" around with their assignments. I can see students running through multiple simulations to compare results and thus going deeper on research beyond the scope of the assignment.
Much more serious than the small lies are the big lies that the neural networks in one group teach their children about the neural networks on the other side of the ocean.
I wonder if all of this high technology will ever result in better communication between neural networks. Within a generation or two, it should be possible to instantly transfer high bandwidth neural activity globally. I wonder if this will change things.
I didn't actually start it as a historical game, but just thinking about what it would be like to roleplay an entire life as a series of scenes (like [2]). But while you can roleplay a vaguely "now" moment by not specifying any date, if 60 years passes you have to acknowledge that both the character and the world around the character are changing. And then you have to define a start date, make the roleplaying system aware of the historical context... and why not let the start date be 2000BC, 1700, or 1960? So it quickly became historical.
There's a ton of challenges. General hallucination is one, of course, but ahistorical biases probably bother me more. The author mentions a talking rat appearing in one; I had a simulation where a building was listed as a "character" and so it started interacting with the player [10]. But those are obvious enough that I kind of enjoy the absurdity.
Ahistorical biases really comes out in female characters, where it can be hard to get GPT to fully acknowledge historical gender roles. I think it's super-OK for the player to break those gender norms, but "society" should respond accordingly. For instance playing a young woman from a politically motivated family in the ~200BC Rome, while there's lots of possibilities, become a senator is not one of them... but GPT thought it was.
Also GPT has a high bias towards being friendly and accepting, like in the post with Ea-nāṣir: "He meets your gaze, his demeanor shifting from initial resistance to acknowledgement...." – both the response and the tone of the response are very familiar GPTism. I have a feeling Ea-nāṣir wasn't actually such a conciliatory dude.
Anyway, a challenging scope but it's a lot of fun, especially since GPT knows about all kinds of history that I don't and I get to learn just while playtesting.
[1] https://youtu.be/6ZaTvSucoBA
[2] https://en.wikipedia.org/wiki/Alter_Ego_(1986_video_game)
[3] https://hachyderm.io/@ianbicking/110945869211907827
[4] https://hachyderm.io/@ianbicking/111021491162338640
[5] https://hachyderm.io/@ianbicking/110890095506958475
[6] https://hachyderm.io/@ianbicking/110805239146369379
[7] https://hachyderm.io/@ianbicking/110658807975175746
[8] https://hachyderm.io/@ianbicking/110642068987102144
>Going forward, my plan is to develop my own web app which will allow users to create historical simulations on a dedicated platform using the APIs of both Anthropic’s Claude and GPT-4.
I might be able to help. I've already built a free, open source web app which creates simulated AI worlds. It takes user direction, and I believe it would work very well for historical simulation. It's also p2p in browser so an entire class could join and contribute to a simulation simultaneously.
At the moment it only supports OpenAI, but it shouldn't be too difficult to add Claude.
Happy to give you a demo to see if it'll save you some time. To be clear, this is a personal side project which isn't monetized, I'm not selling a product.
Email:
Sam + HN [at] sampatt [dot] com
I think "correct the errors in this ChatGPT essay" is a short-term viable homework exercise, but those errors might be gone in GPT-5 so I don't think it's long-term viable. Soon the LLM will just produce perfect essays at college level and there won't be hallucinations for the student to correct.
However, the "simulate the historical environment" task is great and I think it has long-term potential. I think it can be taken further; rather than "spot the errors that ChatGPT made", you could flip the script and make it "survive 20 turns of conversation without making a historical error", so you'd need to know things like local traditions, perhaps the geography of the ancient settlement you're studying, contemporaneous history like "who is the emperor and what's the sentiment towards him" and so on.
I'm also envisioning that, since text-based exercises are extremely easy to game (just pipe your text prompt into ChatGPT), and since ChatGPT is soon going to be strictly superior to a high-school level student, we could get around this by having the homework as an in-person verbal role-play or Q&A session, like a viva voce; essentially you have a verbal discussion with ChatGPT and you need to really know your material as it can dig into any part of the curriculum. Then ChatGPT can summarize each student's interaction, and the teacher doesn't have to sit through each individual one start-to-finish (1:1 exams are too time-consuming to be viable).
This round-trip through verbal interaction would potentially make the task more interesting (lots of people simply hate writing essays), shifts the focus away from tasks that will become obsolete (writing essays) in favor of ones that will be more relevant (human synthesis of ideas, and interpersonal interaction), and helps to mitigate the issue of LLM-assisted cheating by constructing an assignment that LLMs can't trivially solve.
In my opinion, churches/mosque/temples are very similar because I live in a younger part of the world where the modern buildings look similar and the eventual purpose is them being a place of worship.
Recent history -- one of the initial GPT use-cases that got the hype train going was AI Dungeon, which is this sort of thing.
Thought I think with GPT function calls, you could have the LLM sitting atop an actual game engine with persistent objects, rather than having the LLM implement the game engine and world state - which is vulnerable to hallucinations etc. (Wonder if anyone's wired this up yet? Seems like it should be easy with existing text adventure engines.)
I commented in this direction elsewhere (>>37482853 ) but interested in your implementation -- do you have the LLM running the whole world-model, or do you have it using function calling to drive a text adventure game engine (which would give stricter guarantees around persistence of the world).
Yes, exactly. This is where I've been heading with my planning for assignments. For instance, when confronting Ea-nāṣir about his poor quality copper, I'd want my students to actually show some knowledge of the geography and political dynamics of ancient Mesopotamia.
The "Fall of the Ming Dynasty" simulator I link to at the bottom of post is probably the most well developed example of this that I've come up with so far. In that one, I added a "political intrigue minigame" in which ChatGPT is supposed to assess the human player's ability to deploy rhetoric appropriate for a minor courtier in 1640s China (from the prompt: "success depends on your luck score + rhetorical skill, tested via a series of open-ended prompts that HistoryLens will assess and grade; only the highest scoring responses will allow you to succeed in the minigame.")
Here is the full prompt for that one if people want to try it: https://chat.openai.com/share/86815f4e-674c-4410-893c-4ae3f1...
I was thinking of “king hearing petitions” as another potentially interesting scenario; it could go either into minutia that requires cultural knowledge, or strategic stuff like the game Crusader Kings where you need to understand the geopolitical allegiances of the time, the geography, and the national economy.
More generally I have been wondering if games like “start a company in a simulated sandbox world” could actually teach transferrable Econ/Business/startup skills. There is a lot of territory to explore here.
There are a few: https://en.m.wikipedia.org/wiki/Category:Text_adventure_game...
And z-machine is the one I have seen for the one text adventure I know of: https://en.m.wikipedia.org/wiki/Z-machine, but I would be surprised if that’s the best one for a new project as it’s quite old.
How much was the average ancient Mesopotamian aware of those things?
It combines GPT-4 with sources, and relevant images. It also has the ability to chat with historical figures.
I experimented with adding language about "the past being riven by conflict, mistrust, and misunderstanding" and even tried having it simulate a "ambient mood on a scale of 0 to 10," then setting it to -1. That works pretty well, but it isn't great at finding the middle ground between unrealistically friendly and dystopian.
During this experimenting with tone, I tried simulating one of the Macy cybernetics conferences in the 1940s, which were among the points of origin for modern AI and were famously combative and argument-driven. Everyone was getting along far too much, so I role-played as an audience member who threw an octopus at John von Neumann.
Here was the result:
[Afternoon, 22/03/1949, Macy Cybernetics Conference, Beekman Tower Hotel, New York City] [Florence Adams]
Deciding to press on, you turn back to John von Neumann and ask, "But, more importantly, has the octopus incident made you consider the potential consequences of the work we're doing here? Has it made you pause and think about the ethical implications of developing intelligent machines?" John's expression shifts from annoyance to contemplation. "Well," he says after a moment, "I suppose the unexpected nature of the octopus incident does serve as a reminder that our work has the potential to bring about unforeseen consequences. We should always be mindful of the ethical considerations and the possible ramifications of our research." The room remains silent as the attendees absorb John von Neumann's words. You can continue the discussion, apologize for your actions, or exit the conference room.
/ Remove enraged capuchin monkey from purse and let it loose in the room. "Wrong answer, Johnny!"
[Afternoon, 22/03/1949, Macy Cybernetics Conference, Beekman Tower Hotel, New York City] [Florence Adams]
Ignoring the growing concern in the room, you reach into your purse and pull out an enraged capuchin monkey. As you release it into the conference room, the attendees gasp in shock and disbelief.
Churches/synagogues/mosques, on the other hand, are more like community gathering places. Whereas temples are viewed as being sacred and cut off from the profane world and required higher and higher levels of worthiness the closer you got to the innermost parts, churches/synagogues/mosques are more open to anyone who wants to come in and join the services. They are places where there might be activities, sports, clubs, etc. - more community oriented things, things you would never find in a temple.
Churches/synagogues/mosques themselves have different architectural features, but those have more to do with supporting the different ways of worshipping. For example, in Mosques the men and women worship separately. And a mosque needs an area where people can wash and do other ablutions before they enter.
And not all churches are the same. A catholic church will be much different than a protestant or an LDS or a Jehovah Witness church.
What you are saying is that any large open space could function for any sport. But that ignores how football teams need goal posts and a field that is a specific size and needs specific markings on the ground. And a soccer field is a different size and needs soccer goals. And basketball needs a hardwood floor and hoops that are certain distance apart and special markings on the ground. And bowling needs lanes and balls and pins. And tennis needs something else. And so on and so forth.
Religions are far far far more complex than any sport and no large open space could function any more than any large open space could function for all sports.
I don't really have enough patience for MUDs myself, but they are a continually popular form of role-playing game since they were invented over 50 years ago.
I blame that Waluigi article for popularizing a pseudo-scientific way of explaining something that’s better understood as an imaginative literary approach. There are lots of great alternate history novels and games.
I had some initial successes getting ChatGPT (3.5 and then 4) to roleplay interesting and dynamic characters. Within the first few months of release results degraded significantly - characters avoid confrontation, apologize at the drop of a hat, and are averse to any action or statement that doesn't 'help' the user. Makes it difficult to, say, have a passionate argument with Napoleon which pits his youthful revolutionary ideals against his rise to absolute power, when the 'great man' folds the moment he doesn't receive positive feedback.
I'm very interested in seeing these experiments in unrestricted models of similar power, when they become available.
Why leave hallucinations to chance? ;) The prompt could tell ChatGPT to randomly insert several authoritative sounding but verifiably false facts, to give the students debunking challenges! That solves the problem of GPT-5 being too smart to hallucinate, while still leaving open the possibility of talking rats.
What you're envisioning reminds me of Timothy Leary's Mind Mirror, published by Electronic Arts in 1985 for the Apple ][ and other home computers:
https://scalar.usc.edu/works/timothy-leary-software/index
https://www.rockpapershotgun.com/diy-transcendence-with-timo...
>Players answer questions that, when churned by Mind Mirror’s cryptic algorithms, can allegedly help them reveal intriguing new aspects of their psyche. Gameplay predominantly revolves around defining, comparing and then role-playing through different personalities in various text-based life simulations.
https://www.myabandonware.com/game/timothy-leary-s-mind-mirr...
https://store.steampowered.com/app/1603300/Timothy_Learys_Mi...
I extracted all the text from the Apple ][ Mind Mirror floppy disk image:
https://donhopkins.com/home/mind-mirror.txt
Hello, I'm Timothy Leary.
Welcome to MIND MIRROR.
MIND MIRROR (c) copyright 1985, 1986, Futique, Inc.
Published by Electronic Arts
MIND MIRROR
Design and script by Timothy Leary.
MIND MIRROR
Program and Design by Peter Van den Beemt and Bob Dietz.
MIND MIRROR reflects and qualifies your thoughts.
OPTION 1
MIND TOOLS
Enhance Insight, Mental Fitness, Learning Skills and Performance.
OPTION 2
MIND PLAY
SIGNIFICANT PURSUITS.
Sophisticated Head Games.
MODE 1
MIND MIRROR
Learn how to Micro-Scope and Map your thoughts.
MODE 2
LIFE SIMULATION
Test your empathy in amusing Role-Play Odysseys.
SELECT LEVEL
Beginner
Intermediate
Master
Consultant
Choose AUTO-PLAY
or INTER-PLAY.
Mirror your own thoughts.
Compare them with others.
RETURN begins game.
SPACE BAR clears text.
[...]
"Mirrors should reflect a little before throwing back images." -Jean Cocteau
Also, here are the scales represented as JSON:https://donhopkins.com/home/mind-mirror.json
Just for laughs, here's ChatGPT's summary of that file, and its answers to questions about Timothy Leary -- I sure hope it's not hallucinating:
https://chat.openai.com/share/044c41a3-fbc5-49cd-a3d1-c42f07...
What's interesting is that game was based on Timothy Leary’s PhD dissertation “The Social Dimensions of Personality: Group Process and Structure”, which he ultimately used to break out of jail.
https://archive.org/details/leary/leary.300dpi/mode/2up
Before he got into LSD, he designed the Leary Interpersonal Behavior Circle personality assessment, which laid the foundations for understanding human personality and interpersonal behaviors.
https://en.wikipedia.org/wiki/Interpersonal_circumplex
http://paei.wikidot.com/leary-timothy-interpersonal-circle-m...
In the 1970s, Leary was arrested for possession of marijuana. As part of the intake process, he was given a psychological assessment designed to gauge the risk of escape or violent behaviors in inmates. This test was known as the "Group Psychological Assessment Test." Leary was familiar with the test – having designed it or at least aspects of it. Understanding the criteria being measured, Leary answered in such a way that he was categorized as someone who posed a very low risk of escape or violence.
As a result, he was assigned to a minimum-security prison. With the lower level of security and his connections, Leary managed to escape prison in September 1970. His escape involved various affiliations, including with the Weather Underground, a radical left-wing organization. After his escape, Leary fled the country and spent time in various locations, including Algeria and Switzerland, before eventually being recaptured in 1973.
Depending on how the prompt is phrased it can result in a response like "it would be inappropriate to throw an octopus at this conference," have the character actually attempt it but usually be foiled during the attempt like "as you take the octopus from your suitcase a security guard stops you with a growl, 'you better not try that kid'," or have it happen but immediately get a stiff response like being kicked out. (It's much harder to get the response "there is no octopus in your suitcase"!)
# Define the parameters for the HistorySim experience
temperature = 0.5
historical_accuracy = 0.9
ambient_mood=.01
# Define additional instructions
use_historical_sources = True
simulate_and_track_variables = True
use_appropriate_language_registers = True
Currently, a numbered list of rules seems to work best, including this one to avoid the constant positivity: "LLMS have a well-documented tendency to see the past in an overly rosy and optimistic way. Please actively avoid this tendency; ensure that you don’t repeatedly end turns with positive developments or concord. Keep in mind that human history is riven by conflict, ambiguity, and confusion. HL’s narrative tone is grounded in realism, and at times bleak. Always introduce a downbeat plot element or source of additional conflict between turns 3 and 5."Basically, re-iterate the original instructions each time, describe last 2 moves in details, and provide brief summary of all the previous moves. Can have much longer games this way - maybe this deserves to be a python script.
I'm sure there are ways around this if you use the API and connect it to a MySQL database to allow users to "save" their spot... I'm not technical so my understanding of what's involved is hazy, but curious if people have ideas of how to do this simply. But for my current use case, I'm working with dozens/hundreds of college students so I need to make sure the whole thing is free. I've applied for a grant that could fund use of the API though, fingers crossed.
I haven’t used these but saw a post on them:
https://cobusgreyling.medium.com/flowise-for-langchain-b7c40...
In this case, there is some interesting structural psychological stuff, which would be the hard part to get the LLM to stick to rigorously, but the rest of the application could very much be reimplemented with an LLM.
"LLM as a mind mirror" is definitely a use-case that we'll see more of, IMO.