[P] AI Learns PvP in Old School RuneScape (Reinforcement Learning)

CanadianTuero · 2024-02-11T03:07:29+00:00

As a fellow scaper, this is very interesting! I've always wanted to do something similar, especially creating some sort of a stripped down engine so that forward search techniques could be applied to it. I'll have to check out how you integrate with elvarg. If you don't mind me asking, what resources did you use to figure out how to create any hooks for the custom environments?

KomradKot · 2024-02-11T04:45:09+00:00

I've always thought of OSRS as an amazing potential environment for RL agent research. The tile system and low tick rate allows for easier state representation and simulation than games with higher resolutions. The game world is also extremely rich and quests are quite varied and often require reading between the lines to figure out what needs to be done (no repetitive kill y NPC x times trope like other MMOs). With the availability of open source clients, and p-server code, I'm thinking a "tutorial island to dragon slayer challenge" would be a good benchmark for autonomous generalist agents.

preordains · 2024-02-11T05:56:46+00:00

I have been looking at your code for longer than I would like to admit and Im struggling to see one thing: does this RSPS allow you to interact with it by only sending actions, and returns to you an observation? How does the interaction with the game take place?

itsPixels · 2024-02-11T08:59:12+00:00

This is absolutely incredible work! How would you think the model would fair in a more typical edge style fight instead of nhing? If traning is optimized more towards ko potential instead of outlasting. Might be something I need to try myself as its the style that interests me more. Anyhow, this is just fantastic (and slightly worrying)!

infinitay_ · 2024-02-11T10:06:29+00:00

I was wondering when someone would finally do something like this given how prominent RSPS' are. The same seems like a match made in heaven for reinforcement learning. Given how many people bot/cheat in OSRS, it's only a matter of time until PVP is flooded with PK bots with the help of this.

Not to blame you OP, this is a really cool project and great work. But I am sure there will be a script-kiddie adding support for their cheat clients given a month or so.

Anyways, I find it fascinating how it learns that it's better to stand under your opponent when in combat, so your opponent can't hit you as easily. Further everything is done tick-perfect and even one-ticking such as armor swaps so it's even more advanced. One idea that I had when considering RL within an RSPS was manipulating the game tick rate for faster training. Although, thinking about it again, now I'm not too sure how to scale it back up to 600. First thought was to just add a delay to the actions. Second thought is to fine tune the model on 600ms/tick after you train it on say 100ms/tick.

hazard02 · 2024-02-11T09:53:33+00:00

How important is the novelty reward?

low-day-leh-sun · 2024-02-12T02:04:56+00:00

Good work !!

MachineLearning

Rules For Posts

+Research

+Discussion

+Project

+News

@slashML on Twitter

Chat with us on Slack

Beginners:

MODERATORS