×
all 7 comments

[–]gwern 3 points4 points  (2 children)

What sort of challenges and bugs have you had implementing MuZero? Everything I've heard from people trying to implement a MuZero has been that it is surprisingly subtle and tricky to get working.

[–]silverlight6[S] 1 point2 points  (1 child)

Our biggest problem so far has been throttling the cpu which is causing every action to take longer than it should. Our GPU operations are 20% of our time costs currently and when scaling up, that percentage only going down.

[–]dx_rd_to_DX 1 point2 points  (0 children)

Use multiprocessing to have more GPU workers could help. My code based on EfficientZero https://github.com/YeWR/EfficientZero is utilizing CPUs and GPUs to 90%. It uses Ray for multiprocessing and splits Reanalyze into CPU and GPU workers to maximize resource utilization. By the way, it's not converging to optimal policy well: it gets stuck at 50% optimal episode return at with a small amount of training. Have you had this issue before?

[–]jeremybub 2 points3 points  (0 children)

Very cool, I thought you might like to know about https://github.com/JDBumgardner/stone_ground_hearth_battles which plays Hearthstone Battlegrounds, a similar auto-chess game.

[–]Epicnightt 0 points1 point  (0 children)

Thats incredibly cool. Im a complete beginner to ML and RL so I barely understand anything, but is there any chance of a youtube series going over the development in the future? That would be a very interesting thing to watch and I bet it would get you alot of exposure to your project aswell.

[–]Unlikely-Leg499 0 points1 point  (1 child)

Does battle simulation of TFT contains all features of the game? And I agree, a youtube video explanation would be great

[–]silverlight6[S] 0 points1 point  (0 children)

It contains all of the features of set 4 of the game. I may make a video a little later once we get a model running.