Refresh browser-control local training and deployment workflow #71

Open
oceantime wants to merge 11 commits into Liquid4All:main from oceantime:browser-control-training-refresh

oceantime commented Mar 9, 2026

Summary

  • fix the evaluation entry point by switching src/browser_control/evaluate.py to a valid debug config
  • align local GRPO training with the Android inference flow by updating prompts, action parsing, reward shaping, and adding configs/lfm2_350m_local_full_v2.yaml
  • improve local Docker training by supporting an external MiniWoB server, adding AXTree debug injection, and stabilizing the training image setup
  • update local GGUF conversion to use the latest checkpoint path and run without interactive confirmation
  • normalize project entry docs with AGENT.md / README.md, remove legacy workflow files, refresh evaluation screenshots, and ignore workflow metadata directories in .gitignore
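
The "latest checkpoint path" step in the GGUF conversion bullet could be sketched roughly as follows. This is a minimal illustration, not the code in this PR: the function name `latest_checkpoint` is hypothetical, and it assumes checkpoints are saved as `checkpoint-<step>` subdirectories of a single output directory (the Hugging Face Trainer naming convention), which the PR description does not confirm.

```python
import re
from pathlib import Path

# Assumed naming scheme: "checkpoint-<step>" (not confirmed by the PR).
_CHECKPOINT_RE = re.compile(r"^checkpoint-(\d+)$")


def latest_checkpoint(output_dir):
    """Return the checkpoint-<step> subdirectory with the highest step, or None."""
    root = Path(output_dir)
    if not root.is_dir():
        return None

    best_step, best_path = -1, None
    for child in root.iterdir():
        match = _CHECKPOINT_RE.match(child.name)
        if child.is_dir() and match:
            # Compare steps as integers so checkpoint-100 beats checkpoint-20.
            step = int(match.group(1))
            if step > best_step:
                best_step, best_path = step, child
    return best_path
```

Comparing the step numbers as integers matters here: a plain lexicographic sort of the directory names would rank `checkpoint-9` above `checkpoint-100`. Returning `None` instead of prompting lets a conversion script fail fast, which fits a run without interactive confirmation.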

Testing

  • verified local baseline workflow previously recorded in project notes:
    • uv sync
    • debug fine-tune run
    • make evaluation over 10 episodes
  • refreshed media/episode_0_step_0.png through media/episode_9_step_0.png from the latest evaluation pass
  • no additional full test/build run was executed after the final doc and ignore-rule cleanup

Notes

  • .project/ and .learnings/ are intentionally excluded from version control
  • the only remaining untracked item is the outer-repo path ../../docs/, which is not part of this project PR
