-
Notifications
You must be signed in to change notification settings - Fork 216
Refresh browser-control local training and deployment workflow#71
Open
oceantime wants to merge 11 commits intoLiquid4All:mainfrom
Open
Refresh browser-control local training and deployment workflow#71oceantime wants to merge 11 commits intoLiquid4All:mainfrom
oceantime wants to merge 11 commits intoLiquid4All:mainfrom
Conversation
Summary
- fix evaluation entry by switching
src/browser_control/evaluate.pyto a valid debug config - align local GRPO training with the Android inference flow by updating prompts, action parsing, reward shaping, and adding
configs/lfm2_350m_local_full_v2.yaml - improve local Docker training by supporting an external MiniWoB server, adding AXTree debug injection, and stabilizing the training image setup
- update local GGUF conversion to use the latest checkpoint path and run without interactive confirmation
- normalize project entry docs with
AGENT.md/README.md, remove legacy workflow files, refresh evaluation screenshots, and ignore workflow metadata directories in.gitignore
Testing
- verified local baseline workflow previously recorded in project notes:
uv sync- debug fine-tune run
make evaluationover 10 episodes
- refreshed
media/episode_0_step_0.pngthroughmedia/episode_9_step_0.pngfrom the latest evaluation pass - no additional full test/build run was executed after the final doc and ignore-rule cleanup
Notes
.project/and.learnings/are intentionally excluded from version control- the only remaining untracked item is the outer-repo file
../../docs/, which is not part of this project PR
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.