6 Commits

Author SHA1 Message Date
let5sne.win10
b97d3f3a9f Improve Video Learning Agent with action-based detection and analysis toggle
- Change video detection from screenshot hash to action-based (Swipe detection)
- Add enable_analysis toggle to disable VLM screenshot analysis
- Improve task prompt to prevent VLM from stopping prematurely
- Add debug logging for action detection troubleshooting
- Fix ModelResponse attribute error (content -> raw_content)

Co-Authored-By: Claude <noreply@anthropic.com>
2026-01-10 01:47:09 +08:00
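
The action-based detection described in b97d3f3a9f can be sketched roughly as below. This is only an illustration: the `SwipeVideoCounter` class and the assumed action shape (a dict with an `"action"` key) are hypothetical and are not taken from the repository.

```python
from dataclasses import dataclass


@dataclass
class SwipeVideoCounter:
    """Hypothetical helper: treats every Swipe action as a move to a new video."""

    count: int = 0

    def on_action(self, action: dict) -> bool:
        # Assumed action shape: {"action": "Swipe", ...}; the real agent may differ.
        name = str(action.get("action", "")).strip().lower()
        if name == "swipe":
            self.count += 1
            return True
        return False
```

Compared with hashing screenshots, counting swipes does not depend on how much the frame changes while a single video plays.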
let5sne.win10
6b770832aa Skip app startup screens with warmup logic
- Added warmup counter to skip first 3 steps after entering app
- Reset counter when leaving target app
- Prevents recording splash screens as videos

Co-Authored-By: Claude <noreply@anthropic.com>
2026-01-09 23:56:21 +08:00
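
A minimal sketch of the warmup logic from 6b770832aa, assuming the agent checks the foreground app once per step. `WarmupGate` and its parameter names are hypothetical; only the "skip the first 3 steps, reset on leaving the app" behavior comes from the commit message.

```python
class WarmupGate:
    """Hypothetical helper: suppresses recording during the first steps in the target app."""

    def __init__(self, target_app: str, warmup_steps: int = 3):
        self.target_app = target_app
        self.warmup_steps = warmup_steps
        self._steps_in_app = 0

    def should_record(self, current_app: str) -> bool:
        if current_app != self.target_app:
            # Leaving the target app resets the counter, so the next
            # entry goes through warmup again.
            self._steps_in_app = 0
            return False
        self._steps_in_app += 1
        # Steps 1..warmup_steps are treated as splash/startup screens.
        return self._steps_in_app > self.warmup_steps
```

With the default of 3 steps, the fourth consecutive step inside the target app is the first one eligible for recording.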
let5sne.win10
a823c03788 Improve screenshot analysis prompt and add debug logs
- Simplified prompt to force JSON-only response
- Added debug logs to track VLM response and parsing
- Better error messages for troubleshooting

Co-Authored-By: Claude <noreply@anthropic.com>
2026-01-09 23:37:35 +08:00
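
Even with a JSON-only prompt as in a823c03788, a VLM may wrap its answer in extra text, so a tolerant parser is useful. The sketch below is an assumption about how such parsing could look, not the repository's code; `parse_analysis` is a hypothetical name.

```python
import json
import re
from typing import Optional


def parse_analysis(raw: str) -> Optional[dict]:
    """Hypothetical helper: extract the JSON object from a VLM reply, or return None."""
    # Take everything from the first "{" to the last "}" in the reply.
    match = re.search(r"\{.*\}", raw, re.DOTALL)
    if match is None:
        return None
    try:
        return json.loads(match.group(0))
    except json.JSONDecodeError:
        return None
```

Returning `None` instead of raising lets the caller log the raw response and carry on with the recording session.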
let5sne.win10
c4325d57d4 Fix record_video method to accept analysis fields
- Add missing parameters: shares, tags, category, elements
- Analysis results are now saved to VideoRecord

Co-Authored-By: Claude <noreply@anthropic.com>
2026-01-09 23:23:57 +08:00
let5sne.win10
195a93b7e0 Add screenshot content analysis using VLM
Features:
- ScreenshotAnalyzer class for VLM-based image analysis
- Real-time analysis during video recording
- Extract likes, comments, tags, category from screenshots
- Frontend display for category badges and tags
- Batch analysis API endpoint

Co-Authored-By: Claude <noreply@anthropic.com>
2026-01-09 23:20:52 +08:00
let5sne.win10
5b3f214e20 Add Video Learning Agent for short video platforms
Features:
- VideoLearningAgent for automated video watching on Douyin/Kuaishou/TikTok
- Web dashboard UI for video learning sessions
- Real-time progress tracking with screenshot capture
- App detection using get_current_app() for accurate recording
- Session management with pause/resume/stop controls

Technical improvements:
- Simplified video detection logic using direct app detection
- Hash the full base64 screenshot data for more sensitive change detection
- Immediate stop when target video count is reached
- Fixed circular import issues with ModelConfig

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2026-01-09 22:54:57 +08:00
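
The screenshot change detection mentioned in 5b3f214e20 can be illustrated as follows, assuming each screenshot arrives as a base64 string and that the whole string is hashed. `ScreenshotChangeDetector` is a hypothetical name, and SHA-256 is an assumed choice of hash.

```python
import hashlib
from typing import Optional


class ScreenshotChangeDetector:
    """Hypothetical helper: reports whether a screenshot differs from the previous one."""

    def __init__(self) -> None:
        self._last_digest: Optional[str] = None

    def changed(self, screenshot_b64: str) -> bool:
        # Hash the entire base64 payload, so any byte-level difference counts.
        digest = hashlib.sha256(screenshot_b64.encode("ascii")).hexdigest()
        is_new = digest != self._last_digest
        self._last_digest = digest
        return is_new
```

Hashing the full payload flags any pixel difference, which makes it very sensitive while a video is playing; commit b97d3f3a9f later replaces this approach with action-based detection.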