Okay this is insane... WebGPU-accelerated semantic video tracking, powered by DINOv3 and Transformers.js! π€― Demo (+ source code): webml-community/DINOv3-video-tracking
This will revolutionize AI-powered video editors... which can now run 100% locally in your browser, no server inference required (costs $0)! π
How does it work? π€ 1οΈβ£ Generate and cache image features for each frame 2οΈβ£ Create a list of embeddings for selected patch(es) 3οΈβ£ Compute cosine similarity between each patch and the selected patch(es) 4οΈβ£ Highlight those whose score is above some threshold
... et voilΓ ! π₯³
You can also make selections across frames to improve temporal consistency! This is super useful if the object changes its appearance slightly throughout the video.
Excited to see what the community builds with it!
2 replies
Β·
reacted to MoritzLaurer's
post with π8 months ago
Quite excited by the ModernBERT release! 0.15/0.4B small, 2T modern pre-training data and tokenizer with code, 8k context window, great efficient model for embeddings & classification!
This will probably be the basis for many future SOTA encoders! And I can finally stop using DeBERTav3 from 2021 :D
π Releasing a new zeroshot-classifier based on ModernBERT! Some key takeaways:
- β‘ Speed & efficiency: It's multiple times faster and uses significantly less memory than DeBERTav3. You can use larger batch sizes and enabling bf16 (instead of fp16) gave me a ~2x speed boost as well - π Performance tradeoff: It performs slightly worse than DeBERTav3 on average across my zeroshot classification task collection - π§ Use cases: I recommend using it for scenarios requiring speed and a larger context window (8k). - π‘ Whatβs next? Iβm preparing a newer version trained on better + longer synthetic data to fully leverage the 8k context window and improve upon the training mix of my older zeroshot-v2.0 models. I also hope that there will be a multilingual variant in the future.
I'm excited to announce that Transformers.js V3 is finally available on NPM! π₯ State-of-the-art Machine Learning for the web, now with WebGPU support! π€―β‘οΈ
Install it from NPM with: πππ π @πππππππππππ/ππππππππππππ
I'm excited to announce that Transformers.js V3 is finally available on NPM! π₯ State-of-the-art Machine Learning for the web, now with WebGPU support! π€―β‘οΈ
Install it from NPM with: πππ π @πππππππππππ/ππππππππππππ