So if I'm understanding this correctly:
The SAM paper from this past April (which enabled zero-shot segmentation on any image, seemingly better than even OpenAI's CLIP) used a ~600M-parameter ViT model to generate image embeddings. And to make generating those embeddings less computationally expensive, EfficientSAM replaces that model with a smaller ViT encoder that was pre-trained with a masked-autoencoder (MAE) style objective?
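To make the scale concrete, here's a minimal sketch using Meta's segment_anything package: the ViT-H image encoder alone is roughly 630M parameters, and set_image() is the expensive embedding pass that EfficientSAM (and MobileSAM before it) try to shrink. The checkpoint filename is the one Meta released; adjust to wherever you downloaded it.

```python
# Sketch: load the original SAM ViT-H, count the image-encoder params,
# and run a single point prompt. Assumes segment_anything is installed
# and the ViT-H checkpoint is in the working directory.
import numpy as np
import torch
from segment_anything import sam_model_registry, SamPredictor

sam = sam_model_registry["vit_h"](checkpoint="sam_vit_h_4b8939.pth")
encoder_params = sum(p.numel() for p in sam.image_encoder.parameters())
print(f"image encoder params: {encoder_params / 1e6:.0f}M")  # ~630M for ViT-H

predictor = SamPredictor(sam)
image = np.zeros((512, 512, 3), dtype=np.uint8)  # stand-in for a real RGB image
predictor.set_image(image)                        # the expensive encoder pass
masks, scores, _ = predictor.predict(
    point_coords=np.array([[256, 256]]),
    point_labels=np.array([1]),                   # 1 = foreground point
)
```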
https://github.com/ChaoningZhang/MobileSAM was the previous attempt at reducing the size of the large image encoder used by SAM.
It's called EfficientSAM, and it appears to be on par with or better than FastSAM, but did I miss a memory or speed comparison?
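One rough way to answer the speed/memory question yourself: time the image encoder directly. This is only a sketch; it assumes you pass in whichever encoder module you want to compare (e.g. sam.image_encoder from the snippet above, or the EfficientSAM/FastSAM backbones built from their own repos, whose builder names aren't shown here).

```python
# Rough latency / peak-memory harness for any image encoder module.
import time
import torch

def benchmark_encoder(encoder: torch.nn.Module, size: int = 1024, runs: int = 20):
    device = "cuda" if torch.cuda.is_available() else "cpu"
    encoder = encoder.to(device).eval()
    x = torch.randn(1, 3, size, size, device=device)

    with torch.no_grad():
        for _ in range(3):                     # warm-up iterations
            encoder(x)
        if device == "cuda":
            torch.cuda.reset_peak_memory_stats()
            torch.cuda.synchronize()
        start = time.perf_counter()
        for _ in range(runs):
            encoder(x)
        if device == "cuda":
            torch.cuda.synchronize()
    ms = (time.perf_counter() - start) / runs * 1e3
    mem = torch.cuda.max_memory_allocated() / 2**20 if device == "cuda" else float("nan")
    print(f"{ms:.1f} ms/image, peak {mem:.0f} MiB")

# e.g. benchmark_encoder(sam.image_encoder)   # from the snippet above
```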
Can't wait for the "everywhere all at once" function.
Is what?
Excited to play with this more! Forked the repo and added the models into the repo itself (migrated from Dropbox): https://github.com/xetdata/EfficientSAM