I think I'm on a 3060 or so, and it works decently depending on the model. I can generally get away with around 13B, or some 20B+ models at Q4 or so, but they get really slow by that point.
It takes a lot of messing around to find something that performs decently without being so limited that it gets crazy repetitive or says loony things.
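For a rough sense of why 13B feels fine and 20B+ at Q4 starts to crawl, here's a back-of-the-envelope sketch. Everything in it is an assumption on my part, not a measurement: a 12 GB card (the 3060 also ships with less), roughly 4.5 effective bits per weight for a Q4-style quant, and about 1.5 GB set aside for context and buffers. The function names are made up for illustration.

```python
# Rough estimate only: real usage depends on the runtime, quant format,
# and context length. All constants below are assumptions.

VRAM_GB = 12.0      # assumed: 12 GB RTX 3060
BITS_Q4 = 4.5       # assumed: effective bits per weight for a Q4-style quant
OVERHEAD_GB = 1.5   # assumed: KV cache, buffers, and other runtime overhead


def estimate_weights_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate size of the weights alone, in GB (10^9 bytes)."""
    return params_billion * bits_per_weight / 8


def fits_in_vram(
    params_billion: float,
    bits_per_weight: float = BITS_Q4,
    vram_gb: float = VRAM_GB,
    overhead_gb: float = OVERHEAD_GB,
) -> bool:
    """True if weights plus the assumed overhead fit within the card's VRAM."""
    needed = estimate_weights_gb(params_billion, bits_per_weight) + overhead_gb
    return needed <= vram_gb


if __name__ == "__main__":
    for size in (7, 13, 20, 34):
        weights = estimate_weights_gb(size, BITS_Q4)
        verdict = "fits" if fits_in_vram(size) else "spills out of VRAM"
        print(f"{size}B @ Q4: ~{weights:.1f} GB weights, {verdict}")
```

With those numbers, 13B leaves plenty of headroom while 20B lands right at the edge. Once a model doesn't fit, most runtimes push part of it to system RAM and the CPU, which would explain the slowdown.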