This is how rax (https://github.com/antirez/rax) and most of Redis is tested. Indeed this is the only way to be confident about a complex data structure implementation sanity.
> Testing is necessarily incomplete but if you beat the crap out of the subset of the API that you actually intend to use, it’s possible to become reasonably confident that it will work in practice.
This is the kicker for me. I'm not a C++ expert, but knowing that a coverage-guided fuzzer has been running against my code for the past few months gives me a lot of confidence.