The OP said elsewhere they are using this[1] library, which allows you to specify minimum seconds to match, so you'd presumably set it to match 20 seconds or whatever minimum length podcast commercials usually are.
Most other audio fingerprinting libraries I've seen allow you to specify min/max time, as well.