The generalizability of argument quality dimensions in NLP models