For consistency across all system tests, add a setup.sh script to every test that does not have one yet, and ensure that every setup.sh script calls its respective clean.sh script.
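
The pattern can be illustrated with a small, self-contained sketch. It is not taken from the actual test suite: the scratch directory and the file names stale.log and test.conf are hypothetical, and a real setup.sh would do whatever preparation its test needs after calling clean.sh.

```shell
#!/bin/sh
# Sketch only: builds a throwaway test directory to show the pattern.
# The file names stale.log and test.conf are hypothetical.
set -e

demo_dir=$(mktemp -d)

# clean.sh removes whatever a previous run may have left behind.
cat > "$demo_dir/clean.sh" <<'EOF'
#!/bin/sh
rm -f stale.log test.conf
EOF

# setup.sh calls clean.sh first, then prepares the files for this run.
cat > "$demo_dir/setup.sh" <<'EOF'
#!/bin/sh
set -e
cd "$(dirname "$0")"
sh ./clean.sh
echo "placeholder configuration" > test.conf
EOF

# Simulate a leftover from an earlier, interrupted run.
touch "$demo_dir/stale.log"

sh "$demo_dir/setup.sh"

# After setup, the stale file should be gone and the fresh config present.
if [ ! -e "$demo_dir/stale.log" ] && [ -f "$demo_dir/test.conf" ]; then
    result="clean start"
else
    result="stale state"
fi
echo "$result"
```

Calling clean.sh at the top of setup.sh means a test starts from the same state whether or not the previous run finished its own cleanup.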