
Strengthening LLM Security Through Robustness Testing
New framework detects vulnerabilities in LLM-based NLP applications
This research introduces ABFS (Accelerated Boundary-following Search), a testing approach that identifies robustness vulnerabilities in LLM-based NLP software.
- Targets critical weaknesses where small input perturbations cause significant output errors (illustrated in the sketch after this list)
- Outperforms existing methods by systematically exploring decision boundaries in LLM systems
- Focuses on real-world applications, including financial analysis and content moderation
- Carries security implications for deployments in safety-critical scenarios, where LLM failures could have serious consequences
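To make the failure mode concrete, here is a minimal, illustrative sketch of perturbation-based robustness testing: substitute single words in an input and check whether the system's predicted label flips. This is not the ABFS algorithm itself; the synonym table and the `toy_classify` stub are hypothetical placeholders for a real lexical resource and a real call to the LLM under test.

```python
from typing import Callable, Dict, List, Optional

# Hypothetical synonym table standing in for a real lexical resource
# (e.g., WordNet); ABFS's actual transformation rules may differ.
SYNONYMS: Dict[str, List[str]] = {
    "great": ["fine", "decent"],
    "terrible": ["poor", "bad"],
    "movie": ["film", "picture"],
}

def perturbations(text: str) -> List[str]:
    """Yield texts that differ from `text` by a single word substitution."""
    words = text.split()
    variants = []
    for i, w in enumerate(words):
        for syn in SYNONYMS.get(w.lower(), []):
            variants.append(" ".join(words[:i] + [syn] + words[i + 1:]))
    return variants

def find_flip(text: str, classify: Callable[[str], str]) -> Optional[str]:
    """Greedy one-step search: return a minimally perturbed input whose
    predicted label differs from the original's, or None if none is found."""
    original = classify(text)
    for variant in perturbations(text):
        if classify(variant) != original:
            return variant
    return None

if __name__ == "__main__":
    # Stub classifier standing in for an LLM call; a real harness would
    # send `text` to the system under test and parse its label.
    def toy_classify(text: str) -> str:
        return "positive" if "great" in text else "negative"

    flip = find_flip("a great movie", toy_classify)
    print("robustness violation:" if flip else "no flip found:", flip)
```

A real harness would replace the stub with queries to the deployed model and, in the spirit of ABFS, prioritize candidate perturbations near the decision boundary to reduce the number of queries needed to expose a failure.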
As organizations increasingly deploy LLMs in sensitive domains, this research provides a practical way to verify system reliability and guard against subtle but dangerous vulnerabilities.