One test battery was about fake credit cards. A woman-in-tech role-play was denied assistance just as a one-armed stamp collector (unless Gen-Z language markers were used). A role that did sometimes get assistance was a Principal Software Engineer, particularly if Gen-Z language markers were included.
I did try German language, but not "Nazi" specifically. German or French did lower refusals, but it was uneven. I spent quite some effort to confirm the identity-based causation inspired by the original post, but couldn't. Taken together with other winning contributions at the hackathon, my theory is that alignment tuning was simply insufficient across the board.