Deep Value Benchmark: Measuring Whether Models Generalize Deep values or Shallow Preferences | Read Paper on Bytez