Technical Characteristics

  • Quantization: 16 bit (FP16)
  • Context Window: 128k tokens
  • Generation Speed: 15% faster than predecessor