Text, image, or audio input to the model, used to generate a response. Can also contain previous assistant responses.