For Video objects, the current AI does a great job capturing the correct words spoken and converting them into transcription, however it really, REALLLLLLY struggles with punctuation and capitalization. A typical 3-4 minute video yields a transcri...