CHANGELOG.md 3.0 KB

CHANGELOG

v20230314

  • abort find_alignment on empty input (#1090)
  • Fix truncated words list when the replacement character is decoded (#1089)
  • fix github language stats getting dominated by jupyter notebook (#1076)
  • Fix alignment between the segments and the list of words (#1087)
  • Use tiktoken (#1044)

v20230308

  • kwargs in decode() for convenience (#1061)
  • fix all_tokens handling that caused more repetitions and discrepancy in JSON (#1060)
  • fix typo in CHANGELOG.md

v20230307

  • Fix the repetition/hallucination issue identified in #1046 (#1052)
  • Use triton==2.0.0 (#1053)
  • Install triton in x86_64 linux only (#1051)
  • update setup.py to specify python >= 3.8 requirement

v20230306

  • remove auxiliary audio extension (#1021)
  • apply formatting with black, isort, and flake8 (#1038)
  • word-level timestamps in transcribe() (#869)
  • Decoding improvements (#1033)
  • Update README.md (#894)
  • Fix infinite loop caused by incorrect timestamp tokens prediction (#914)
  • drop python 3.7 support (#889)

v20230124

  • handle printing even if sys.stdout.buffer is not available (#887)
  • Add TSV formatted output in transcript, using integer start/end time in milliseconds (#228)
  • Added --output_format option (#333)
  • Handle XDG_CACHE_HOME properly for download_root (#864)
  • use stdout for printing transcription progress (#867)
  • Fix bug where mm is mistakenly replaced with hmm in e.g. 20mm (#659)
  • print '?' if a letter can't be encoded using the system default encoding (#859)

v20230117

The first versioned release available on PyPI