Search results

  1. Sep 21, 2022 · Whisper is an open-source system, trained on a large and diverse web dataset, that can transcribe and translate speech in multiple languages. It uses a simple encoder-decoder Transformer architecture and outperforms existing models on zero-shot tasks.

  2. May 26, 2023 · Whisper, a free-to-use transcription tool from the US AI start-up OpenAI, which made a splash with its AI chatbot ChatGPT, has been available since September 2022. The open-source...

    • Overview
    • Approach
    • Setup
    • Available models and languages
    • Command-line usage
    • Python usage
    • More examples
    • License

    [Blog] [Paper] [Model card] [Colab example]

    Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multitasking model that can perform multilingual speech recognition, speech translation, and language identification.

    A Transformer sequence-to-sequence model is trained on various speech processing tasks, including multilingual speech recognition, speech translation, spoken language identification, and voice activity detection. These tasks are jointly represented as a sequence of tokens to be predicted by the decoder, allowing a single model to replace many stage...

    We used Python 3.9.9 and PyTorch 1.10.1 to train and test our models, but the codebase is expected to be compatible with Python 3.8-3.11 and recent PyTorch versions. The codebase also depends on a few Python packages, most notably OpenAI's tiktoken for their fast tokenizer implementation. You can download and install (or update to) the latest release of Whisper with the following command:

    Alternatively, the following command will pull and install the latest commit from this repository, along with its Python dependencies:

    To update the package to the latest version of this repository, please run:

    It also requires the command-line tool ffmpeg to be installed on your system, which is available from most package managers:
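
The setup requirements above (a compatible Python version, the Whisper package, and ffmpeg) can be checked before running anything. The following is a minimal sketch, not part of Whisper itself; the function name `check_whisper_environment` and the keys of the returned dictionary are assumptions chosen for illustration.

```python
import importlib.util
import shutil
import sys


def check_whisper_environment() -> dict:
    """Report whether the prerequisites named in the setup notes look satisfied.

    Hypothetical helper: the name and the result keys are illustrative only.
    """
    return {
        # The text states the codebase is expected to work with Python 3.8-3.11.
        "python_in_expected_range": (3, 8) <= sys.version_info[:2] <= (3, 11),
        # ffmpeg must be discoverable on PATH for audio decoding.
        "ffmpeg_on_path": shutil.which("ffmpeg") is not None,
        # Only checks that some module named "whisper" can be imported;
        # it does not prove that it is OpenAI's package.
        "whisper_importable": importlib.util.find_spec("whisper") is not None,
    }


if __name__ == "__main__":
    for name, ok in check_whisper_environment().items():
        print(f"{name}: {'ok' if ok else 'missing'}")
```

A newer Python version may still work; the range check only mirrors the versions named in the text.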

    There are five model sizes, four with English-only versions, offering speed and accuracy tradeoffs. Below are the names of the available models and their approximate memory requirements and inference speed relative to the large model; actual speed may vary depending on many factors including the available hardware.

    The .en models for English-only applications tend to perform better, especially for the tiny.en and base.en models. We observed that the difference becomes less significant for the small.en and medium.en models.
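
The naming scheme described above (five sizes, four of which have an English-only `.en` variant) can be captured in a small helper. This is a sketch under assumptions: the function `model_name` is hypothetical, and falling back to the multilingual model when no `.en` variant exists is a design choice made here, not documented behavior.

```python
# Model sizes as named in the text; "large" has no English-only variant.
SIZES = ("tiny", "base", "small", "medium", "large")
ENGLISH_ONLY_SIZES = ("tiny", "base", "small", "medium")


def model_name(size: str, english_only: bool = False) -> str:
    """Return the model name for a size, preferring the .en variant if requested.

    Hypothetical helper: falls back to the multilingual name when the
    requested size has no English-only version.
    """
    if size not in SIZES:
        raise ValueError(f"unknown model size: {size!r}")
    if english_only and size in ENGLISH_ONLY_SIZES:
        return f"{size}.en"
    return size


print(model_name("tiny", english_only=True))   # tiny.en
print(model_name("large", english_only=True))  # large
```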

    The following command will transcribe speech in audio files, using the medium model:

    The default setting (which selects the small model) works well for transcribing English. To transcribe an audio file containing non-English speech, you can specify the language using the --language option:

    Adding --task translate will translate the speech into English:

    Run the following to view all available options:
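
The options mentioned in this section (a model choice, `--language`, and `--task translate`) can also be assembled programmatically. The sketch below builds an argument list for the `whisper` command; the helper `build_whisper_command` and the file names `audio.mp3` and `japanese.wav` are assumptions for illustration, and nothing is executed unless the list is passed to `subprocess.run`.

```python
from typing import List, Optional, Sequence


def build_whisper_command(
    audio_files: Sequence[str],
    model: Optional[str] = None,
    language: Optional[str] = None,
    task: Optional[str] = None,
) -> List[str]:
    """Assemble an argv list for the whisper CLI (hypothetical helper)."""
    if not audio_files:
        raise ValueError("at least one audio file is required")
    cmd = ["whisper", *audio_files]
    if model is not None:
        cmd += ["--model", model]
    if language is not None:
        cmd += ["--language", language]
    if task is not None:
        if task not in ("transcribe", "translate"):
            raise ValueError(f"unsupported task: {task!r}")
        cmd += ["--task", task]
    return cmd


# Transcribe with the medium model (file name is a placeholder).
print(build_whisper_command(["audio.mp3"], model="medium"))
# Translate non-English speech into English.
print(build_whisper_command(["japanese.wav"], language="Japanese", task="translate"))
```

To run a command, pass the list to `subprocess.run(cmd, check=True)`. The list `["whisper", "--help"]` prints the available options.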

    Transcription can also be performed within Python:
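
A minimal sketch of Python usage, assuming the `openai-whisper` package and ffmpeg are installed. The wrapper `transcribe_file` and the file name `audio.mp3` are hypothetical; `whisper.load_model` and `model.transcribe` are the package's loading and transcription calls.

```python
def transcribe_file(path: str, model_name: str = "base") -> str:
    """Transcribe one audio file and return the recognized text.

    Hypothetical wrapper. The import is deferred so that this module can be
    loaded even where Whisper is not installed.
    """
    import whisper  # provided by the openai-whisper package

    model = whisper.load_model(model_name)  # downloads the weights on first use
    result = model.transcribe(path)         # returns a dict with the text and segments
    return result["text"]


if __name__ == "__main__":
    # "audio.mp3" is a placeholder; replace it with a real file.
    print(transcribe_file("audio.mp3"))
```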

    Internally, the transcribe() method reads the entire file and processes the audio with a sliding 30-second window, performing autoregressive sequence-to-sequence predictions on each window.
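
The windowing idea can be illustrated with a short sketch. It is a simplification and not Whisper's actual code: it assumes fixed, non-overlapping 30-second steps, whereas the real implementation may advance through the audio differently. The function `window_bounds` is hypothetical.

```python
from typing import Iterator, Tuple


def window_bounds(
    duration_s: float, window_s: float = 30.0
) -> Iterator[Tuple[float, float]]:
    """Yield (start, end) times in seconds that cover the audio in fixed windows.

    Simplified illustration: each window starts where the previous one ended,
    and the final window is truncated at the end of the audio.
    """
    if window_s <= 0:
        raise ValueError("window_s must be positive")
    start = 0.0
    while start < duration_s:
        end = min(start + window_s, duration_s)
        yield (start, end)
        start = end


# A 70-second recording is covered by three windows.
for start, end in window_bounds(70.0):
    print(f"{start:5.1f}s to {end:5.1f}s")
```

In this sketch each window would be decoded on its own, one token at a time, before moving on to the next.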

    Please use the 🙌 Show and tell category in Discussions for sharing more example usages of Whisper and third-party extensions such as web demos, integrations with other tools, ports for different platforms, etc.

    Whisper's code and model weights are released under the MIT License. See LICENSE for further details.

    Whisper is a Transformer-based model that can perform multilingual speech recognition, speech translation, and language identification. It is trained on a large dataset of diverse audio and can be installed and used with Python and ffmpeg.

  3. May 30, 2023 · OpenAI Whisper converts your voice to text on Windows 11/10 devices. Because the program is developed by OpenAI, it should be clear that artificial intelligence is central to its capabilities.

  4. Whisper is an online community that lets millions of users worldwide share their thoughts with others, exchange advice, and catch up on the latest gossip...

    • (244.3K)
    • Ages 13 and up
  5. Jan 29, 2023 · The Whisper neural network is an automatic speech recognition system trained on 680,000 hours of multilingual audio data. Its transcription capabilities are especially convincing.

  6. Sep 22, 2022 · The Whisper speech recognition system is designed to identify languages, translate speech into English, and transcribe recordings. Five free variants are available on GitHub.
