Speech
Upload a file to MediaCatch Speech API and get the results
Uploads a file to MediaCatch Speech API.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
fpath
|
str | Path
|
Path to the file to upload. |
required |
api_key
|
str
|
API key for the vision API. Defaults to None. |
None
|
quota
|
str
|
The quota to bill transcription hours from. Can be None if the user only has one quota. Defaults to None. |
None
|
fallback_language
|
str
|
Overrides the language to transcribe in if language identification fails. If None, uses the default language of the quota. Defaults to None. |
None
|
max_threads
|
int
|
Number of maximum threads. Defaults to 5. |
5
|
max_request_retries
|
int
|
Number of maximum retries for request. Defaults to 3. |
3
|
request_delay
|
float
|
Delay between request retries. Defaults to 0.5. |
0.5
|
chunk_size
|
_type_
|
Size of each chunk to upload. Defaults to 10010241024. |
100 * 1024 * 1024
|
url
|
str
|
URL of the MediaCatch Speech API. Defaults to 'https://s2t.mediacatch.io/api/v2'. |
'https://s2t.mediacatch.io/api/v2'
|
compress_input
|
bool
|
Compress the input file to OGG format (Requires FFMPEG >= 6.1). Defaults to False. |
False
|
sample_rate
|
int
|
Sample rate of the audio file. Defaults to 16000. |
16000
|
verbose
|
bool
|
Show verbose output. Defaults to True. |
True
|
Returns:
Name | Type | Description |
---|---|---|
str |
str
|
File ID of the uploaded file. |
Wait for result from a URL.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
file_id
|
str
|
The file ID to get the result from. |
required |
url
|
str
|
The URL to get the result from. |
'https://s2t.mediacatch.io/api/v2'
|
timeout
|
int
|
Timeout for waiting in seconds. Defaults to 3600. |
3600
|
delay
|
int
|
Delay between each request. Defaults to 10. |
10
|
verbose
|
bool
|
Show verbose output. Defaults to True. |
True
|
Returns:
Type | Description |
---|---|
dict[str, Any] | None
|
dict[str, Any] | None: Dictionary with the result from the URL or None if failed. |