sign_language_translator.vision.utils module

This module provides utility functions for video processing.

sign_language_translator.vision.utils.iter_frames_with_opencv(path: str) → Generator[ndarray[Any, dtype[uint8]], None, None][source]

Extracts individual frames from a video file or an image file.

This function reads a video file using opencv and extracts its frames as numpy arrays. It can also read an image file and treat it as a single frame video.

Parameters:: path (str) – The path to the video or image file.
Yields:: NDArray[np.uint8] – numpy arrays representing frames from the video with shape: (height, width, color_channels).
Raises:: FileNotFoundError – If the video file is not found or cannot be opened.

sign_language_translator.vision.utils.read_frames_with_opencv(path: str) → List[ndarray[Any, dtype[uint8]]][source]

Extracts individual frames from a video file or an image file.

This function reads a video file using opencv and extracts its frames as numpy arrays. It can also read an image file and treat it as a single frame video.

Parameters:: path (str) – The path to the video or image file.
Returns:: A list of numpy arrays, each representing a frame from the video.
Return type:: List[NDArray]
Raises:: FileNotFoundError – If the video file is not found or cannot be opened.