AdaptDL: Real-Time Object Detection System with Voice Commands

AdaptDL is a Python-based real-time object detection system using deep learning with voice commands for surveillance applications.

The system detects target objects from a webcam in real-time using a deep learning model, and overlays bounding boxes and labels on the detected objects in the video stream. It features user-input voice commands and adapts to user preferences over time based on voice command history and surrounding environments.

Features

Real-time Detection: Detects objects in live video from a webcam.
Bounding Boxes and Labels: Overlays bounding boxes and labels on the detected objects.
Voice Command Control: Users can use voice commands to control the system.
Adaptability: Adapts to user preferences and improves based on input command history and environments.
Web Version: A browser-based demo for real-time object detection.

Requirements

Install the following dependencies:

Python >= 3.11
Tensorflow >= 2.18
OpenCV (video capturing and processing)
SpeechRecognition (voice command processing)
PyAudio (microphone setup)

Installation

To install and run this project, follow these steps:

Clone the repository:

git clone https://github.com/OCR-tech/AdaptDL.git
cd AdaptDL

Create a virtual environment:

python -m venv .venv
.\.venv\Scripts\Activate

Install the dependencies:
```
pip install -r requirements.txt
```
Run the application:
```
python app/main.py
```

Usage

To run this project:

Run the application.
The system will launch the webcam and start detecting objects in real-time.
Use voice commands or keyboard shortcuts to interact with the system.

The system adapts to user preferences over time based on input voice command history, improving accuracy and user experience.

Web Version

The web version is a browser-based demo for real-time object detection. It supports:

Webcam Input: Use built-in webcam or external USB webcam.
Video File Input: Upload video files for detection.
IP Camera Input: Connect to IP cameras or mobile devices.
Internet Video Streams: Stream video from compatible URLs.

Requirements:

Modern web browser (Chrome, Edge, Firefox, Safari)
Webcam access permission
Must be served via Localhost or HTTPS

How to run:

Open a terminal and navigate to app/web.
Run a local server: python -m http.server 8000.
Open your browser: http://localhost:8000 or http://<ip-address>:8000/.
Click Start to begin real-time object detection.

Note: For low-cost solutions, unused/broken screen mobile phones can be used as a webcam for real-time object detection.

Name		Name	Last commit message	Last commit date
Latest commit History 221 Commits
app		app
docs		docs
.gitignore		.gitignore
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AdaptDL: Real-Time Object Detection System with Voice Commands

Features

Requirements

Installation

Usage

Web Version

Contributing

License

Contact

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

AdaptDL: Real-Time Object Detection System with Voice Commands

Features

Requirements

Installation

Usage

Web Version

Contributing

License

Contact

About

Resources

License

Code of conduct

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages