Huffman Coding Compression

This project implements a simple Huffman coding algorithm for compressing and decompressing text files. Huffman coding is a popular method for lossless data compression, where frequently occurring characters are represented by shorter binary codes, thus reducing the overall file size.
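To make the size saving concrete, the sketch below compares a fixed 8-bit encoding with a variable-length prefix code for the string "aaaabbc". The code table is a hand-picked assumption for illustration, not output produced by this project.

```python
# Illustrative only: a hand-picked prefix code for a tiny example string.
text = "aaaabbc"
codes = {"a": "0", "b": "10", "c": "11"}  # assumed table; the most frequent character gets the shortest code

fixed_bits = 8 * len(text)                          # one byte per character
huffman_bits = sum(len(codes[ch]) for ch in text)   # variable-length codes

print(fixed_bits, huffman_bits)  # prints: 56 10
```

In practice the saving is smaller than this, because a decoder also needs some way to obtain the code table, and that adds overhead.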

Features

Compress text files into a binary format (.bin)
Decompress binary files back to their original text format
Error handling for empty input files

Requirements

Python 3.x

Installation

Clone this repository:
```
git clone https://github.com/Nikson2003/huffman-coding.git
```
Navigate into the project directory:
```
cd huffman-coding
```
Ensure you have Python 3 installed. You can check your version with the command below (on some systems the interpreter is invoked as python3):
```
python --version
```

Usage

Place the text file you want to compress in the project directory and update the path variable in main.py to point to your text file.
```
path = "testing.txt"
```
Run the main.py script:
```
python main.py
```
After the script runs, the compressed file is written to the same directory with a .bin extension, and the decompressed text file has _decompressed appended to the original base name. For example, testing.txt produces testing.bin and testing_decompressed.txt.
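The naming convention above can be sketched with os.path.splitext. The helper name output_paths is hypothetical and is not taken from main.py; it only shows how the two output names relate to the input path.

```python
import os

def output_paths(path):
    """Return (compressed_path, decompressed_path) for an input text file.

    Hypothetical helper: it mirrors the naming described in this README,
    not necessarily how main.py computes it.
    """
    base, ext = os.path.splitext(path)
    return base + ".bin", base + "_decompressed" + ext

print(output_paths("testing.txt"))  # ('testing.bin', 'testing_decompressed.txt')
```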

How It Works

Frequency Dictionary: The algorithm first counts how often each character occurs in the input text.
Heap Creation: A min-heap of nodes is built, keyed on character frequency.
Huffman Tree Construction: The two lowest-frequency nodes are repeatedly merged until a single binary tree remains; each leaf node represents a character.
Code Generation: Each character's binary code is read off the path from the root to its leaf.
Encoding: The input text is encoded into a binary string and padded so that its length is a multiple of 8 bits.
Compression: The padded binary string is converted into a byte array and written to a binary file.
Decompression: The binary file is read and decoded to restore the original text.
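The steps above can be sketched end to end as follows. This is a minimal illustration rather than the contents of huffman.py. It rests on four assumptions:

- The function names build_codes, compress, and decompress are hypothetical.
- The pad count is stored in the first byte, because this README does not say how padding is recorded.
- The code table is passed to the decoder in memory rather than stored in the file.
- Empty input raises ValueError.

```python
import heapq
from collections import Counter
from itertools import count

def build_codes(text):
    """Map each character to its Huffman code (a string of '0' and '1')."""
    if not text:
        raise ValueError("input is empty")  # assumed behaviour for empty input
    freq = Counter(text)                    # frequency dictionary
    tie = count()                           # tie-breaker so the heap never compares trees
    # Heap entries are (frequency, tie_breaker, tree); a tree is a char or a (left, right) pair.
    heap = [(f, next(tie), ch) for ch, f in freq.items()]
    heapq.heapify(heap)
    while len(heap) > 1:                    # merge the two least frequent nodes
        f1, _, left = heapq.heappop(heap)
        f2, _, right = heapq.heappop(heap)
        heapq.heappush(heap, (f1 + f2, next(tie), (left, right)))
    codes = {}
    def walk(node, prefix):
        if isinstance(node, tuple):         # internal node: descend into both children
            walk(node[0], prefix + "0")
            walk(node[1], prefix + "1")
        else:                               # leaf: record the path as the code
            codes[node] = prefix or "0"     # single-symbol input still needs one bit
    walk(heap[0][2], "")
    return codes

def compress(text):
    """Return (compressed_bytes, codes) for the given text."""
    codes = build_codes(text)
    bits = "".join(codes[ch] for ch in text)
    pad = (8 - len(bits) % 8) % 8           # bits needed to reach a byte boundary
    bits = f"{pad:08b}" + bits + "0" * pad  # assumed layout: first byte stores the pad count
    data = bytes(int(bits[i:i + 8], 2) for i in range(0, len(bits), 8))
    return data, codes

def decompress(data, codes):
    """Invert compress(), given the same code table."""
    bits = "".join(f"{byte:08b}" for byte in data)
    pad = int(bits[:8], 2)
    bits = bits[8:len(bits) - pad]          # drop the header byte and trailing padding
    reverse = {code: ch for ch, code in codes.items()}
    out, current = [], ""
    for bit in bits:
        current += bit
        if current in reverse:              # prefix-free codes make this unambiguous
            out.append(reverse[current])
            current = ""
    return "".join(out)

text = "abracadabra"
data, codes = compress(text)
assert decompress(data, codes) == text      # round trip restores the input
```

The tie-breaking counter keeps heap comparisons well defined when two nodes share a frequency. Without it, Python would try to compare a character with a tuple and raise a TypeError.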
