Java TreeSet Tutorial | Sorted Set Implementation & Examples

1. The Engine: Red-Black Trees

While HashSet uses buckets and hash codes, TreeSet uses a Self-Balancing Binary Search Tree known as a Red-Black Tree. This ensures that the tree never becomes "lopsided," keeping the height of the tree minimal for faster searching.

How it stores data

Every time you add an element, Java compares it to the existing nodes. If it's smaller, it goes left; if larger, it goes right. It then "re-balances" itself to stay efficient.

The Logarithmic Edge

Search, Add, and Remove operations take O(log n) time. While slower than HashSet's O(1), it stays consistently fast even as the dataset grows into the millions.

2. NavigableSet: Moving Beyond Simple Sets

Because the data is sorted, TreeSet provides powerful "Navigation" methods that are impossible in a HashSet. You can "peek" at neighbors or find the closest match to a value.

Method	Functionality
`first() / last()`	Returns the lowest and highest elements.
`lower(e) / higher(e)`	Returns the element strictly less than or greater than 'e'.
`floor(e) / ceiling(e)`	Returns the element less than/equal to or greater than/equal to 'e'.
`pollFirst() / pollLast()`	Retrieves and removes the first/last element (Great for priority tasks).

3. The Requirement: Comparable or Comparator

A TreeSet cannot function if it doesn't know how to compare two objects. If you try to add a custom object that doesn't implement the Comparable interface, Java will throw a ClassCastException.

Pro Tip: Unlike HashSet, a TreeSet does not use hashCode() or equals() to detect duplicates. It uses the compareTo() (or compare()) method. If compareTo returns 0, the TreeSet considers the elements identical and won't add the new one.

4. Performance Comparison: Why choose TreeSet?

You should only use TreeSet when the order is a requirement. If you just need uniqueness, HashSet is nearly 5-10 times faster.

HashSet: Best for high-speed lookups where order doesn't matter.
LinkedHashSet: Best when you need to remember the order in which items arrived.
TreeSet: Best when you need the items to be sorted alphabetically or numerically at all times.

5. Mastery Code Example: Smart Stock Price Tracker

This example shows how a TreeSet can be used to track stock prices and quickly find "the highest price below $100."

            import java.util.TreeSet;

            public class PriceTracker {

              public static void main(String[] args) {

                TreeSet<Double> prices = new TreeSet<>();

                prices.add(150.50);

                prices.add(85.00);

                prices.add(102.30);

                prices.add(45.90);

                // Automatic Sorting

                System.out.println("All Prices: " + prices); // [45.9, 85.0, 102.3, 150.5]

                // Range Search: Find highest price under 100

                Double target = prices.floor(100.00);

                System.out.println("Highest budget-friendly: " + target); // 85.0

                // Lowest price

                System.out.println("Starting price: " + prices.first());

              }

            }

6. Memory Overhead

A TreeSet is memory-intensive. Every element is wrapped in a TreeMap.Entry object, which contains four references (left, right, parent, and value) and a boolean for the color (Red or Black). If memory is a tight constraint, consider using a sorted array or ArrayList and Collections.sort() only when needed.

7. Dealing with Nulls (The Big Warning)

A TreeSet does not allow null elements. Because it needs to call compareTo() on every element to find its position, adding a null will result in a NullPointerException immediately. This is a major difference compared to HashSet.

8. Interview Preparation: The TreeSet Deep-Dive

Q: How does TreeSet handle duplicates?
A: It uses the return value of compare() or compareTo(). If the method returns 0, the element is considered a duplicate and is discarded.

Q: Is TreeSet thread-safe?
A: No. Like most of the Collections Framework, it is not synchronized. For thread-safety, use Collections.synchronizedSortedSet(new TreeSet(...)) or ConcurrentSkipListSet.

Q: What is the time complexity of the contains() method in TreeSet?
A: It is $O(\log n)$ because it follows a path from the root to a leaf in a balanced tree.

Final Verdict

The TreeSet is the professional’s choice for maintaining sorted, unique datasets. While it carries more memory and performance overhead than a HashSet, the added power of the NavigableSet interface makes it indispensable for applications like scheduling, ranking systems, and range-based data analysis. Use it when order isn't just a preference, but a requirement.

Next: Mastering the Map Interface →

Java TreeSet: The Master of Sorted Uniqueness