Optuna Is Still The HPO King (Yes, Even In 2026)

Let me set the scene: I spent last Tuesday fighting with a “self-optimizing” LLM agent that promised to tune my hyperparameters automatically. It was supposed to be the future. “Just describe your data,” they said. “The AI handles the rest,” they said.

Two hours and $45 in API credits later, the agent confidently handed me a learning rate of 0.0 and a batch size of 1. It hallucinated the config. Great.

So I killed the process, opened my terminal, and went back to the tool I should have used from the start: Optuna. Even now, in February 2026, with all the generative noise out there, Preferred Networks’ framework remains the gold standard for hyperparameter optimization (HPO). It’s not flashy. It doesn’t try to chat with you. It just finds the best parameters without burning down your GPU cluster.

Why “Define-by-Run” Still Wins

Here’s the thing about HPO tools. Most of them force you to learn a weird, domain-specific language (DSL) or define your search space in a massive JSON blob that’s impossible to debug. I hate that. If I can’t debug it with a simple print() statement, I don’t want it in my production pipeline.

Optuna’s “define-by-run” philosophy is the antidote. You write Python. That’s it. You define the search space inside the objective function. It’s dynamic. It’s messy in the way real code is messy, and that makes it beautiful.

I was tuning a RAG (Retrieval-Augmented Generation) pipeline recently, trying to balance chunk size against the retrieval k count. With other tools, conditional logic is a nightmare. “If chunk size is > 500, check parameter B, else check parameter C.” In a static config file? Good luck.

In Optuna, it’s just an if statement. Look at this snippet I ran yesterday on Python 3.13:

import optuna
from my_rag_pipeline import evaluate_retrieval

def objective(trial):
    # Dynamic search space definition
    chunk_size = trial.suggest_int('chunk_size', 128, 1024, step=64)
    
    # Conditional logic that just works
    if chunk_size > 512:
        overlap = trial.suggest_int('overlap', 64, 128)
    else:
        overlap = trial.suggest_int('overlap', 16, 48)
        
    embedding_model = trial.suggest_categorical('model', ['bge-m3', 'gte-large-en'])
    
    score = evaluate_retrieval(chunk_size, overlap, embedding_model)
    return score

# The magic part
study = optuna.create_study(direction='maximize')
study.optimize(objective, n_trials=100)

See that conditional logic? It just works. No schema validation errors. No YAML indentation nightmares. You just write code.

The Pruning Capability is a Wallet Saver

Let’s be real about compute costs. We aren’t all sitting on a hoard of H100s. I do most of my dev work on a local rig with a single GPU, and waiting for 100 trials to finish is agony if 80 of them are doomed from the start.

This is where Optuna’s pruning API saves my bacon. It monitors intermediate results and kills the bad trials early. I ran a test last week comparing a standard grid search against Optuna’s TPE (Tree-structured Parzen Estimator) sampler with the Hyperband pruner enabled.

The setup: Fine-tuning a small 3B parameter model.

The result: The grid search wasted 14 hours. Optuna found a better configuration in 3 hours and 42 minutes. It pruned 65% of the trials before they even hit the halfway mark. That’s not just time; that’s electricity bill money.

A Real-World Gotcha: The SQLite Lock

Well, it’s not all sunshine and rainbows. I need to warn you about something that bit me hard a few years ago and is still a trap for new users.

The easy path for persistent storage is SQLite: hand Optuna a sqlite:/// URL and you’re done. That’s fine for running on your laptop. But the second you try to scale this up (say, running distributed optimization across three different worker nodes), SQLite falls apart under concurrent writes. I kept getting OperationalError: database is locked exceptions crashing my workers at 2 AM.


If you are serious about production HPO, do not use the default file-based storage for distributed runs. Spin up a Postgres container. It takes five minutes.

# Don't do this for distributed:
optuna-dashboard sqlite:///db.sqlite3

# Do this instead (PostgreSQL 16+ recommended):
optuna-dashboard postgresql://user:pass@localhost/optuna_db

Switching to Postgres solved my concurrency issues instantly. I can now run 50 parallel workers hammering the database without a single lock error. It’s a simple infrastructure change that makes the tool robust enough for heavy enterprise workloads.

Why Not Just Use “Auto-Everything”?

There’s a trend right now to abstract everything away. “AutoML” platforms promise to take your CSV and give you a deployed API. They’re great for generic tabular data. But for the weird stuff? The custom loss functions? The multi-modal pipelines we’re building in 2026?

They break. They lack the flexibility to optimize a metric that isn’t just “accuracy” or “F1 score.”

I recently had to optimize a pipeline where the metric was a composite score of “inference latency” vs. “toxicity check pass rate.” Try explaining that trade-off to a black-box AutoML tool. With Optuna, I just wrote a custom objective returning a tuple for multi-objective optimization. It plotted the Pareto front for me, and I could literally point to the spot on the graph where speed met safety.

Final Thoughts

Optuna isn’t the “new kid” anymore. It’s the boring, reliable veteran. And honestly? In a tech stack that changes every six weeks, I crave boring. I want tools that respect my Python code, handle failures gracefully, and don’t try to outsmart me with an opaque AI layer.

If you’re still manually tuning learning rates like a caveman, or trusting a hallucinating agent to do it for you, stop. Install Optuna. Your GPU will thank you.

Questions readers ask

Why does Optuna use define-by-run instead of a config file for hyperparameter search?

Optuna’s define-by-run philosophy lets you define the search space in Python inside the objective function, rather than a static JSON or YAML config. This makes conditional logic trivial, such as choosing different overlap ranges when chunk_size exceeds 512. You can debug it with a simple print() statement, avoid schema validation errors, and handle dynamic, messy real-world search spaces without learning a domain-specific language.

How much faster is Optuna’s TPE sampler with Hyperband pruning than grid search?

In a test fine-tuning a 3B parameter model, grid search ran for 14 hours, while Optuna’s TPE sampler with the Hyperband pruner found a better configuration in 3 hours and 42 minutes. The pruner killed 65% of trials before they reached the halfway mark by monitoring intermediate results, saving significant GPU time and electricity costs on single-GPU local rigs.

How do I fix Optuna’s ‘database is locked’ error when running distributed trials?

The error comes from Optuna’s default SQLite storage, which cannot handle concurrent writes from multiple worker nodes. Replace it with PostgreSQL 16+ by spinning up a Postgres container and pointing optuna-dashboard at a postgresql:// URL instead of sqlite:///db.sqlite3. This eliminates OperationalError lock crashes and lets you run 50 parallel workers hammering the database without issues.

Can Optuna optimize for multiple competing metrics like latency and safety?

Yes, Optuna supports multi-objective optimization by returning a tuple from your objective function. The author used this to balance inference latency against a toxicity check pass rate, a composite trade-off that black-box AutoML tools cannot handle. Optuna plotted the Pareto front, making it possible to visually identify the configuration where speed met safety, which generic accuracy-focused AutoML platforms lack the flexibility to optimize.