What is regression testing?

Regression testing is the repeatable verification that changes in the code (a new feature, a fix, a refactoring) have not broken functionality that previously worked. Its goal is not to verify the new code, but to protect what already worked.

When should you run regression tests?

After every significant change in the code: after adding a feature, after fixing a bug, before a release, and after a production hotfix. In practice, selective regression runs on every pull request, and full regression runs before a release.

Do regression tests have to be automated?

Not all of them, but stable and frequently repeated regression scenarios are the best candidates for automation. Manually repeating hundreds of tests for every version is expensive and error-prone. Exploratory testing and UX assessment are left to experts.

How does full regression differ from selective regression?

Full regression runs the entire test suite and provides the highest confidence, but it takes a long time. Selective regression covers only the areas related to the change — it is fast and suited to frequent runs, at the cost of narrower coverage.

Regression Testing: When and How | ARDURA

Regression testing is the repeatable verification that the changes introduced in the code have not broken functionality that worked before. It is not about verifying the new code — that is what functional tests are for — but about protecting the entire rest of the application from the unintended effects of a change. The older and more elaborate a system is, the easier it is for a single, seemingly harmless fix to trigger a bug in a completely different module. Regression testing is the safety net that catches this scenario before the user does.

What regression in software actually is

Regression is a situation in which functionality that worked correctly stops working after a change is introduced. The cause can be a new feature, fixing another bug, a library update, a configuration change, or a refactoring. The mechanism is always the same: dependencies in the code are denser than they appear, and a modification in one place propagates further than the author of the change assumed.

Regression tests address exactly this risk. You run a set of scenarios that previously passed and check whether they still pass. If one of them starts failing — you have a regression and a concrete lead on where to look for the cause. This is what distinguishes them from smoke tests, which only check whether the application starts up at all, and from acceptance tests, which verify new requirements.

It is also worth distinguishing functional regression from visual and performance regression. The first watches over logic — whether a button still does what it should. The second catches unintended changes in the look of the interface; the third — performance drops after a seemingly cosmetic change. In most projects, the starting point is functional regression, with the others added wherever the risk demands it.

When to run regression tests

The simplest answer is: after every significant change in the code. In practice, it is worth highlighting several moments in which regression is mandatory:

After adding a new feature. New code shares state, the database, and components with the rest of the system. Regression confirms that adding something has not subtracted something else.
After fixing a bug. Fixes can be treacherous — by patching one case, it is easy to break the neighboring one. After every bugfix, regression should cover the area around the change.
Before a release. This is the moment for full regression. Before the code reaches users, you want to be sure that all critical business paths still work.
After a production hotfix. Fixes made under time pressure are particularly risky. Even if the hotfix puts out the fire, regression on the key flows protects against triggering a second one.
After a dependency or environment update. Changing the version of a framework, database, or library can quietly change the application’s behavior.

The rule is simple: the closer to production and the greater the risk of the change, the broader the scope of regression should be.

Full vs. selective regression

This is one of the most important decisions in the entire testing strategy. You cannot run the complete test suite after every minor change — with a large system it would take hours. Hence the split into two modes.

Full regression covers the entire test suite. It provides the highest confidence and detects side effects in any module, but it is costly in terms of time. It works well before a release, after a large refactoring, or after changes touching the core of the application.

Selective regression covers only the areas related to the change — the module that was modified and its nearest dependencies. It is fast, so it can be run frequently, for example on every pull request. The cost is narrower coverage: if you misjudge which areas are related, regression may miss a bug.

In a mature process, these two modes coexist: selective regression runs continuously and provides fast feedback, while full regression stands as a quality gate before a release. The accuracy of selective regression depends on how well you understand the dependency map of the system. Change analysis (which files a commit touched), defect history, and code coverage data all help with this. Where this knowledge is lacking, it is safer to broaden the scope than to rely on intuition.

How to select cases for regression

A regression suite should not be a catch-all bag for every test ever written — because then it grows endlessly, takes a long time to run, and nobody wants to maintain it. Selecting cases is prioritization work. A regression suite should include, above all:

Critical business paths — login, payment, placing an order, in other words the things whose failure means a real loss.
High-frequency-of-use functionality — what the majority of users rely on every day.
Historically bug-prone areas — modules in which regressions have already occurred tend statistically to break again.
Places with dense dependencies — fragments of code that many other components rely on.
Functions covered by legal or security requirements — where the cost of an error extends beyond the application itself.

Equally important is regularly cleaning up the suite. Cases that test functions that no longer exist or that duplicate others should be removed. We write more broadly about prioritization and keeping the suite in healthy shape in our article on why and where to automate software testing.

Manual vs. automated regression

Regression can be performed manually or automatically, and both methods have their place. The key is a deliberate division of their roles, not an “either-or” choice.

Manual regression makes sense early in a project’s life, when the interface and requirements change quickly and writing automated tests would mean constant rewriting. It also works well where human judgment matters — in exploratory testing, usability and UX assessment, and in scenarios that are hard to capture in a rigid script. The downside is cost: manually repeating the same hundreds of steps for every version is slow, expensive, and prone to tester fatigue.

Automated regression shines exactly where manual regression struggles: in repeatable, stable scenarios that have to be run on every change. A test written once can be run as often as you like, at night, in parallel, without human involvement. This makes regression tests the best candidate for automation of all test types.

The boundary is pragmatic: automate what is stable and frequently repeated; leave to people what requires judgment and intuition.

Regression automation — how to approach it

Regression automation is not a one-off project, but a process that has to be built and maintained. A good practice is a layered approach, in line with the test pyramid: the most fast unit tests at the base, fewer integration tests in the middle, and the fewest slow end-to-end tests at the top. Such an arrangement provides fast feedback and a low maintenance cost.

The choice of tools depends on the layer and the technology. For web interface testing, frameworks such as Playwright, Cypress, or Selenium are used today; for APIs — tools like Postman or testing libraries in a given language; for the unit layer — the native frameworks of a given ecosystem. There is no single “best” tool — there is a tool matched to a specific problem and to the team’s competencies.

Regardless of the stack, a few principles are universal. Tests must be stable — a test that passes one time and fails another without a change in the code (a so-called flaky test) destroys trust in the entire suite. They must be independent — the order of execution should not matter, and each test prepares and cleans up its own data. And they must be maintainable — readable, with a reasonable structure, so that updating them does not take more effort than the change they concern. Separating test logic from interface details (e.g., the Page Object pattern) and using stable element identifiers instead of fragile selectors based on page layout both help with this.

Test data is a separate topic. Regression should start from a known, repeatable state — otherwise the result depends on whatever happens to be in the database rather than on the code. That is why it is worth investing in preparing and isolating data rather than testing “on whatever is there.” We write more broadly about building an effective process in our article on test automation at ARDURA Consulting.

Regression in CI/CD

Automated regression shows its full value only when it is wired into a CI/CD pipeline. The point is that the tests run automatically on every code change, without waiting for a human decision — and the result becomes the gate through which a change either passes on or gets stopped.

In practice, the division of roles usually looks like this:

On a pull request, selective regression and unit tests run — fast feedback in minutes, before the code reaches the main branch.
After a merge or in a nightly build, a broader or full regression runs, the kind it is not worth keeping on the critical path of every PR.
Before deployment, full regression acts as the last quality gate.

This integration provides two things that manually run regression cannot: repeatability (the tests always run the same way, in the same environment) and early detection (a bug surfaces an hour after the commit, not a week later in production). We have devoted a separate guide to integrating tests into CI/CD to pipeline configuration and wiring tests into it.

Good practices and common mistakes

The most frequent pitfalls in regression testing are predictable and can be avoided:

A growing suite without review. Tests only get added, never removed. After a year, regression takes hours and nobody trusts it. The remedy is regular review and deleting dead cases.
Tolerating flaky tests. A test that fails randomly teaches the team to ignore red results — and then a real regression passes unnoticed. An unstable test has to be fixed or disabled, not “clicked again.”
Automating an unstable interface. Writing E2E tests against a UI that changes every week means constant rewriting. Stabilization first, then automation.
Lack of prioritization. Treating all tests as equally important means critical paths wait in line behind trivial ones. Business-critical scenarios should take precedence.
Regression only before a release. If you run the full suite once a month, you accumulate risk. Better often and selectively than rarely and everything at once.

For a fuller picture of where regression fits among the other types of verification, it is worth reaching for an overview of the types of software testing.

How ARDURA Consulting supports test automation and QA

At ARDURA Consulting, we treat regression automation as a tool, not an end in itself. We start by analyzing which areas of the application will genuinely benefit from automation and where it is better to leave the assessment to testers — because automating everything “on principle” is the fastest route to an expensive, fragile suite that nobody maintains. We deploy our QA specialists and automation engineers in a staff augmentation model, joining them to client teams within 2 weeks, with access to over 500 seniors and experience from more than 211 projects. We build stable, maintainable regression suites wired into CI/CD, and we put existing ones in order when they have grown beyond reason.

We are a technology partner, not a supplier of ready-made scripts — we help the team close out the process so that it works even after we leave. If you want to accelerate your release cycle without losing control over quality, take a look at ARDURA Consulting’s testing services and let’s talk about how to build regression automation tailored to your project.

Regression Testing: When and How to Do It