Architecture Attributes Testing
How do you test non-functional requirements? How do you ensure the system fulfils all of them? A holistic approach to NFR testing inside.
One of the responsibilities of an architect is to ensure that the system fulfils its requirements. There is plenty of experience and information on how to set up quality control for functional requirements. However, there are few pieces covering an overall approach to NFR testing. In this article I will compare functional testing to testing of architecture attributes, provide tactics for testing performance, security, availability and maintainability, and describe an approach to continuous testing.
Unfortunate aspects of NFRs
Different kinds of requirements
Testing functionality is relatively straightforward (intentionally oversimplified): take the view of your users, write down the scenarios, go through them, think about combinations of features, observe the result and compare it with the expected one. Say we are testing the login screen of a mobile application.
You enter a few valid login/password pairs, press login and ensure they work. Then you enter an invalid password and check there is a proper error message. Then you enter an invalid email and check the validation. Finally, you disable the internet connection and observe the proper error. That's pretty much it.
This approach does not work with something like performance: a single performance measurement of a particular use case tells you nothing about the ability of the system to perform according to the requirement. If it performs well, you might just be lucky. If it performs poorly, it might be a problem with a particular user, network lag or something else. So you need a completely different approach.
Usually, such requirements include a measure and a statistical bound within which this measure should be observed. For example: “The response time for a static webpage should not exceed 500 ms at the 99th percentile”. That means that in 99 cases out of 100 the page should load within half a second, and in one case it may exceed that value. The same idea works for availability: we typically want the system to be available 99.9% of the time or so (the so-called “three nines”).
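To make the percentile requirement concrete, here is a minimal sketch of checking a p99 response-time budget against a batch of measurements. The nearest-rank percentile method and the sample numbers are my own illustration, not from any particular tool:

```python
import math

def percentile(samples, pct):
    """Nearest-rank percentile: the value below which pct% of samples fall."""
    ordered = sorted(samples)
    rank = math.ceil(pct / 100 * len(ordered))
    return ordered[rank - 1]

# 100 simulated response times in ms; a single slow outlier
# is allowed by a p99 requirement of 500 ms.
response_times = [120] * 99 + [900]
p99 = percentile(response_times, 99)
assert p99 <= 500  # requirement holds despite the 900 ms outlier
```

Note that the maximum (p100) here is 900 ms, which is why SLOs are stated at a percentile rather than as a hard maximum.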
Cross-cutting concerns
Another problem with NFRs is that they are cross-cutting: the team may break them unintentionally while developing a new feature.
Imagine you’re a developer adding a new feature, say stories on the main page of your application. Everything works well from your point of view, and you ship the feature to production. It then turns out that with stories the main page loads half a second slower; this affects the business metrics, people launch the app less frequently, and you start to lose money. A proper performance test would have prevented this situation.
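A regression guard for that scenario can be as simple as comparing the measured load time against an agreed budget in CI. This is a sketch; the baseline, tolerance and the idea of a `measured_ms` value coming from your monitoring tool are all assumptions:

```python
# Hypothetical performance regression guard: fail the build if the
# main-page load time exceeds the agreed budget (plus noise tolerance).

BASELINE_MS = 800   # budget agreed with the business (illustrative)
TOLERANCE = 0.10    # allow 10% run-to-run noise

def check_regression(measured_ms, baseline_ms=BASELINE_MS, tolerance=TOLERANCE):
    """Return True if the measurement stays within the budget."""
    limit = baseline_ms * (1 + tolerance)
    return measured_ms <= limit

# Before the stories feature the page fits the budget;
# the extra half a second afterwards trips the guard.
assert check_regression(780)
assert not check_regression(1280)
```

Wiring such a check into the pipeline turns a silent business-metric drop into a red build the developer sees before shipping.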
Is it that bad?
The fortunate thing about testing NFRs is that the overall approach is actually not that different from testing functional requirements or constraints. You can still use static analysis for parts of security testing; you can write end-to-end tests focused on performance; there are even unit tests for maintainability and architecture, such as ArchUnit. In the end, the difference lies in the concrete methods and tools, not in the overall approach itself. I will get back to the approach later in this article; for now, I want to focus on testing particular NFRs.
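To illustrate the "unit tests for architecture" idea in a language-neutral way, here is an ArchUnit-style rule sketched in Python: the domain layer must not import from the infrastructure layer. The layer names and the module source are invented for the example:

```python
import ast

# Minimal ArchUnit-style rule: code in the domain layer must not
# import from the infrastructure layer. Layer names are illustrative.

FORBIDDEN = {"infrastructure"}

def forbidden_imports(source):
    """Return the names of forbidden modules imported by the given source."""
    hits = []
    for node in ast.walk(ast.parse(source)):
        if isinstance(node, ast.Import):
            hits += [a.name for a in node.names
                     if a.name.split(".")[0] in FORBIDDEN]
        elif isinstance(node, ast.ImportFrom) and node.module:
            if node.module.split(".")[0] in FORBIDDEN:
                hits.append(node.module)
    return hits

# A hypothetical domain module that violates the rule:
domain_module = "from infrastructure.db import Session\nimport os\n"
assert forbidden_imports(domain_module) == ["infrastructure.db"]
```

Run over every file in the domain package, such a check fails the build the moment someone couples the layers, exactly like a functional unit test would.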
Testing NFRs
Performance
As per Neal Ford’s “Fundamentals of Software Architecture”, you may view the system through the lens of architecture quanta: groups of components that share the same non-functional requirements. Let’s say you build a URL shortener: you accept a small portion of user data from a large number of users; at the same time you need a small web interface to view analytics on link usage. The first part is heavily loaded and should be highly available; the second part will have one or two users and should be available mostly during business hours.
Those two parts have different performance requirements and require different testing. Moreover, the consequences of performance degradation are different as well. So the idea is to design performance testing not for the whole system but per quantum.
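One way to make "per-quantum testing" tangible is to declare the targets per quantum and let each test suite read its own numbers. The quanta names and figures below are assumptions for the URL-shortener example, not prescribed values:

```python
# Hypothetical per-quantum performance targets for the URL shortener.
# Each quantum gets its own test suite driven by its own numbers.

QUANTA = {
    "redirect_api": {"p99_ms": 100, "availability": 0.999},
    "analytics_ui": {"p99_ms": 2000, "availability": 0.99},
}

def targets_for(quantum):
    """Look up the NFR targets a given quantum is tested against."""
    return QUANTA[quantum]

# The heavily loaded redirect path is held to a much tighter budget
# than the two-user analytics interface.
assert targets_for("redirect_api")["p99_ms"] < targets_for("analytics_ui")["p99_ms"]
```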
When it comes to performance, we need to distinguish reads from writes, and properly collected requirements usually do that. We can require the webpage to load in under 1 second 99.9% of the time. We can also require that form submission yields a result in under 5 seconds 99% of the time. These are different requirements and should be tested individually: testing the former gives zero information about the latter.
The next issue is testing under the right load. Return to the webpage loading in under 1 second: under what circumstances? It might be very easy with 1 concurrent user and much harder with 1000. Also, you cannot test form submission with random data: you have to test at least against minimal input, average input and some extreme values. Long story short, you need to know your load model and run your tests against it; otherwise you know little about your system from the load perspective.
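A load model can be captured explicitly as a traffic mix over input profiles and fed to the load generator. The profiles, weights and payload sizes here are invented for the form-submission example:

```python
import random

# Sketch of a load model for the form-submission scenario.
# Weights and payload sizes are assumptions, not measured traffic.

LOAD_MODEL = {
    "minimal": {"weight": 0.2, "payload_fields": 1},
    "average": {"weight": 0.7, "payload_fields": 10},
    "extreme": {"weight": 0.1, "payload_fields": 500},
}

def pick_profile(rng):
    """Sample an input profile according to the traffic-mix weights."""
    r = rng.random()
    cumulative = 0.0
    for name, profile in LOAD_MODEL.items():
        cumulative += profile["weight"]
        if r <= cumulative:
            return name
    return "extreme"  # guard against floating-point rounding

# A seeded run: the generated mix roughly follows the declared weights.
rng = random.Random(42)
profiles = [pick_profile(rng) for _ in range(1000)]
assert 0.6 < profiles.count("average") / 1000 < 0.8
```

Driving the load generator from such a model means a passed test actually says something about production-like traffic, not about whatever inputs the test author happened to type.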
The testing environment is usually a problem as well. Many parameters of your system affect the performance you observe: replica count affects the number of concurrent users; resource allocation (CPUs and memory) limits the maximum load; storage limits the size of your databases, and so on. Here you face a dilemma: either you test in production, or you spend budget on a separate environment and on complex procedures to replicate anonymized production data into it. There is no proper general answer; you need to weigh the options for your system.
As an example of production performance testing for a highly popular service, see the article by Netflix. Netflix cares a lot about user experience, and performance is a crucial aspect of it. With a user base of 200+ million viewers and a wide variety of devices, proper performance testing has to be in place. The article describes the approaches and measures Netflix uses to ensure no performance regressions reach production.
Security






