Viw Magazine

Men's Weekly

.

Algorithms are everywhere but what will it take for us to trust them?

An algorithm is just following rules designed either directly or indirectly by a human. Shutterstock/Billion Photos

The role of algorithms in our lives is growing rapidly, from simply suggesting online search results or content in our social media feed, to more critical matters like helping doctors determine our cancer risk.

But how do we know we can trust an algorithm’s decision? In June, nearly 100 drivers in the United States learned the hard way that sometimes algorithms can get it very wrong.

Google Maps got them all stuck on a muddy private road in a failed detour to escape a traffic jam heading to Denver International Airport, in Colorado.

Google Maps glitch sends Colorado drivers to muddy backroad.

As our society becomes increasingly dependent on algorithms for advice and decision-making, it’s becoming urgent to tackle the thorny issue of how we can trust them.


Read more: What's not to like? Instagram's trial to hide the number of 'likes' could save users' self-esteem


Algorithms are regularly accused of bias and discrimination. They have attracted concern from US politicians, amid claims we have white men developing facial recognition algorithms trained to work well only for white men.

US committee investigates racial bias in facial recognition software.

But algorithms are nothing more than computer programs making decisions based on rules: either rules that we gave them, or rules they figured out themselves based on examples we gave them.

In both cases, humans are in control of these algorithms and how they behave. If an algorithm is flawed, it’s our doing.

So before we all end up in a metaphorical (or literal!) muddy traffic jam, there is an urgent need to revisit how we humans choose to stress-test those rules and gain trust in algorithms.

Algorithms put to the test, kind of

Humans are naturally suspicious creatures, but most of us can be convinced by evidence.

Given enough test examples – with known correct answers - we develop trust if an algorithm consistently gives the correct answer, and not just for easy obvious examples but for the challenging, realistic and diverse examples. Then we can be convinced the algorithm is unbiased and reliable.

Sounds easy enough, right? But is this how algorithms are usually tested? It’s harder than it sounds to make sure that test examples are unbiased and representative of all possible scenarios that could be encountered.

More commonly, well studied benchmark examples are used because they are easily available from websites. (Microsoft had a database of celebrity faces for testing facial recognition algorithms but it was recently deleted due to privacy concerns.)

Comparison of algorithms is also easier when tested on shared benchmarks, but these test examples are rarely scrutinised for their biases. Even worse, the performance of algorithms is typically reported on average across the test examples.

Unfortunately, knowing an algorithm performs well on average doesn’t tell us anything about whether we can trust it in specific cases.

It’s not surprising to read that doctors are sceptical of Google’s algorithm for cancer diagnosis, which offers 89% accuracy on average. How does a doctor know if their patient is one of the unlucky 11% with an incorrect diagnosis?


Read more: Treat or trick: we asked people how they feel about sharing fitness data with insurance companies


With increasing demand for personalised medicine tailored to the individual (not just Mr/Ms Average), and with averages known to hide all sorts of sins, the average results won’t win human trust.

The need for new testing protocols

It’s clearly not rigorous enough to test a bunch of examples - well-studied benchmarks or not - without proving they are unbiased, and then draw conclusions about reliability of an algorithm on average.

And yet paradoxically this is the approach on which research labs around the world depend to flex their algorithmic muscles. The academic peer-review process reinforces these inherited and rarely questioned testing procedures.

A new algorithm is publishable if it’s better on average than existing algorithms on well-studied benchmark examples. If it’s not competitive in this way, it’s either hidden away from further peer-review scrutiny, or new examples are presented for which the algorithm looks useful.

In this way, a warm, flattering light is shone on each newly published algorithm, with little attempt to stress-test its strengths and weaknesses, and present it warts and all. It’s the computer science version of medical researchers failing to publish the full results of clinical trials.

As algorithmic trust becomes more crucial, we urgently need to update this methodology to scrutinise whether the chosen test examples are fit for purpose. So far, researchers have been held back from more rigorous analysis by the lack of suitable tools.

We’ve built a better stress-test

After more than a decade of research, my team has launched a new online algorithm analysis tool called MATILDA: Melbourne Algorithm Test Instance Library with Data Analytics.

It helps stress-test algorithms more rigorously by creating powerful visualisations of a problem, showing all scenarios or examples an algorithm should consider for comprehensive testing.

MATILDA identifies each algorithm’s unique strengths and weaknesses, recommending which of the available algorithms to use under different scenarios and why.

For example, if recent rain has turned unsealed roads into mud, some “shortest-path” algorithms may be unreliable unless they can anticipate the likely impact of weather on travel times when advising the quickest route. Unless developers test such scenarios they’ll never know about such weaknesses until it is too late and we are stuck in the mud.

MATILDA helps us see the diversity and comprehensiveness of benchmarks, and where new test examples should be designed to fill every nook and cranny of the possible space in which the algorithm could be asked to operate.

The image below shows a diverse set of scenarios (dots) for a Google Maps type of problem. Each scenario varies conditions - like the origin and destination locations, the available road network, weather conditions, travel times on various roads - and all this information is mathematically captured and summarised by each scenario’s two-dimensional coordinates in the space.

A Google-maps-type problem with diverse test scenarios as dots: Algorithm B (red) is best on average, but Algorithm A (green) is better in many cases. MATILDA, Author provided

Two algorithms are compared (red and green) to see which can find the shortest route. Each algorithm is proven to be best (or shown to be unreliable) in different regions depending on how it performs on these tested scenarios.

We can also take a good guess at which algorithm is likely to be best for the missing scenarios (gaps) we haven’t yet tested.


Read more: Consumer watchdog calls for new measures to combat Facebook and Google's digital dominance


The mathematics behind MATILDA helps to create this visualisation, by analysing algorithm reliability data from test scenarios, and finding a way to see the patterns easily.

The insights and explanations mean we can choose the best algorithm for the problem at hand, rather than crossing our fingers and hoping we can trust the algorithm that performs best on average.

By rigorously stress-testing algorithms in this way – warts and all – we should reduce the risk of rogue algorithm decisions, securing the trust of Mr/Ms Average, and perhaps even the most sceptical humans.

Kate Smith-Miles receives funding from the Australian Research Council as a Georgina Sweet Australian Laureate Fellow, and as a Chief Investigator in the Australian Centre of Excellence in Mathematical and Statistical Frontiers (ACEMS). She is Immediate Past President of the Australian Mathematical Society, and a member of the Australian Research Council's College of Experts.

Authors: Kate Smith-Miles, Professor of Applied Mathematics, ARC Laureate Fellow, Chief Investigator in the Australian Centre of Excellence in Mathematical and Statistical Frontiers, University of Melbourne...

Read more

LifeStyle

The Importance Of Professional Fiberglass Boat Repair For Strength, Safety And Long-Term Performance

Boats made from fiberglass are known for their durability, lightweight structure and smooth perfor...

Why Choosing the Right Cosmetic Clinic Bundoora Matters for Confidence and Care

Personal appearance can influence confidence, comfort, and overall wellbeing. Many people seek tre...

Paint Protection Film Brisbane: The Ultimate Guide to Protecting Your Vehicle

Brisbane's harsh subtropical climate, with its intense UV rays, summer storms, and coastal condition...

How Family Court Lawyers Can Guide You Through High-Conflict Parenting Disputes

High-conflict parenting disputes can be draining, unpredictable and emotionally overwhelming, espe...

hacklink hack forum hacklink film izle hacklink สล็อตเว็บตรงbets10คลิปโป๊casibombets10casibom girişonwinjojobet güncel girişjojobet girişbets10kavbetcasibomroyal reelsbets10betkolikKayseri EscortpusulabetJojobettaraftariummilanobetmilanobetbettiltpusulabetGalabetaviator gamematbettimebettimebettimebetbahisoistanbul escort telegramcasibomcasibompantheraproject.netcasibomcrown155 casinohb88aussuper96 loginbetsmovebetofficebetofficehttps://www.chicnic.org/casibom한국야동casibom girişสล็อตpadişahbetcasibomgiftcardmall/mygift주소모음 주소모아spin2u loginneoaus96 casino loginpadişahbetStreameastzirvebetmarsbahisjojobetgooglebets10ff29 casinoStreameastmatbetstakemate77best e-wallet pokies 2025破解工具топ 10 казинорейтинг лучших казиноjojobet 1115splashzbahis girişjojobetmostbetizmit escortdinamobetmostbetbahis siteleri 2025matbet girişjojobetwww.giftcardmall.com/mygiftpusulabetcasibomcasibom girişgiftcardmall/mygiftsadfasdfsdfasdasdasdasdmeritkingpusulabetjojobettaraftariumлучшие казино на деньгиpin up azcasino med Klarnajojobet 1115Casibomwww.mcgift.giftcardmall.com balancegiftcardmall/mygiftwww.giftcardmall.com/mygift activatetm menards loginartemisbetnerobetbetasussekabetjojobetcasibomcasibomlunabetzbahiskingroyalkingroyal güncel girişjojobetcasibomcasibom girişhazbetjojobetsitus slot gacorGalabetcasibom9022google hit botudizipalperabetrealbahisrealbahisperabetbetwoonizmit escortonwin girişeSIM Evropapusulabetpusulabetpusulabetartemisbetbetasusholiganbet girişmeritkingpusulabetjojobetMarsbahismarsbahiscasibomjojobet girişkingroyaljojobetgiftcardmall/mygiftbetlikedeneme bonusu veren sitelercasibom güncel girişholiganbet girişStreameastтоп рейтинг казиноcasibomjojobetbets10matbetGanobetGalabetcasinolevantsekabet girişmarsbahisjojobet girişmeritkingextrabetholiganbetprimebahisiptv satın almatbetjojobetjojobetgrandpashabetcasibomjojobetonwinbetpasbets10 hacklink hack forum hacklink film izle hacklink สล็อตเว็บตรงbets10คลิปโป๊casibombets10casibom girişonwinjojobet güncel girişjojobet girişbets10kavbetcasibomroyal reelsbetkolikKayseri EscortpusulabetJojobettaraftariummilanobetmilanobetbettiltpusulabetGalabetaviator gamematbettimebettimebettimebetbahisoistanbul escort telegramcasibomcasibompantheraproject.netcasibomcrown155 casinohb88aussuper96 loginbetsmovecasibom한국야동casibom girişสล็อตpadişahbetcasibomgiftcardmall/mygift주소모음 주소모아spin2u loginneoaus96 casino loginpadişahbetStreameastzirvebetmarsbahisjojobetgooglebets10ff29 casinoStreameastmatbetstakemate77best e-wallet pokies 2025топ 10 казинорейтинг лучших казиноjojobet 1115splashzbahis girişjojobetmostbetizmit escortdinamobetmostbetbahis siteleri 2025matbet girişjojobetwww.giftcardmall.com/mygiftpusulabetcasibomcasibom girişgiftcardmall/mygiftsadfasdfsdfasdasdasdasdmeritkingpusulabetjojobettaraftariumлучшие казино на деньгиpin up azcasino med Klarnajojobet 1115Casibomwww.mcgift.giftcardmall.com balancegiftcardmall/mygiftwww.giftcardmall.com/mygift activatetm menards loginartemisbetnerobetbetasussekabetjojobetcasibomcasibomlunabetzbahiskingroyal güncel girişjojobetcasibomcasibom girişhazbetjojobetsitus slot gacorGalabetgoogle hit botudizipalperabetrealbahisrealbahisperabetbetwoonizmit escortonwin girişeSIM Evropapusulabetpusulabetpusulabetartemisbetbetasusholiganbet girişmeritkingpusulabetjojobetMarsbahismarsbahiscasibomjojobet girişjojobetgiftcardmall/mygiftbetlikedeneme bonusu veren sitelercasibom güncel girişholiganbet girişStreameastтоп рейтинг казиноcasibomjojobetbets10matbetGanobetGalabetcasinolevantsekabet girişmarsbahisjojobet girişmeritkingextrabetholiganbetprimebahisiptv satın almatbetjojobetjojobetcasibomjojobetonwinbetpasbets10