Photo by JJ Ying on Unsplash
Defining intelligence with the Turing Test
If you have ever done one of those tests where they require you to identify some bicycles or fire hydrants in a picture or type out the letters displayed in a scrawled out note you have done a Completely Automated Turing Test to Tell Computers and Humans Apart (CAPTCHA). Failing my first ever CAPTCHA test a few years back made me wonder if I was just a very dumb human, or if I was actually a self aware robot that had finally been exposed. It really makes you stop and think how sophisticated our computer programs have become that the very tests meant to tell computers and humans apart are unable to do so in one attempt. One of the main reasons for this is because the traits that make up our understanding of human intelligence are still not fully defined yet. There is no clear-cut equation describing it, just a bunch of fuzzy rules and gut feeling instinct. Underlying the most widespread of systems are still contentious philosophical issues, which is what makes the study of artificial intelligence and the debate surrounding it so interesting.
In a 1950 paper published in Mind, Turing proposed a solution to the much debated issue of defining intelligence practically, a solution that has now become a widely studied and adopted benchmark known as the Turing Test. The idea of the CAPTCHA came later, but the motivation is the same finding a way of assessing the humanness of a subject. The current modern interpretation of the Turing test is a situation in which Players A and B hold separate text-based conversations with Player C, also known as the interrogator, and attempt to convince Player C that they are human with their responses given. However, one of the players, either A or B, is a computer that is attempting to imitate a human. It is assumed that Player C is aware that either A or B is a computer but does not know which one (as opposed to variations of the test where Player C is not aware that a computer is present). The role of the interrogator is to discern which of the players is the human and which is the computer given a finite interaction time of typically several minutes.
As the decision is made purely based on how human the computer can act, the Turing test conveniently disregards the internal states of the mind, emotions, the subconscious and effectively focuses on the tangible result of the thinking process, making it an operational test of computer/human indistinguishability.
Throughout history, several programs have come convincingly close to clearing the Turing test, such as PC Therapist by Joseph Weintraub in 1991 and Rollo Carpenters Clever bot in 2011. The first widely accepted success was by the computer program Eugene Goostman in 2014 at the Royal Society in London. An analysis of the results credited the following features for Eugene Goostmans success: the machine had character, it frequently asked questions to the interrogator, and it occasionally made spelling errors. This made for a rich conversation with the interrogator that at times stood out from the relatively duller conversation between two actual humans, which was a cause for some of the misidentification.
Should we even use such a test in the first place?
The Turing test has been praised for its simplicity and its breadth. It allows the performance of artificial intelligence to be measured, albeit from an oblique angle that does not deal with precise definitions of thinking and intelligence. It also allows for unlimited scope in terms of conversation topics, just as how a normal conversation can flow between two humans. Last, it emphasizes human-like thinking and emotional intelligence rather than one which focuses on discursive intelligence only.
One widely studied aspect of the Turing test is the confederate effect and its converse, the Eliza effect. The former is where the judge misidentifies a human as a machine and the latter is when a machine passes successfully as a human. Research into the results of the 2003 Loebner Prize (an instantiation of the Turing test) showed that the human players had mean scores of less than 4.0, indicating that most of the interrogators could not positively identify them as human and had marked them as probably a human. In our daily interaction with other people through text messaging, could we tell if the other party has been replaced by clever chatbot? What kinds of behaviours are dead giveaways and what should we look out for? After all, some intelligent behaviour is inhuman, such as performing fast arithmetic calculations, while some unintelligent or illogical behaviour is human, like non-sequitur, and past a certain point, the inadequacies of a robotic mind start to coincide with the eccentricities of the human one it is trying to mimic.
A common criticism that is levelled is that the test has caused teams to focus on creating artificial intelligence with the sole aim of passing the test instead of focusing on real intelligence. Machines that are specially designed for the test are often optimised with various tactics meant to purposely mislead the human judge without actually furthering the actual intelligence of the system. For example, the Eugene Goostman program used a variety of question deflection techniques and witticisms to cover up the fact that it did not know the answer to simple elementary questions. Creating a chatbot that can fool humans is not the same as creating real artificial intelligence such as a program that can read a newspaper and answer questions based on the content in the papers, because no amount of evasive manoeuvring can ultimately compensate for skills like reasoning and semantic understanding.
Furthermore, the broad field of artificial intelligence deals with areas that are not directly related to natural language processing and conversation but still require machine intelligence, for example, machine vision and robotics. Using the Turing test sets a misleading standard for the definition of success in the field and downplays the contribution of artificial intelligence in a diverse range of applications from the stock market to autonomous vehicles.
Another argument against the test asserts that there is no basis for pinning the development of artificial intelligence to mimic that of the human brain. The brain may be used as a model for the development of neural networks, but it should not necessarily be the endpoint or the overall goal, of neural network development. An oft cited analogy is the study of flight, which used to concern itself with recreating the flapping motion of the wings of a bird but failed to achieve any significant success. However, once scientists moved away from trying to mimic nature and turned to actually understanding the physics and theory behind flight, much more progress was made in the field. Hence researchers may benefit more by not trying to perfect a human-like robot, but by building one from fundamental theories and first principles of intelligence.
If it walks and talks like a duck, is it a duck?
The debate regarding Turing test naturally leads into one concerning the legitimacy in using such techniques to measure human intelligence, and whether we can even measure it in the first place.
One argument goes that ascribing intelligence solely based on behavioural performance in the Turing test is absurd, as thinking cannot be fully described by just the results collected on the behaviour of a machine. Externally observable behaviours like walking and talking like a duck are not a rigorous basis for identifying ducks. The essence of the objection being that the Turing test is ultimately based on a behavioural construal of the concept of thinking which has no strong underlying basis.
Yet when we lack a more sophisticated understanding of the mind, behavioural analysis is the only way we can rigorously study intelligence. In fact, forms of the imitation game are regularly used to assess humans too for example assessment in the form of viva voce, think of a PhD defence, with the aim to discover whether one has truly understood the content, rather than just memorizing it in a parrot fashion. Such assessments are naturally behaviour based, as we have no way of critiquing thinking.
Even if a machine could pass the Turing test, it may not necessarily be intelligent. In 1980 John Searle introduced the Chinese room thought experiment: a human with no understanding of Chinese sits in a room following a set of instructions to produce Chinese output can give the impression to outside observers that the machine understands Chinese. Searle asserts that computers, which merely follow instructions, would never be able to truly gain intelligent understanding of a subject. They would only be a simulation at best what Searle terms as Weak AI. So when IBMs Deep Blue beat Garry Kasparov at chess, the machine may have appeared intelligent, but it had as much understanding of chess as Searle has of Chinese, that is to say, none.
Searles philosophy follows the idea that the intelligent mind cannot be reduced to an algorithm that we can define. Furthermore, creating an artificial intelligence on par with the human mind would involve testing for emotions, self-awareness, soul, originality, and all of the other multiple facets of human consciousness alongside intelligence. Robert French argued similarly in 1990 that the Turing test was limited as it could only measure intelligence as defined by human culture and experience. Hence other tests besides the Turing test are needed to measure what these thinkers view as true intelligence.
Of course, the contrary view contends that all human actions, from cooking to studying, can be reduced to the mere execution of an algorithm stored in the brain and hence can be effectively copied by a machine. Humans are nothing more than flesh and blood computers, with every decision stemming from a deterministic system of interacting atoms, no different from the machines we build.
At the end of the day, the Turing test does little in helping us understand the true nature of intelligence, which is more a question for the philosophers and psychologists. It can help serve as a benchmark for accomplishments in machine intelligence but it should definitely not be taken as a singular measure of an artificial intelligences effectiveness. Today, several alternatives to the Turing test exist including the Lovelace test, Total Turing Test, Wozniak test, as well as variations on the original Turing test itself. These tests help evaluate additional yardsticks that contribute to intelligence. Recent advancements have brought us close to creating truly human-esque thinking, but the best machines today such as OpenAIs GPT-3 still reveal a general lack of common-sense, and would rely on various shady tactics to pass off as one of us. Ultimately, the importance of the test today seems predicated on the difficulty we have in defining intelligence, and what the essence of human thinking is. So, until philosophers and psychologists achieve greater clarity in that area, the significance and relevance of the Turing test is fair game for anybody.
 Turing, Alan M. 1950. Computing machinery and intelligence. MindLIX(236):433460.
 Warwick, K. and Shah, H., 2016. Can machines think? A report on Turing test experiments at the Royal Society. Journal of Experimental & Theoretical Artificial Intelligence, 28(6), pp.9891007.
 Shah, H., 2005. The confederate effect in human machine textual interaction. WSEAS ISCA, Available at: https://www.researchgate.net/publication/236889402_The_Confederate_Effect_in_Human-Machine_Textual_Interaction.
 Searle, J., 1980. Minds, Brains and Programs. The Behavioural and Brain Sciences. Behavioural and Brain Sciences, 3(3), pp.417424.
 French, R., 1990. Subcognition and the Limits of the Turing Test. Mind, 99(393), pp.5365.
See the original post:
- Governor recognizes Aparna Bhooshanan for national commendation - The Madison Record - themadisonrecord.com - June 11th, 2021
- Chuck Schumer's Bipartisan Bill Is Bad for the Planet - The New Republic - June 11th, 2021
- Belgium and England are favorites according to a study - The Press Stories - June 11th, 2021
- A judge's rant to the awards galaxy - Chessbase News - May 30th, 2021
- 11 ways that I am avoiding revising for my exam - The Tab - May 30th, 2021
- Research Confirms Your Return-To-Office Theory ... Whatever It Is - Bisnow - May 30th, 2021
- Study: QAnon Is as Big as Some Major Religions in the US and One in Four White Evangelicals Believe It | RELEVANT - RELEVANT Magazine - May 30th, 2021
- The Intersection Of Poker And Chess - Chess.com - May 30th, 2021
- Steel Grating Market to Eyewitness Massive Growth by 2025 | Fisher & Ludlow, Nucor Corporation, Webforge Group The Shotcaller - The Shotcaller - May 30th, 2021
- The Publishing Industry is on the Rise, Rohit Arya Leads - Influencive - May 30th, 2021
- 'It was like watching two grandmasters of chess, locked in a battle of wits': the Guardiola and Tuchel... - The Athletic - May 30th, 2021
- Home-Schooled 22-Year-Old Is the Youngest Person in the UK to Get PhD in Astrophysics - The Epoch Times - May 30th, 2021
- Global Counter-UAV (C-UAV) Systems Market 2021 Comprehensive Research Report and Forecast to 2026 KSU | The Sentinel Newspaper - KSU | The Sentinel... - May 30th, 2021
- So You Want to End the Conversation? - The New York Times - May 30th, 2021
- Pratyusha: 'The long break came at the wrong time' - Sportstar - May 30th, 2021
- Global International Chess Market To Witness Huge Gains Over 2021-2026|The House of Staunton, ChessSUA, CNCHESS, ChessBaron, Shri Ganesh (India)... - May 5th, 2021
- "Behind the Stonewall you should never lose" - Chessbase News - May 5th, 2021
- Turner Novak is monetizing memes with venture capital - The Hustle - May 5th, 2021
- IT Spending In Cinema Market with Future Prospects, Key Player SWOT Analysis and Forecast To 2026 - AlgosOnline - May 5th, 2021
- Rebirth and recreation: Parks and People helps Baltimore communities replace blight with new parks - Baltimore Business Journal - May 5th, 2021
- Teaching children to play chess makes them more confident taking calculated risks - ZME Science - April 21st, 2021
- Mad bishops and knights - Chessbase News - Chessbase News - April 21st, 2021
- Counter UAV Market by Technological Innovations, Trend and Growing Demand 2021-2026 KSU | The Sentinel Newspaper - KSU | The Sentinel Newspaper - April 21st, 2021
- Brain-controlled chess is here - Big Think - April 21st, 2021
- Global Counter UAV Market 2021 Industry Insights, Drivers, Top Trends, Global Analysis And Forecast to 2027 KSU | The Sentinel Newspaper - KSU | The... - April 21st, 2021
- Sunday Monday is on a mission to provide you with colorful and ethically sourced fabrics - Yahoo Canada Shine On - April 21st, 2021
- The ghost in the archives #3: Sexism, legal discrimination and 'the game of bastardry' The Echo - Echonetdaily - April 21st, 2021
- Microsoft-Discord acquisition deal said to be off the table - ETTelecom.com - April 21st, 2021
- Lourdes Gurriel Jr on seeing the ball better, Vladimir Guerrero Jr's sensational start - Yahoo Canada Shine On - April 21st, 2021
- SF Deizisau convincingly wins the European Club Cup - Chessbase News - April 5th, 2021
- Premium Insights on Chess Market 2021-2026 by Leading Players like The House of Staunton, ChessSUA, CNCHESS, ChessBaron, Shri Ganesh (India)... - April 5th, 2021
- Freeline Reports Full Year 2020 Financial Results and Business Highlights - GlobeNewswire - April 5th, 2021
- Penn State mock draft roundup: Where will Micah Parsons, Jayson Oweh and Pat Freiermuth go in 2021 NFL Draft? - PennLive - April 5th, 2021
- Trending Report of Chess Table Market Drivers, Strategies, Applications and Competitive Landscape 2026 The Bisouv Network - The Bisouv Network - March 9th, 2021
- Learn to love the game: An interview with Andrew Martin - Chessbase News - March 9th, 2021
- At What Age Are You At The Top Of Your Game | KJ103 | TJ, Janet & JRod - iHeartRadio - March 9th, 2021
- For seniors, its a double pandemic COVID-19 and crushing isolation - MarketWatch - March 9th, 2021
- Getting the Green Light - richmondmagazine.com - Richmond magazine - March 9th, 2021
- New Trends: Chess Market Insights, Current and Future Trend 2021-2026 | The House of Staunton, ChessSUA, CNCHESS, ChessBaron, Shri Ganesh (India)... - March 9th, 2021
- How to cope with squabbling siblings: Five expert tips on handling competitive children - The Irish Times - February 20th, 2021
- Chess Market 2021 Research Report With Covid-19 Update Key Players Analysis, Growth Factors and Forecast to 2027 | The House of Staunton, ChessSUA,... - February 20th, 2021
- Incredible Growth of Counter-UAS System Market to Boom in Future by Industry market by top key Players SRC, Lockheed Martin, Thales NeighborWebSJ -... - February 20th, 2021
- Opinion | American politics are turning us into pessimists - University of Pittsburgh The Pitt News - February 17th, 2021
- 10 Activities To Do When You Need a Study Break - Uloop News - February 17th, 2021
- News from the hill - News24 - February 17th, 2021
- Beckerman Film Series Asks The Hard Questions - New Haven Independent - February 17th, 2021
- Saving Mr Kasparov (and Mr Korchnoi) - TheArticle - February 15th, 2021
- Gaming the System: 'Minecraft' cheating as a case study in subculture drama The Daily Free Press - Daily Free Press - February 15th, 2021
- B2B Telecommunications Market Giants To Grow At Much Faster Pace | Azzurri Communications, BT Group, Arqiva, Charterhouse Voice and Data The Bisouv... - February 15th, 2021
- Streaming is still a good deal, but it's starting to get pricey - WRCB-TV - February 15th, 2021
- Happy Lunar New Year - University of California - February 15th, 2021
- Koala rescued after causing 5-car pileup on Australian freeway, and more of this week's weirdest news - Herald & Review - February 15th, 2021
- Interesting Facts About Anya Taylor Joy From Emma and The Queens Gambit - OtakuKart - February 12th, 2021
- Promoting Exercise Through Gamification-Making Physical Activity More Fun? - Forbes - February 12th, 2021
- SXSW Sets Stacey Abrams As Keynote, Unveils New Round Of Featured Speakers Including Elizabeth Banks, Mary J. Blige, The Russo Brothers And More -... - February 12th, 2021
- Walrus Ivory Holds the Clue to the Lost Norse Civilization in Greenland - Science Times - February 12th, 2021
- Behind the Grind of the N.B.A. Team With the Next Big Thing - The New York Times - February 12th, 2021
- Get informed on the top stories of the day in one quick scan - CBC.ca - February 10th, 2021
- Students rely on hobbies to maintain mental health during pandemic - GW Hatchet - February 10th, 2021
- Award-winning documentarian turns to the story of Billy Stewart - Lake County News - February 7th, 2021
- 15 Tips to Improve the Way People Think of You in the Office (According to a Psychologist) - mySanAntonio.com - February 7th, 2021
- All he needs to do on the court is react - Catholic Star Herald - February 6th, 2021
- Chess grandmaster rates 7 chess scenes in movies and TV for accuracy - Yahoo News - February 6th, 2021
- White House might send masks to every American; no cruising to Canada until 2022; young adults drive infections: Latest COVID-19 updates. - USA TODAY - February 6th, 2021
- Feeling More Stressed Than Ever? You're Not Alone - Healthline - February 6th, 2021
- The Hess Triangle in New York City's West Village To Be Sold - Mental Floss - February 6th, 2021
- Local woman shines in world of chess - Yahoo News - February 4th, 2021
- Exclusive: Meet the startups in Harvard i-lab's 2021 Spring Venture Program - Boston Business Journal - February 4th, 2021
- Why is the European Metals (ASX:EMH) share price 12% higher today? // The Motley Fool Australia - The Motley Fool Australia - February 3rd, 2021
- This Is Where Empathy Lives in the Brain, and How It Works - Singularity Hub - February 3rd, 2021
- Deaths for the week of Jan. 31-Feb. 6, 2021 J. - The Jewish News of Northern California - February 3rd, 2021
- 4 schools in the UK offering a character-building education - Study International News - February 3rd, 2021
- The Story Of How Trump Officials Tried To End H-1B Visas - Forbes - February 1st, 2021
- Study finds surprising defensive functions of the immune system against new viral attacks - News-Medical.net - January 30th, 2021
- The 5 Most Influential Shows On Netflix - Forbes - January 30th, 2021
- Brain exercises: 22 ways to improve memory, cognition, and creativity - Medical News Today - January 30th, 2021
- COLUMN: Library has virtual programs for kids and teens - yoursun.com - January 30th, 2021
- Ten Interesting Things We Read This Week - Forbes India - January 30th, 2021
- Chess Market Research Report: Cagr Status, Industry Growth, Trends, Analysis And Forecasts To 2027|The House of Staunton, ChessSUA, CNCHESS,... - January 28th, 2021
- Global Chess Market Insights, Overview, Analysis and Forecast 2020 Murphy's Hockey Law - Murphy's Hockey Law - January 28th, 2021