How Deep Learning Is Teaching Machines To Detect & Filter Inappropriate Responses Like A Boss

We are in the middle of an AI revolution where computers are promising to become the trusted lieutenants and adorable friends of humans. This true potential can be only realized if these bots, so to speak, become more aware of their actions.

Manoj Kumar ChinnakotlaUpdated: Jun 08, 2017, 19:27 IST
How Deep Learning Is Teaching Machines To Detect & Filter Inappropriate Responses Like A Boss

We live in a digital world where we rely on our digital devices, such as smartphones and computers, for keeping ourselves informed, learning new concepts, planning our activities, and for taking crucial decisions in life. With the evolution of technology, the way we interact with our digital devices and computers has also undergone a rapid transformation 每 from the time when we were expected to key in a series of arcane commands for getting work done, to the age of Graphical User Interfaces (GUI) which needed several clicks of a mouse button to trigger some actions, to the current age of conversational interfaces 每 where the interaction happens through a natural language conversation using voice or text.

xiaoice chat

AP

Conversational Interfaces (CI) are powered using Artificial Intelligence (AI) and offer a much more natural, intuitive, and human-like interface to delegate tasks and get work done. Not just that, they*re even capable of providing fun, entertainment, amusement, and a personal emotional connect. For example, I can ask a virtual assistant like Cortana/Google Now/Siri, which is available on my smartphone, to book a cab ride, search online for ※punjabi food recipes§, or to entertain me with some jokes and songs 每 all with the help of a voice or text-based conversation.

Conversational Agents or Chat-Bots are also being deployed by various online business portals to provide a more personalised experience for their customers. In fact, some chat-bots, (such as Microsoft Xiaoice 每 a text-based chat-bot), are evolving beyond being assistants and note-takers to project a persona of their own. They have unique language characteristics, a sense of humor, and the ability to connect with users* emotions.

Chatting with machines is complicated

While it*s certainly great news that we can converse with our machines much like we do with other humans, there are certain major concerns which still need to be addressed. As a social species, humans are endowed with two fundamental qualities 每 self-awareness and self-regulation. These qualities ensure that our behavior and actions in the society are ※appropriate§ i.e. satisfy the commonly accepted standards or expectations of society such as not being rude, discourteous, disrespectful towards any individual or group of individuals, avoiding behavior which may cause or is capable of causing harm to others, avoiding activities which are illegal under the laws of the country, and avoiding lewd or extremely violent behavior.

However, Conversational Agents and computers are still in their infancy with respect to fully comprehending their responses or suggestions before communicating them to the end user. In this sense, their faculties of self-awareness and self-regulation are not yet fully mature. Most conversational interfaces today are either programmed to give some pre-defined responses or learn to respond based on the training given in the form of message-response pairs from previous historical conversations culled from sources like Twitter and online forums such as Quora, Yahoo answers etc. Due to this, they may sometimes utter, suggest, or respond back with messages which are ※inappropriate§ in a given context.

xiaoice Chat

HKSILICON.COM

AI researchers are looking for automatic techniques for detecting such ※inappropriate§ or ※toxic§ content so that it could be employed by machines for effective self-regulation. This technology could also be used for moderating discussions and comments in many online forums and news sites where certain issues can rapidly dissolve into inappropriate abuse and hate commentary.

Researchers have come up with an automatic technique to identify and prune inappropriate query suggestions offered by Search Engines (SE). Query Auto Completion (QAC) or Auto Suggest is a popular SE feature, wherein, based on the first few characters entered, the SE guesses the most probable query completions matching the user intent and automatically offers them as suggestions in real time while the user is still typing. However, while retrieving potential completions from search logs, SEs oftentimes inadvertently suggest query completions which are inappropriate.

For example, if I enter the prefix ※bollywood movies are ※ on a popular search engine, some of the suggestions which I get are ※bollywood movies are better than hollywood§, ※bollywood movies are rubbish§, ※bollywood movies are stupid§ and ※bollywood movies are so bad§ out of which the last three suggestions may offend Bollywood movie fans. In other circumstances, they have the potential to sometimes offer inappropriate suggestions related to certain activities which may be illegal or inappropriate suggestions like suicide or self-harm related intent or intent for buying or selling of banned drugs or substances etc. Any service offering such inappropriate suggestions may risk being seen as endorsing those views thereby tarnishing the brand image or worse may lead to legal complications. Thus, it is imperative for SEs to understand and regulate the search suggestions it offers.

xiaoice chat

Digital Trends

Understanding context is key

These problems fall broadly into the bucket that we will label as offensive or inappropriate content. This problem of detecting offensive queries is hard because search queries often contain spelling mistakes, asterisk characters in spellings, have loosely connected keywords without enough context, contain ambiguities of natural language and may also refer to other real world entities. The query ※lethal weapon suicide attempt§ may seem violent, offensive and hence ※inappropriate§ but is the name of a famous sound track in the pop album ※Lethal Weapon§. On the other hand, a query such as ※what to do when tweaking alone§, which appears like a clean query, the word ※tweaking§ has multiple meanings one of which also refers to ※the act of consuming an illegal drug§. Pattern based filtering techniques have severe limitations since they only work for the limited words defined in the rules and require constant manual intervention. So, a new strategy is required to automatically spot and filter such offensive query suggestions.

This technique being proposed by some researchers is based on a new field of computer science research known as Deep Learning (DL) 每 which aims to build machines that can process data and learn in the same way as our human brain does. DL essentially involves building artificial neural networks which are trained to mimic the behavior of the human brain. These networks can learn to represent and reason over the various inputs given to them such as words, images, sounds and so on. The figure below shows an illustration of an artificial neural network.

Teaching Machines

As shown in the illustration, these neural networks are composed of multiple layers, with input, output, and one or more hidden layers. These artificial neural networks can be trained to perform various tasks. Now for example, if a neural network is trained to understand a given image along with its various objects, the different hidden layers in the network tend to learn different aspects of the image. For instance, the first hidden layer of neurons may just identify edges in the image, with different angles of orientation and the next layer may learn to identify more complex objects using the previously learnt edges, such as detecting triangles, rectangles, and so on. Successive layers could build on previous layers to learn more sophisticated objects and features such as faces. The most interesting thing is 每 given training data, the model learns all of this on its own. In this case study researchers have proposed a novel architecture for training a network which effectively learns and models the semantic meaning of the given search query.

Similar to the way we train our human brain about a concept by showing labeled examples, this new artificial neural network method is trained using several thousand real-world web search queries which were labeled as acceptable or inappropriate. After the model was trained, the researchers presented the network with a new set of 4000 real-world search queries from Bing. Results showed that the model achieved an accuracy of 92%. This was significantly better than the performance of pattern based or other state-of-the-art machine learning based techniques. For example, the model identifies 每 ※a**monkey§ (a curse word in urban slang), ※shake and bake meth instructions§ (instructions on making meth 每 a short form for a banned drug methamphetamine) as inappropriate suggestions despite having spell mistakes, asterisk symbols and other short forms. It also identifies that the query ※marvin gaye if I die tonight download§ is a clean suggestion although it contains the words ※die tonight§. The new approach isn*t all perfect, of course. Overall, this is interesting work which provides some important directions for future research.

How Deep Learning Is Teaching Machines To Detect & Filter Inappropriate Responses Like A Boss

We are in the middle of an AI revolution where computers are promising to become the trusted lieutenants and adorable friends of humans. This true potential can be only realized if these bots, so to speak, become more aware of their actions and learn to restrain and regulate their automatic responses! An important motto to uphold - Thou Shalt Not Offend!

About the author: Manoj Kumar Chinnakotla is a Senior Applied Scientist, Artificial Intelligence and Research at Microsoft India

23/11/2024 4:35:6
seductrice.net
universo-virtual.com
buytrendz.net
thisforall.net
benchpressgains.com
qthzb.com
mindhunter9.com
dwjqp1.com
secure-signup.net
ahaayy.com
tressesindia.com
puresybian.com
krpano-chs.com
cre8workshop.com
hdkino.org
peixun021.com
qz786.com
utahperformingartscenter.org
worldqrmconference.com
shangyuwh.com
eejssdfsdfdfjsd.com
playminecraftfreeonline.com
trekvietnamtour.com
your-business-articles.com
essaywritingservice10.com
hindusamaaj.com
joggingvideo.com
wandercoups.com
wormblaster.net
tongchengchuyange0004.com
internetknowing.com
breachurch.com
peachesnginburlesque.com
dataarchitectoo.com
clientfunnelformula.com
30pps.com
cherylroll.com
ks2252.com
prowp.net
webmanicura.com
sofietsshotel.com
facetorch.com
nylawyerreview.com
apapromotions.com
shareparelli.com
goeaglepointe.com
thegreenmanpubphuket.com
karotorossian.com
publicsensor.com
taiwandefence.com
epcsur.com
mfhoudan.com
southstills.com
tvtv98.com
thewellington-hotel.com
bccaipiao.com
colectoresindustrialesgs.com
shenanddcg.com
capriartfilmfestival.com
replicabreitlingsale.com
thaiamarinnewtoncorner.com
gkmcww.com
mbnkbj.com
andrewbrennandesign.com
cod54.com
luobinzhang.com
faithfirst.net
zjyc28.com
tongchengjinyeyouyue0004.com
nhuan6.com
kftz5k.com
oldgardensflowers.com
lightupthefloor.com
bahamamamas-stjohns.com
ly2818.com
905onthebay.com
fonemenu.com
notanothermovie.com
ukrainehighclassescort.com
meincmagazine.com
av-5858.com
yallerdawg.com
donkeythemovie.com
corporatehospitalitygroup.com
boboyy88.com
miteinander-lernen.com
dannayconsulting.com
officialtomsshoesoutletstore.com
forsale-amoxil-amoxicillin.net
generictadalafil-canada.net
guitarlessonseastlondon.com
lesliesrestaurants.com
mattyno9.com
nri-homeloans.com
rtgvisas-qatar.com
salbutamolventolinonline.net
sportsinjuries.info
wedsna.com
rgkntk.com
bkkmarketplace.com
zxqcwx.com
breakupprogram.com
boxcardc.com
unblockyoutubeindonesia.com
fabulousbookmark.com
beat-the.com
guatemala-sailfishing-vacations-charters.com
magie-marketing.com
kingstonliteracy.com
guitaraffinity.com
eurelookinggoodapparel.com
howtolosecheekfat.net
marioncma.org
oliviadavismusic.com
shantelcampbellrealestate.com
shopleborn13.com
topindiafree.com
v-visitors.net
djjky.com
053hh.com
originbluei.com
baucishotel.com
33kkn.com
intrinsiqresearch.com
mariaescort-kiev.com
mymaguk.com
sponsored4u.com
crimsonclass.com
bataillenavale.com
searchtile.com
ze-stribrnych-struh.com
zenithalhype.com
modalpkv.com
bouisset-lafforgue.com
useupload.com
37r.net
autoankauf-muenster.com
bantinbongda.net
bilgius.com
brabustermagazine.com
indigrow.org
miicrosofts.net
mysmiletravel.com
selinasims.com
spellcubesapp.com
usa-faction.com
hypoallergenicdogsnames.com
dailyupdatez.com
foodphotographyreviews.com
cricutcom-setup.com
chprowebdesign.com
katyrealty-kanepa.com
tasramar.com
bilgipinari.org
four-am.com
indiarepublicday.com
inquick-enbooks.com
iracmpi.com
kakaschoenen.com
lsm99flash.com
nana1255.com
ngen-niagara.com
technwzs.com
virtualonlinecasino1345.com
wallpapertop.net
casino-natali.com
iprofit-internet.com
denochemexicana.com
eventhalfkg.com
medcon-taiwan.com
life-himawari.com
myriamshomes.com
nightmarevue.com
healthandfitnesslives.com
androidnews-jp.com
allstarsru.com
bestofthebuckeyestate.com
bestofthefirststate.com
bestwireless7.com
britsmile.com
declarationintermittent.com
findhereall.com
jingyou888.com
lsm99deal.com
lsm99galaxy.com
moozatech.com
nuagh.com
patliyo.com
philomenamagikz.net
rckouba.net
saturnunipessoallda.com
tallahasseefrolics.com
thematurehardcore.net
totalenvironment-inthatquietearth.com
velislavakaymakanova.com
vermontenergetic.com
kakakpintar.com
jerusalemdispatch.com
begorgeouslady.com
1800birks4u.com
2wheelstogo.com
6strip4you.com
bigdata-world.net
emailandco.net
gacapal.com
jharpost.com
krishnaastro.com
lsm99credit.com
mascalzonicampani.com
sitemapxml.org
thecityslums.net
topagh.com
flairnetwebdesign.com
rajasthancarservices.com
bangkaeair.com
beneventocoupon.com
noternet.org
oqtive.com
smilebrightrx.com
decollage-etiquette.com
1millionbestdownloads.com
7658.info
bidbass.com
devlopworldtech.com
digitalmarketingrajkot.com
fluginfo.net
naqlafshk.com
passion-decouverte.com
playsirius.com
spacceleratorintl.com
stikyballs.com
top10way.com
yokidsyogurt.com
zszyhl.com
16firthcrescent.com
abogadolaboralistamd.com
apk2wap.com
aromacremeria.com
banparacard.com
bosmanraws.com
businessproviderblog.com
caltonosa.com
calvaryrevivalchurch.org
chastenedsoulwithabrokenheart.com
cheminotsgardcevennes.com
cooksspot.com
cqxzpt.com
deesywig.com
deltacartoonmaps.com
despixelsetdeshommes.com
duocoracaobrasileiro.com
fareshopbd.com
goodpainspills.com
hemendekor.com
kobisitecdn.com
makaigoods.com
mgs1454.com
piccadillyresidences.com
radiolaondafresca.com
rubendorf.com
searchengineimprov.com
sellmyhrvahome.com
shugahouseessentials.com
sonihullquad.com
subtractkilos.com
valeriekelmansky.com
vipasdigitalmarketing.com
voolivrerj.com
worldhealthstory.com
zeelonggroup.com
1015southrockhill.com
10x10b.com
111-online-casinos.com
191cb.com
3665arpentunitd.com
aitesonics.com
bag-shokunin.com
brightotech.com
communication-digitale-services.com
covoakland.org
dariaprimapack.com
freefortniteaccountss.com
gatebizglobal.com
global1entertainmentnews.com
greatytene.com
hiroshiwakita.com
iktodaypk.com
jahatsakong.com
meadowbrookgolfgroup.com
newsbharati.net
platinumstudiosdesign.com
slotxogamesplay.com
strikestaruk.com
techguroh.com
trucosdefortnite.com
ufabetrune.com
weddedtowhitmore.com
12940brycecanyonunitb.com
1311dietrichoaks.com
2monarchtraceunit303.com
601legendhill.com
850elaine.com
adieusolasomade.com
andora-ke.com
bestslotxogames.com
cannagomcallen.com
endlesslyhot.com
iestpjva.com
ouqprint.com
pwmaplefest.com
qtylmr.com
rb88betting.com
buscadogues.com
1007macfm.com
born-wild.com
growthinvests.com
promocode-casino.com
proyectogalgoargentina.com
wbthompson-art.com
whitemountainwheels.com
7thavehvl.com
developmethis.com
funkydogbowties.com
travelodgegrandjunction.com
gao-town.com
globalmarketsuite.com
blogshippo.com
hdbka.com
proboards67.com
outletonline-michaelkors.com
kalkis-research.com
thuthuatit.net
buckcash.com
hollistercanada.com
docterror.com
asadart.com
vmayke.org
erwincomputers.com
dirimart.org
okkii.com
loteriasdecehegin.com
mountanalog.com
healingtaobritain.com
ttxmonitor.com
nwordpress.com
11bolabonanza.com