{"id":395,"date":"2018-07-23T12:47:42","date_gmt":"2018-07-23T10:47:42","guid":{"rendered":"https:\/\/craftcoders.app\/?p=395"},"modified":"2024-08-14T13:37:58","modified_gmt":"2024-08-14T11:37:58","slug":"rasa-core-nlu-conversational-ai-for-dummies","status":"publish","type":"post","link":"https:\/\/craftcoders.app\/rasa-core-nlu-conversational-ai-for-dummies\/","title":{"rendered":"Rasa Core &#038; NLU: Conversational AI for dummies"},"content":{"rendered":"<p>AI is a sought-after topic, but most developers face two hurdles that prevent them from programming anything with it.<\/p>\n<ol>\n<li>It is a complex field in which a lot of experience is needed to achieve good results.<\/li>\n<li>Even when good network topologies and models exist for a problem, there is often a lack of training data (corpora), without which most neural networks cannot achieve good results.<\/li>\n<\/ol>\n<p>Especially in the up-and-coming natural language processing (NLP) sector, there is a lack of data in many areas. In this blog post we are going to discuss a simple yet powerful solution to this problem in the context of conversational AI.<\/p>\n<p>Leon presented a simple solution on our blog a few weeks ago: With <a href=\"https:\/\/craftcoders.app\/ai-as-a-service\">AI as a Service<\/a>, reliable language processing systems can be developed in a short time without having to wrestle with datasets and neural networks. However, this type of technology has one significant drawback: dependence on the operator of the service. For one thing, the service may cost money; for another, your own, possibly sensitive, data has to be handed over to the service operator. Especially for companies this is usually a show stopper. 
That&#8217;s where Rasa enters the stage.<\/p>\n<h2>The Rasa Stack<\/h2>\n<p><a href=\"https:\/\/rasa.com\/\">Rasa<\/a> is an open source (<a href=\"https:\/\/github.com\/RasaHQ\">see GitHub<\/a>) conversational AI that is completely free for everyone and can be run in-house. There is no dependence on a service from Rasa or any other company. It consists of a two-part stack whose parts seem to perform similar tasks at first glance, but on closer inspection each solves its own problem. <strong><a href=\"https:\/\/nlu.rasa.com\/\">Rasa NLU<\/a><\/strong> is the language understanding AI we are going to dig deeper into soon. It is used to understand what the user is trying to say and which additional information they provide. <strong><a href=\"https:\/\/core.rasa.com\/\">Rasa Core<\/a><\/strong> is the context-aware AI for conversational flow, which is used to build dialog systems, e.g. chatbots like <a href=\"https:\/\/craftcoders.app\/getting-started-with-the-telegram-abilitybot\">this<\/a>. It uses the information from Rasa NLU to find out what the user wants and what other information is needed to achieve it. For example, for a weather report you need both the date and the place.<\/p>\n<h2>Digging deeper into Rasa NLU<\/h2>\n<p>The following paragraphs deal with the development of language understanding. Its basics are already extensively documented, which is why I will keep that part brief and focus on the optimization options instead. If you have never coded anything using Rasa, it makes sense to work through the <a href=\"https:\/\/nlu.rasa.com\/tutorial.html\">restaurant example<\/a> (see also <a href=\"https:\/\/github.com\/RasaHQ\/rasa_core\/tree\/master\/examples\">GitHub code template<\/a>) to get a basic understanding of the framework.<\/p>\n<p>The processing pipeline is the core element of Rasa NLU. The decisions you make there have a huge influence on the system&#8217;s quality. 
In the restaurant example the pipeline is already given: two libraries, spaCy and scikit-learn, are used for text processing. Good results can be achieved with very little domain-specific training data (10 &#8211; 20 formulations per intent). You can easily create this amount of data using <a href=\"https:\/\/nlu.rasa.com\/tutorial.html#visualizing-the-training-data\">Rasa Trainer<\/a>. So little data is enough because of transfer learning: the classifier is trained on your own examples on top of spaCy&#8217;s pre-trained, high-quality language models. Besides spaCy, there are other ways to process your data, which we will discover now!<\/p>\n<h2>Unlock the full potential<\/h2>\n<p>Instead of spaCy you can also use <a href=\"https:\/\/github.com\/mit-nlp\/MITIE\">MIT Information Extraction<\/a> (MITIE). MITIE can also be used for intent recognition and named entity recognition (NER). Both backends perform the same tasks and are therefore interchangeable. The difference lies in the algorithms and models they use. You are also not bound to spaCy or MITIE alone: you can use <a href=\"http:\/\/scikit-learn.org\/stable\/\">scikit-learn<\/a> for intent classification as well.<\/p>\n<p>Which backend works best depends on your project and should be tested. As you will see in the next section, the pipeline offers a few gems that work particularly well. The built-in <a href=\"https:\/\/nlu.rasa.com\/evaluation.html\">cross validation<\/a> should be used to evaluate the quality of the system.<\/p>\n<h3>The processing pipeline<\/h3>\n<p>To develop a good system for your specific problem, you should understand how the <a href=\"https:\/\/nlu.rasa.com\/pipeline.html\">pipeline<\/a> works.<\/p>\n<ol>\n<li>The <strong>tokenizer<\/strong> is used to split input words, sentences or paragraphs into single word tokens. In the process, unnecessary punctuation is removed, and stop words can be removed as well.<\/li>\n<li>The <strong>featurizer<\/strong> is used to create input vectors from the tokens. 
They serve as the features the model is trained on. The simplest form of an input vector is <a href=\"https:\/\/towardsdatascience.com\/word-to-vectors-natural-language-processing-b253dd0b0817\">one-hot<\/a>.<\/li>\n<li>The <strong>intent classifier<\/strong> is the part of the model that is responsible for decision making. It decides which intent the user most likely means. This is called multiclass classification.<\/li>\n<li>Finally, <strong>named entity recognition<\/strong> can be used to extract information such as e-mail addresses from a text. In terms of Rasa (and dialogue systems) this is called entity extraction.<\/li>\n<\/ol>\n<p>In the following example (from Rasa) you can see how the individual parts work together to provide information about intent and entity:<\/p>\n<pre><code class=\"json\">{\r\n    \"text\": \"I am looking for Chinese food\",\r\n    \"entities\": [\r\n        {\"start\": 8, \"end\": 15, \"value\": \"chinese\", \"entity\": \"cuisine\", \"extractor\": \"ner_crf\", \"confidence\": 0.864}\r\n    ],\r\n    \"intent\": {\"confidence\": 0.6485910906220309, \"name\": \"restaurant_search\"},\r\n    \"intent_ranking\": [\r\n        {\"confidence\": 0.6485910906220309, \"name\": \"restaurant_search\"},\r\n        {\"confidence\": 0.1416153159565678, \"name\": \"affirm\"}\r\n    ]\r\n}\r\n<\/code><\/pre>\n<p>As <a href=\"https:\/\/blog.rasa.com\/supervised-word-vectors-from-scratch-in-rasa-nlu\/\">Rasa itself<\/a> points out, <em>intent_classifier_tensorflow_embedding<\/em> can be used for intent classification. It is based on the <a href=\"https:\/\/arxiv.org\/abs\/1709.03856\">StarSpace: Embed All The Things!<\/a> paper published by Facebook Research. The paper presents a new approach to learning semantic similarity, which generates awesome results!<\/p>\n<p>For named entity recognition you have to make a decision: Either you use common pre-trained entities, or you use custom entities like &#8220;type_of_coffee&#8221;. 
The pre-trained extractors recognize the following entity types:<\/p>\n<ul>\n<li><strong>ner_spacy:<\/strong> Places, Dates, People, Organisations<\/li>\n<li><strong>ner_duckling:<\/strong> Dates, Amounts of Money, Durations, Distances, Ordinals<\/li>\n<\/ul>\n<p>These two extractors perform very well at recognizing the given types, but if you need custom entities they perform rather poorly. Instead, you should use <strong>ner_mitie<\/strong> or <strong>ner_crf<\/strong> and collect somewhat more training data than usual. If your entities have a specific structure that can be parsed by a regex, make sure to integrate <em>intent_entity_featurizer_regex<\/em> into your pipeline! In this <a href=\"https:\/\/gist.github.com\/djuelg\/44271df5942b1472a14dd06f16175ae6\">GitHub Gist<\/a> I provide a short script that helps you create training samples for a custom entity. You just pass it some sentences for an intent together with sample values of your custom entity. It will then create some training samples for each of your sample values.<\/p>\n<p>That&#8217;s it \ud83d\ude42 If you have any questions about Rasa or this blog post, don&#8217;t hesitate to contact me! Have a nice week and stay tuned for our next post.<\/p>\n<p><strong>Greets,<br \/>\nDomi<\/strong><\/p>\n","protected":false},"excerpt":{"rendered":"<p>AI is a sought-after topic, but most developers face two hurdles that prevent them from programming anything with it. 
It is a complex field in which a lot of experience is needed to achieve good results Although there are good network topologies and models for a problem, there is often a lack of training data (corpora) without which most neural &#8230; <a href=\"https:\/\/craftcoders.app\/rasa-core-nlu-conversational-ai-for-dummies\/\" class=\"more-link\">Read More<\/a><\/p>\n","protected":false},"author":3,"featured_media":2358,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"inline_featured_image":false,"footnotes":""},"categories":[107,112,117,109],"tags":[],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v22.7 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Rasa Core &amp; NLU: Conversational AI for dummies - CraftCoders.app<\/title>\n<meta name=\"description\" content=\"With this blogpost we are going to discuss a simple yet powerful solution to address problems in terms of conversational AI using the Rasa Stack.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/craftcoders.app\/rasa-core-nlu-conversational-ai-for-dummies\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Rasa Core &amp; NLU: Conversational AI for dummies - CraftCoders.app\" \/>\n<meta property=\"og:description\" content=\"With this blogpost we are going to discuss a simple yet powerful solution to address problems in terms of conversational AI using the Rasa Stack.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/craftcoders.app\/rasa-core-nlu-conversational-ai-for-dummies\/\" \/>\n<meta property=\"og:site_name\" content=\"CraftCoders.app\" \/>\n<meta property=\"article:published_time\" content=\"2018-07-23T10:47:42+00:00\" \/>\n<meta property=\"article:modified_time\" 
content=\"2024-08-14T11:37:58+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/craftcoders.app\/wp-content\/uploads\/2018\/07\/Bildschirmfoto-2019-02-03-um-17.40.43-768x639-1.png\" \/>\n\t<meta property=\"og:image:width\" content=\"768\" \/>\n\t<meta property=\"og:image:height\" content=\"639\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/png\" \/>\n<meta name=\"author\" content=\"Dominik J\u00fclg\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Dominik J\u00fclg\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"5 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/craftcoders.app\/rasa-core-nlu-conversational-ai-for-dummies\/\",\"url\":\"https:\/\/craftcoders.app\/rasa-core-nlu-conversational-ai-for-dummies\/\",\"name\":\"Rasa Core & NLU: Conversational AI for dummies - CraftCoders.app\",\"isPartOf\":{\"@id\":\"https:\/\/craftcoders.app\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/craftcoders.app\/rasa-core-nlu-conversational-ai-for-dummies\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/craftcoders.app\/rasa-core-nlu-conversational-ai-for-dummies\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/craftcoders.app\/wp-content\/uploads\/2018\/07\/Bildschirmfoto-2019-02-03-um-17.40.43-768x639-1.png\",\"datePublished\":\"2018-07-23T10:47:42+00:00\",\"dateModified\":\"2024-08-14T11:37:58+00:00\",\"author\":{\"@id\":\"https:\/\/craftcoders.app\/#\/schema\/person\/950725b140a7ce1147b1308a746a8cce\"},\"description\":\"With this blogpost we are going to discuss a simple yet powerful solution to address problems in terms of conversational AI using the Rasa 
Stack.\",\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/craftcoders.app\/rasa-core-nlu-conversational-ai-for-dummies\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/craftcoders.app\/rasa-core-nlu-conversational-ai-for-dummies\/#primaryimage\",\"url\":\"https:\/\/craftcoders.app\/wp-content\/uploads\/2018\/07\/Bildschirmfoto-2019-02-03-um-17.40.43-768x639-1.png\",\"contentUrl\":\"https:\/\/craftcoders.app\/wp-content\/uploads\/2018\/07\/Bildschirmfoto-2019-02-03-um-17.40.43-768x639-1.png\",\"width\":768,\"height\":639},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/craftcoders.app\/#website\",\"url\":\"https:\/\/craftcoders.app\/\",\"name\":\"CraftCoders.app\",\"description\":\"Jira and Confluence apps\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/craftcoders.app\/?s={search_term_string}\"},\"query-input\":\"required name=search_term_string\"}],\"inLanguage\":\"en-US\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/craftcoders.app\/#\/schema\/person\/950725b140a7ce1147b1308a746a8cce\",\"name\":\"Dominik J\u00fclg\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/craftcoders.app\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/bd18a95cdfc659ad254c8a3bd7f70cc5?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/bd18a95cdfc659ad254c8a3bd7f70cc5?s=96&d=mm&r=g\",\"caption\":\"Dominik J\u00fclg\"},\"url\":\"https:\/\/craftcoders.app\/author\/domi\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. 
-->","yoast_head_json":{"title":"Rasa Core & NLU: Conversational AI for dummies - CraftCoders.app","description":"With this blogpost we are going to discuss a simple yet powerful solution to address problems in terms of conversational AI using the Rasa Stack.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/craftcoders.app\/rasa-core-nlu-conversational-ai-for-dummies\/","og_locale":"en_US","og_type":"article","og_title":"Rasa Core & NLU: Conversational AI for dummies - CraftCoders.app","og_description":"With this blogpost we are going to discuss a simple yet powerful solution to address problems in terms of conversational AI using the Rasa Stack.","og_url":"https:\/\/craftcoders.app\/rasa-core-nlu-conversational-ai-for-dummies\/","og_site_name":"CraftCoders.app","article_published_time":"2018-07-23T10:47:42+00:00","article_modified_time":"2024-08-14T11:37:58+00:00","og_image":[{"width":768,"height":639,"url":"https:\/\/craftcoders.app\/wp-content\/uploads\/2018\/07\/Bildschirmfoto-2019-02-03-um-17.40.43-768x639-1.png","type":"image\/png"}],"author":"Dominik J\u00fclg","twitter_card":"summary_large_image","twitter_misc":{"Written by":"Dominik J\u00fclg","Est. 
reading time":"5 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/craftcoders.app\/rasa-core-nlu-conversational-ai-for-dummies\/","url":"https:\/\/craftcoders.app\/rasa-core-nlu-conversational-ai-for-dummies\/","name":"Rasa Core & NLU: Conversational AI for dummies - CraftCoders.app","isPartOf":{"@id":"https:\/\/craftcoders.app\/#website"},"primaryImageOfPage":{"@id":"https:\/\/craftcoders.app\/rasa-core-nlu-conversational-ai-for-dummies\/#primaryimage"},"image":{"@id":"https:\/\/craftcoders.app\/rasa-core-nlu-conversational-ai-for-dummies\/#primaryimage"},"thumbnailUrl":"https:\/\/craftcoders.app\/wp-content\/uploads\/2018\/07\/Bildschirmfoto-2019-02-03-um-17.40.43-768x639-1.png","datePublished":"2018-07-23T10:47:42+00:00","dateModified":"2024-08-14T11:37:58+00:00","author":{"@id":"https:\/\/craftcoders.app\/#\/schema\/person\/950725b140a7ce1147b1308a746a8cce"},"description":"With this blogpost we are going to discuss a simple yet powerful solution to address problems in terms of conversational AI using the Rasa Stack.","inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/craftcoders.app\/rasa-core-nlu-conversational-ai-for-dummies\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/craftcoders.app\/rasa-core-nlu-conversational-ai-for-dummies\/#primaryimage","url":"https:\/\/craftcoders.app\/wp-content\/uploads\/2018\/07\/Bildschirmfoto-2019-02-03-um-17.40.43-768x639-1.png","contentUrl":"https:\/\/craftcoders.app\/wp-content\/uploads\/2018\/07\/Bildschirmfoto-2019-02-03-um-17.40.43-768x639-1.png","width":768,"height":639},{"@type":"WebSite","@id":"https:\/\/craftcoders.app\/#website","url":"https:\/\/craftcoders.app\/","name":"CraftCoders.app","description":"Jira and Confluence apps","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/craftcoders.app\/?s={search_term_string}"},"query-input":"required 
name=search_term_string"}],"inLanguage":"en-US"},{"@type":"Person","@id":"https:\/\/craftcoders.app\/#\/schema\/person\/950725b140a7ce1147b1308a746a8cce","name":"Dominik J\u00fclg","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/craftcoders.app\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/bd18a95cdfc659ad254c8a3bd7f70cc5?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/bd18a95cdfc659ad254c8a3bd7f70cc5?s=96&d=mm&r=g","caption":"Dominik J\u00fclg"},"url":"https:\/\/craftcoders.app\/author\/domi\/"}]}},"_links":{"self":[{"href":"https:\/\/craftcoders.app\/wp-json\/wp\/v2\/posts\/395"}],"collection":[{"href":"https:\/\/craftcoders.app\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/craftcoders.app\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/craftcoders.app\/wp-json\/wp\/v2\/users\/3"}],"replies":[{"embeddable":true,"href":"https:\/\/craftcoders.app\/wp-json\/wp\/v2\/comments?post=395"}],"version-history":[{"count":1,"href":"https:\/\/craftcoders.app\/wp-json\/wp\/v2\/posts\/395\/revisions"}],"predecessor-version":[{"id":2359,"href":"https:\/\/craftcoders.app\/wp-json\/wp\/v2\/posts\/395\/revisions\/2359"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/craftcoders.app\/wp-json\/wp\/v2\/media\/2358"}],"wp:attachment":[{"href":"https:\/\/craftcoders.app\/wp-json\/wp\/v2\/media?parent=395"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/craftcoders.app\/wp-json\/wp\/v2\/categories?post=395"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/craftcoders.app\/wp-json\/wp\/v2\/tags?post=395"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}