{"id":6111,"date":"2018-12-07T16:42:43","date_gmt":"2018-12-08T00:42:43","guid":{"rendered":"http:\/\/www.couchbase.com\/blog\/?p=6111"},"modified":"2025-06-13T18:44:57","modified_gmt":"2025-06-14T01:44:57","slug":"json-insights-analyze-usa-healthcare-data","status":"publish","type":"post","link":"https:\/\/www.couchbase.com\/blog\/json-insights-analyze-usa-healthcare-data\/","title":{"rendered":"JSON to Insights: Analyzing US healthcare Data."},"content":{"rendered":"<blockquote><p>\u201cNothing is certain except for death and taxes.\u201d<\/p><\/blockquote>\n<p><span style=\"font-weight: 400\">This isn\u2019t a dataset made with a <a href=\"https:\/\/www.couchbase.com\/blog\/json-to-insights-fast-and-easy\/\">bed of roses<\/a> or <a href=\"https:\/\/www.couchbase.com\/blog\/on-par-with-window-functions-in-n1ql\/\">manicured green grass<\/a>. A bit more serious. Let\u2019s see if we can quickly learn anything here.\u00a0<\/span><span style=\"font-weight: 400\">The dataset is the following.<\/span><\/p>\n<p><b>&#8220;name&#8221; : &#8220;NCHS &#8211; Leading Causes of Death: United States&#8221;,<\/b><b> <\/b><b><br \/>\n&#8220;attribution&#8221; : &#8220;National Center for Health Statistics&#8221;,<\/b><\/p>\n<p><span style=\"font-weight: 400\">The public d<\/span><span style=\"font-weight: 400\">ataset is available\u00a0at\u00a0<\/span><a href=\"https:\/\/data.cdc.gov\/api\/views\/bi63-dtpu\/rows.json?accessType=DOWNLOAD\"><span style=\"font-weight: 400\">https:\/\/data.cdc.gov\/api\/views\/bi63-dtpu\/rows.json?accessType=DOWNLOAD<\/span><\/a><\/p>\n<p><b>Step 1: <\/b>Download the file into a local file (e.g. health.json).\u00a0<span style=\"font-weight: 400\">Upload this file to one of the nodes in the Couchbase cluster.<\/span><\/p>\n<p><span style=\"font-weight: 400\"><strong>Step 2<\/strong>: import the data into a bucket called cause.\u00a0 After you create the bucket, create the primary index. You\u2019ll need this for querying.<\/span><\/p>\n<p><b>\/opt\/couchbase\/bin\/cbimport json -c couchbase:\/\/127.0.0.1 -u Administrator -p password -b cause -d file:\/\/health.json -g cause:0 -f sample<\/b><\/p>\n<p><strong>&gt;\u00a0<b>CREATE PRIMARY INDEX ON cause;<\/b><\/strong><\/p>\n<p><span style=\"font-weight: 400\"><strong>Step 3<\/strong>. Inspect the structure of the data.<\/span><\/p>\n<p>All of the data is provided in a SINGLE JSON document.\u00a0 Because of this, INFER doesn&#8217;t help.\u00a0 You&#8217;ll have to inspect and understand the structure manually.\u00a0<span style=\"font-weight: 400\">This data in typical government dataset with a lot of data in simple arrays with the meaning of each entity given at in the metadata.<\/span><\/p>\n<p><span style=\"font-weight: 400\">Simple array:<\/span><\/p>\n<pre><strong>select data from cause ;<\/strong><\/pre>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-6113 alignleft\" src=\"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/2018\/12\/Screen-Shot-2018-12-02-at-11.54.35-PM-300x200.png\" alt=\"\" width=\"644\" height=\"429\" srcset=\"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2018\/12\/Screen-Shot-2018-12-02-at-11.54.35-PM-300x200.png 300w, https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2018\/12\/Screen-Shot-2018-12-02-at-11.54.35-PM-768x513.png 768w, https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2018\/12\/Screen-Shot-2018-12-02-at-11.54.35-PM-400x267.png 400w, https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2018\/12\/Screen-Shot-2018-12-02-at-11.54.35-PM-450x300.png 450w, https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2018\/12\/Screen-Shot-2018-12-02-at-11.54.35-PM-20x13.png 20w, https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2018\/12\/Screen-Shot-2018-12-02-at-11.54.35-PM.png 970w\" sizes=\"auto, (max-width: 644px) 100vw, 644px\" \/><\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>This simply contains an array of data without the schema.\u00a0 For the public datasets, the schema is in the meta field.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-6114 alignleft\" src=\"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/2018\/12\/Screen-Shot-2018-12-02-at-11.58.52-PM-212x300.png\" alt=\"\" width=\"584\" height=\"827\" srcset=\"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2018\/12\/Screen-Shot-2018-12-02-at-11.58.52-PM-212x300.png 212w, https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2018\/12\/Screen-Shot-2018-12-02-at-11.58.52-PM-724x1024.png 724w, https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2018\/12\/Screen-Shot-2018-12-02-at-11.58.52-PM-768x1086.png 768w, https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2018\/12\/Screen-Shot-2018-12-02-at-11.58.52-PM-300x424.png 300w, https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2018\/12\/Screen-Shot-2018-12-02-at-11.58.52-PM-14x20.png 14w, https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2018\/12\/Screen-Shot-2018-12-02-at-11.58.52-PM.png 854w\" sizes=\"auto, (max-width: 584px) 100vw, 584px\" \/><\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p><span style=\"font-weight: 400\">Let\u2019s transform the structure into simple JSON key-value pairs so we can handle these bit more effectively.\u00a0 You can learn more about how this magic happened <a href=\"https:\/\/dzone.com\/articles\/json-files-whats-in-a-new-york-name-unlocking-data\">in this article<\/a>.<\/span><\/p>\n<pre class=\"tab-size:2 whitespace-before:2 whitespace-after:2 lang:default decode:true\">WITH cs AS (\r\n  SELECT\r\n    meta.`view`.columns [*].fieldName f,\r\n    data\r\n  FROM\r\n    cause\r\n)\r\nSELECT\r\n  o\r\nFROM\r\n  cs UNNEST cs.data AS d1 \r\nLET o = OBJECT p :d1 [ARRAY_POSITION(cs.f, p)] FOR p IN cs.f END;<\/pre>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-6115 alignleft\" src=\"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/2018\/12\/Screen-Shot-2018-12-04-at-4.48.23-PM-300x191.png\" alt=\"\" width=\"663\" height=\"422\" srcset=\"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2018\/12\/Screen-Shot-2018-12-04-at-4.48.23-PM-300x191.png 300w, https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2018\/12\/Screen-Shot-2018-12-04-at-4.48.23-PM-20x13.png 20w, https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2018\/12\/Screen-Shot-2018-12-04-at-4.48.23-PM.png 552w\" sizes=\"auto, (max-width: 663px) 100vw, 663px\" \/><\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<h5><strong>Task1: Find out the cause for most deaths in a state, by year.<\/strong><\/h5>\n<p>The common table expression (CTE) in the WITH clause (csdata) transforms the complex json data into flat JSON.\u00a0 \u00a0You can do this dynamically or do this once and INSERT back into a bucket, as I&#8217;ve discussed in the article on <a href=\"https:\/\/dzone.com\/articles\/json-files-whats-in-a-new-york-name-unlocking-data\">New York baby names<\/a>. In this article, I use CTEs.<\/p>\n<pre class=\"\">WITH csdata as (\r\n  WITH cs AS (\r\n    SELECT\r\n      meta.`view`.columns [*].fieldName f,\r\n      data\r\n    FROM\r\n      cause\r\n  )\r\n  SELECT\r\n    o\r\n  FROM\r\n    cs UNNEST cs.data AS d1 LET o = OBJECT p :d1 [ARRAY_POSITION(cs.f, p)] FOR p IN cs.f END\r\n)\r\nSELECT\r\n  c.o.state,\r\n  c.o.year,\r\n  c.o.cause_name,\r\n  COUNT(c.o.cause_name),\r\n  SUM(TONUMBER(c.o.deaths)) totdeaths\r\nFROM\r\n  csdata as c\r\nWHERE\r\n  c.o.state &lt;&gt; \"United States\"\r\n  and c.o.cause_name &lt;&gt; \"All causes\"\r\nGROUP BY\r\n  c.o.state,\r\n  c.o.year,\r\n  c.o.cause_name\r\nORDER BY\r\n  totdeaths DESC,\r\n  c.o.state,\r\n  c.o.year<\/pre>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-6117 alignleft\" src=\"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/2018\/12\/Screen-Shot-2018-12-04-at-11.03.44-PM-300x171.png\" alt=\"\" width=\"671\" height=\"382\" srcset=\"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2018\/12\/Screen-Shot-2018-12-04-at-11.03.44-PM-300x171.png 300w, https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2018\/12\/Screen-Shot-2018-12-04-at-11.03.44-PM-20x11.png 20w, https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2018\/12\/Screen-Shot-2018-12-04-at-11.03.44-PM.png 980w\" sizes=\"auto, (max-width: 671px) 100vw, 671px\" \/><\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>In this case, all the deaths in California\u00a0come on top, mainly due to its population.<\/p>\n<p><strong>Task 2. Find out leading causes of death in each state for the year 2016.<\/strong><\/p>\n<p><strong>Query 2: <\/strong>Use the resultset from the previous query and then use the FIRST_VALUE() window function to determine the top cause.\u00a0 Partitioning by state\u00a0(in the OVER BY clause) will give you the partitions by state and ORDER BY dx.totdeaths\u00a0within the OVER BY clause will give you the top cause in every state.<\/p>\n<pre class=\"\">WITH csdata as (\r\n  WITH cs AS (\r\n    SELECT\r\n      meta.`view`.columns [*].fieldName f,\r\n      data\r\n    FROM\r\n      cause\r\n  )\r\n  SELECT\r\n    o\r\n  FROM\r\n    cs UNNEST cs.data AS d1 LET o = OBJECT p :d1 [ARRAY_POSITION(cs.f, p)] FOR p IN cs.f END\r\n),\r\nd2 as(\r\nSELECT\r\n  c.o.state,\r\n  c.o.year,\r\n  c.o.cause_name,\r\n  SUM(TONUMBER(c.o.deaths)) totdeaths\r\nFROM\r\n  csdata as c\r\nWHERE\r\n  c.o.state &lt;&gt; \"United States\"\r\n  and c.o.cause_name &lt;&gt; \"All causes\"\r\n  and c.o.year = \"2016\"\r\nGROUP BY\r\n  c.o.state,\r\n  c.o.year,\r\n  c.o.cause_name),\r\nd3 as (\r\nSELECT dx.state, dx.cause_name, dx.totdeaths,\r\n  FIRST_VALUE(dx.cause_name) OVER(PARTITION BY dx.state ORDER BY dx.totdeaths DESC) topreason,\r\n  FIRST_VALUE(dx.totdeaths) OVER(PARTITION BY dx.state ORDER BY dx.totdeaths DESC) topcount\r\nFROM d2 dx)\r\nSELECT d3\r\nFROM d3\r\nWHERE d3.topcount = d3.totdeaths\r\norder by d3.state<\/pre>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-6124 alignleft\" src=\"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/2018\/12\/Screen-Shot-2018-12-06-at-11.43.21-PM-300x122.png\" alt=\"\" width=\"514\" height=\"209\" srcset=\"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2018\/12\/Screen-Shot-2018-12-06-at-11.43.21-PM-300x122.png 300w, https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2018\/12\/Screen-Shot-2018-12-06-at-11.43.21-PM-1024x417.png 1024w, https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2018\/12\/Screen-Shot-2018-12-06-at-11.43.21-PM-768x313.png 768w, https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2018\/12\/Screen-Shot-2018-12-06-at-11.43.21-PM-20x8.png 20w, https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2018\/12\/Screen-Shot-2018-12-06-at-11.43.21-PM.png 1232w\" sizes=\"auto, (max-width: 514px) 100vw, 514px\" \/><\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-6123 alignleft\" src=\"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/2018\/12\/Screen-Shot-2018-12-06-at-11.40.56-PM-300x94.png\" alt=\"\" width=\"516\" height=\"161\" srcset=\"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2018\/12\/Screen-Shot-2018-12-06-at-11.40.56-PM-300x94.png 300w, https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2018\/12\/Screen-Shot-2018-12-06-at-11.40.56-PM-1024x320.png 1024w, https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2018\/12\/Screen-Shot-2018-12-06-at-11.40.56-PM-768x240.png 768w, https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2018\/12\/Screen-Shot-2018-12-06-at-11.40.56-PM-20x6.png 20w, https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2018\/12\/Screen-Shot-2018-12-06-at-11.40.56-PM.png 1236w\" sizes=\"auto, (max-width: 516px) 100vw, 516px\" \/><\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p><strong>Task 3.\u00a0\u00a0<\/strong>Find out how the top reason has changed by the year, from 1999 to 2016 by state.<\/p>\n<p><strong>Query 3:\u00a0\u00a0<\/strong>Simply generate the report for all the years (199-2016) and then determine the top reason and finally get the highest reason by grouping by state, year and getting MAX(topcount) for the topreason cause.<\/p>\n<pre class=\"\">WITH csdata as (\r\n  WITH cs AS (\r\n    SELECT\r\n      meta.`view`.columns [*].fieldName f,\r\n      data\r\n    FROM\r\n      cause\r\n  )\r\n  SELECT\r\n    o\r\n  FROM\r\n    cs UNNEST cs.data AS d1 LET o = OBJECT p :d1 [ARRAY_POSITION(cs.f, p)] FOR p IN cs.f END\r\n),\r\nd2 as(\r\nSELECT\r\n  c.o.state,\r\n  c.o.year,\r\n  c.o.cause_name,\r\n  SUM(TONUMBER(c.o.deaths)) totdeaths\r\nFROM\r\n  csdata as c\r\nWHERE\r\n  c.o.state &lt;&gt; \"United States\" \r\n  and c.o.cause_name &lt;&gt; \"All causes\"\r\nGROUP BY\r\n  c.o.state,\r\n  c.o.year,\r\n  c.o.cause_name),\r\nd3 as (\r\nSELECT dx.state, dx.year,\r\n  FIRST_VALUE(dx.cause_name) OVER(PARTITION BY dx.state, dx.year ORDER BY dx.totdeaths DESC ) topreason,\r\n  FIRST_VALUE(dx.totdeaths) OVER(PARTITION BY dx.state, dx.year ORDER BY dx.totdeaths DESC) topcount\r\nFROM d2 dx)\r\nSELECT d3.state , d3.year , d3.topreason, max(d3.topcount) topcount\r\nFROM d3\r\nGROUP BY d3.state, d3.year, d3.topreason\r\norder by d3.state, d3.year<\/pre>\n<p>Here&#8217;s the partial result.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\" wp-image-6128 alignleft\" src=\"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/2018\/12\/Screen-Shot-2018-12-07-at-12.24.08-AM-300x183.png\" alt=\"\" width=\"452\" height=\"274\" srcset=\"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2018\/12\/Screen-Shot-2018-12-07-at-12.24.08-AM-300x183.png 300w, https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2018\/12\/Screen-Shot-2018-12-07-at-12.24.08-AM-20x12.png 20w\" sizes=\"auto, (max-width: 452px) 100vw, 452px\" \/><\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\" wp-image-6129 alignleft\" src=\"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/2018\/12\/Screen-Shot-2018-12-07-at-12.24.18-AM-300x146.png\" alt=\"\" width=\"454\" height=\"221\" srcset=\"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2018\/12\/Screen-Shot-2018-12-07-at-12.24.18-AM-300x146.png 300w, https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2018\/12\/Screen-Shot-2018-12-07-at-12.24.18-AM-768x374.png 768w, https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2018\/12\/Screen-Shot-2018-12-07-at-12.24.18-AM-20x10.png 20w, https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2018\/12\/Screen-Shot-2018-12-07-at-12.24.18-AM.png 826w\" sizes=\"auto, (max-width: 454px) 100vw, 454px\" \/><\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\" wp-image-6130 alignleft\" src=\"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/2018\/12\/Screen-Shot-2018-12-07-at-12.25.10-AM-300x141.png\" alt=\"\" width=\"456\" height=\"213\" srcset=\"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2018\/12\/Screen-Shot-2018-12-07-at-12.25.10-AM-300x141.png 300w, https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2018\/12\/Screen-Shot-2018-12-07-at-12.25.10-AM-768x360.png 768w, https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2018\/12\/Screen-Shot-2018-12-07-at-12.25.10-AM-20x9.png 20w, https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2018\/12\/Screen-Shot-2018-12-07-at-12.25.10-AM.png 828w\" sizes=\"auto, (max-width: 456px) 100vw, 456px\" \/><\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>Visualizing this gives us the following histogram.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\" wp-image-6131 alignnone\" src=\"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/2018\/12\/Screen-Shot-2018-12-07-at-12.34.54-AM-300x162.png\" alt=\"\" width=\"948\" height=\"512\" srcset=\"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2018\/12\/Screen-Shot-2018-12-07-at-12.34.54-AM-300x162.png 300w, https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2018\/12\/Screen-Shot-2018-12-07-at-12.34.54-AM-768x416.png 768w, https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2018\/12\/Screen-Shot-2018-12-07-at-12.34.54-AM-20x11.png 20w, https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2018\/12\/Screen-Shot-2018-12-07-at-12.34.54-AM.png 2048w\" sizes=\"auto, (max-width: 948px) 100vw, 948px\" \/><\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n","protected":false},"excerpt":{"rendered":"<p>\u201cNothing is certain except for death and taxes.\u201d This isn\u2019t a dataset made with a bed of roses or manicured green grass. A bit more serious. Let\u2019s see if we can quickly learn anything here.\u00a0The dataset is the following. &#8220;name&#8221; [&hellip;]<\/p>\n","protected":false},"author":55,"featured_media":7451,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"inline_featured_image":false,"footnotes":""},"categories":[1814,1815,1812],"tags":[2378,2318,2317,1261,1725],"ppma_author":[8929],"class_list":["post-6111","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-application-design","category-best-practices-and-tutorials","category-n1ql-query","tag-6-5","tag-healthcare","tag-insights","tag-json","tag-nosql-database"],"yoast_head":"<!-- This site is optimized with the Yoast SEO Premium plugin v27.3 (Yoast SEO v27.3) - https:\/\/yoast.com\/product\/yoast-seo-premium-wordpress\/ -->\n<title>JSON to Insights: Analyzing US healthcare Data. - The Couchbase Blog<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.couchbase.com\/blog\/json-insights-analyze-usa-healthcare-data\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"JSON to Insights: Analyzing US healthcare Data.\" \/>\n<meta property=\"og:description\" content=\"\u201cNothing is certain except for death and taxes.\u201d This isn\u2019t a dataset made with a bed of roses or manicured green grass. A bit more serious. Let\u2019s see if we can quickly learn anything here.\u00a0The dataset is the following. &#8220;name&#8221; [&hellip;]\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.couchbase.com\/blog\/json-insights-analyze-usa-healthcare-data\/\" \/>\n<meta property=\"og:site_name\" content=\"The Couchbase Blog\" \/>\n<meta property=\"article:published_time\" content=\"2018-12-08T00:42:43+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2025-06-14T01:44:57+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2019\/08\/Transactions-alternative-3.jpg\" \/>\n\t<meta property=\"og:image:width\" content=\"1400\" \/>\n\t<meta property=\"og:image:height\" content=\"553\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"author\" content=\"Keshav Murthy\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@rkeshavmurthy\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Keshav Murthy\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"5 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/json-insights-analyze-usa-healthcare-data\\\/#article\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/json-insights-analyze-usa-healthcare-data\\\/\"},\"author\":{\"name\":\"Keshav Murthy\",\"@id\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/#\\\/schema\\\/person\\\/c261644262bf98e146372fe647682636\"},\"headline\":\"JSON to Insights: Analyzing US healthcare Data.\",\"datePublished\":\"2018-12-08T00:42:43+00:00\",\"dateModified\":\"2025-06-14T01:44:57+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/json-insights-analyze-usa-healthcare-data\\\/\"},\"wordCount\":548,\"commentCount\":0,\"publisher\":{\"@id\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/#organization\"},\"image\":{\"@id\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/json-insights-analyze-usa-healthcare-data\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/wp-content\\\/uploads\\\/sites\\\/1\\\/2019\\\/08\\\/Transactions-alternative-3.jpg\",\"keywords\":[\"6.5\",\"healthcare\",\"insights\",\"JSON\",\"NoSQL Database\"],\"articleSection\":[\"Application Design\",\"Best Practices and Tutorials\",\"SQL++ \\\/ N1QL Query\"],\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/json-insights-analyze-usa-healthcare-data\\\/#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/json-insights-analyze-usa-healthcare-data\\\/\",\"url\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/json-insights-analyze-usa-healthcare-data\\\/\",\"name\":\"JSON to Insights: Analyzing US healthcare Data. - The Couchbase Blog\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/json-insights-analyze-usa-healthcare-data\\\/#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/json-insights-analyze-usa-healthcare-data\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/wp-content\\\/uploads\\\/sites\\\/1\\\/2019\\\/08\\\/Transactions-alternative-3.jpg\",\"datePublished\":\"2018-12-08T00:42:43+00:00\",\"dateModified\":\"2025-06-14T01:44:57+00:00\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/json-insights-analyze-usa-healthcare-data\\\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/json-insights-analyze-usa-healthcare-data\\\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/json-insights-analyze-usa-healthcare-data\\\/#primaryimage\",\"url\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/wp-content\\\/uploads\\\/sites\\\/1\\\/2019\\\/08\\\/Transactions-alternative-3.jpg\",\"contentUrl\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/wp-content\\\/uploads\\\/sites\\\/1\\\/2019\\\/08\\\/Transactions-alternative-3.jpg\",\"width\":1400,\"height\":553},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/json-insights-analyze-usa-healthcare-data\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"JSON to Insights: Analyzing US healthcare Data.\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/#website\",\"url\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/\",\"name\":\"The Couchbase Blog\",\"description\":\"Couchbase, the NoSQL Database\",\"publisher\":{\"@id\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/#organization\",\"name\":\"The Couchbase Blog\",\"url\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/#\\\/schema\\\/logo\\\/image\\\/\",\"url\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/wp-content\\\/uploads\\\/2023\\\/04\\\/admin-logo.png\",\"contentUrl\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/wp-content\\\/uploads\\\/2023\\\/04\\\/admin-logo.png\",\"width\":218,\"height\":34,\"caption\":\"The Couchbase Blog\"},\"image\":{\"@id\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/#\\\/schema\\\/logo\\\/image\\\/\"}},{\"@type\":\"Person\",\"@id\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/#\\\/schema\\\/person\\\/c261644262bf98e146372fe647682636\",\"name\":\"Keshav Murthy\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/af74df754db27152971d0aed2f323ead5a1f9fe5afd0209af91e12e784451224?s=96&d=mm&r=g4e51d72fc07c662aa791316deafffac4\",\"url\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/af74df754db27152971d0aed2f323ead5a1f9fe5afd0209af91e12e784451224?s=96&d=mm&r=g\",\"contentUrl\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/af74df754db27152971d0aed2f323ead5a1f9fe5afd0209af91e12e784451224?s=96&d=mm&r=g\",\"caption\":\"Keshav Murthy\"},\"description\":\"Keshav Murthy is a Vice President at Couchbase R&amp;D. Previously, he was at MapR, IBM, Informix, Sybase, with more than 20 years of experience in database design &amp; development. He lead the SQL and NoSQL R&amp;D team at IBM Informix. He has received two President's Club awards at Couchbase, two Outstanding Technical Achievement Awards at IBM. Keshav has a bachelor's degree in Computer Science and Engineering from the University of Mysore, India, and has received twenty four US patents.\",\"sameAs\":[\"https:\\\/\\\/blog.planetnosql.com\\\/\",\"https:\\\/\\\/x.com\\\/rkeshavmurthy\"],\"url\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/author\\\/keshav-murthy\\\/\"}]}<\/script>\n<!-- \/ Yoast SEO Premium plugin. -->","yoast_head_json":{"title":"JSON to Insights: Analyzing US healthcare Data. - The Couchbase Blog","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.couchbase.com\/blog\/json-insights-analyze-usa-healthcare-data\/","og_locale":"en_US","og_type":"article","og_title":"JSON to Insights: Analyzing US healthcare Data.","og_description":"\u201cNothing is certain except for death and taxes.\u201d This isn\u2019t a dataset made with a bed of roses or manicured green grass. A bit more serious. Let\u2019s see if we can quickly learn anything here.\u00a0The dataset is the following. &#8220;name&#8221; [&hellip;]","og_url":"https:\/\/www.couchbase.com\/blog\/json-insights-analyze-usa-healthcare-data\/","og_site_name":"The Couchbase Blog","article_published_time":"2018-12-08T00:42:43+00:00","article_modified_time":"2025-06-14T01:44:57+00:00","og_image":[{"width":1400,"height":553,"url":"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2019\/08\/Transactions-alternative-3.jpg","type":"image\/jpeg"}],"author":"Keshav Murthy","twitter_card":"summary_large_image","twitter_creator":"@rkeshavmurthy","twitter_misc":{"Written by":"Keshav Murthy","Est. reading time":"5 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/www.couchbase.com\/blog\/json-insights-analyze-usa-healthcare-data\/#article","isPartOf":{"@id":"https:\/\/www.couchbase.com\/blog\/json-insights-analyze-usa-healthcare-data\/"},"author":{"name":"Keshav Murthy","@id":"https:\/\/www.couchbase.com\/blog\/#\/schema\/person\/c261644262bf98e146372fe647682636"},"headline":"JSON to Insights: Analyzing US healthcare Data.","datePublished":"2018-12-08T00:42:43+00:00","dateModified":"2025-06-14T01:44:57+00:00","mainEntityOfPage":{"@id":"https:\/\/www.couchbase.com\/blog\/json-insights-analyze-usa-healthcare-data\/"},"wordCount":548,"commentCount":0,"publisher":{"@id":"https:\/\/www.couchbase.com\/blog\/#organization"},"image":{"@id":"https:\/\/www.couchbase.com\/blog\/json-insights-analyze-usa-healthcare-data\/#primaryimage"},"thumbnailUrl":"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2019\/08\/Transactions-alternative-3.jpg","keywords":["6.5","healthcare","insights","JSON","NoSQL Database"],"articleSection":["Application Design","Best Practices and Tutorials","SQL++ \/ N1QL Query"],"inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/www.couchbase.com\/blog\/json-insights-analyze-usa-healthcare-data\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/www.couchbase.com\/blog\/json-insights-analyze-usa-healthcare-data\/","url":"https:\/\/www.couchbase.com\/blog\/json-insights-analyze-usa-healthcare-data\/","name":"JSON to Insights: Analyzing US healthcare Data. - The Couchbase Blog","isPartOf":{"@id":"https:\/\/www.couchbase.com\/blog\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.couchbase.com\/blog\/json-insights-analyze-usa-healthcare-data\/#primaryimage"},"image":{"@id":"https:\/\/www.couchbase.com\/blog\/json-insights-analyze-usa-healthcare-data\/#primaryimage"},"thumbnailUrl":"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2019\/08\/Transactions-alternative-3.jpg","datePublished":"2018-12-08T00:42:43+00:00","dateModified":"2025-06-14T01:44:57+00:00","breadcrumb":{"@id":"https:\/\/www.couchbase.com\/blog\/json-insights-analyze-usa-healthcare-data\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.couchbase.com\/blog\/json-insights-analyze-usa-healthcare-data\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.couchbase.com\/blog\/json-insights-analyze-usa-healthcare-data\/#primaryimage","url":"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2019\/08\/Transactions-alternative-3.jpg","contentUrl":"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2019\/08\/Transactions-alternative-3.jpg","width":1400,"height":553},{"@type":"BreadcrumbList","@id":"https:\/\/www.couchbase.com\/blog\/json-insights-analyze-usa-healthcare-data\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.couchbase.com\/blog\/"},{"@type":"ListItem","position":2,"name":"JSON to Insights: Analyzing US healthcare Data."}]},{"@type":"WebSite","@id":"https:\/\/www.couchbase.com\/blog\/#website","url":"https:\/\/www.couchbase.com\/blog\/","name":"The Couchbase Blog","description":"Couchbase, the NoSQL Database","publisher":{"@id":"https:\/\/www.couchbase.com\/blog\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.couchbase.com\/blog\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/www.couchbase.com\/blog\/#organization","name":"The Couchbase Blog","url":"https:\/\/www.couchbase.com\/blog\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.couchbase.com\/blog\/#\/schema\/logo\/image\/","url":"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/2023\/04\/admin-logo.png","contentUrl":"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/2023\/04\/admin-logo.png","width":218,"height":34,"caption":"The Couchbase Blog"},"image":{"@id":"https:\/\/www.couchbase.com\/blog\/#\/schema\/logo\/image\/"}},{"@type":"Person","@id":"https:\/\/www.couchbase.com\/blog\/#\/schema\/person\/c261644262bf98e146372fe647682636","name":"Keshav Murthy","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/secure.gravatar.com\/avatar\/af74df754db27152971d0aed2f323ead5a1f9fe5afd0209af91e12e784451224?s=96&d=mm&r=g4e51d72fc07c662aa791316deafffac4","url":"https:\/\/secure.gravatar.com\/avatar\/af74df754db27152971d0aed2f323ead5a1f9fe5afd0209af91e12e784451224?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/af74df754db27152971d0aed2f323ead5a1f9fe5afd0209af91e12e784451224?s=96&d=mm&r=g","caption":"Keshav Murthy"},"description":"Keshav Murthy is a Vice President at Couchbase R&amp;D. Previously, he was at MapR, IBM, Informix, Sybase, with more than 20 years of experience in database design &amp; development. He lead the SQL and NoSQL R&amp;D team at IBM Informix. He has received two President's Club awards at Couchbase, two Outstanding Technical Achievement Awards at IBM. Keshav has a bachelor's degree in Computer Science and Engineering from the University of Mysore, India, and has received twenty four US patents.","sameAs":["https:\/\/blog.planetnosql.com\/","https:\/\/x.com\/rkeshavmurthy"],"url":"https:\/\/www.couchbase.com\/blog\/author\/keshav-murthy\/"}]}},"acf":[],"authors":[{"term_id":8929,"user_id":55,"is_guest":0,"slug":"keshav-murthy","display_name":"Keshav Murthy","avatar_url":"https:\/\/secure.gravatar.com\/avatar\/af74df754db27152971d0aed2f323ead5a1f9fe5afd0209af91e12e784451224?s=96&d=mm&r=g","0":null,"1":"","2":"","3":"","4":"","5":"","6":"","7":"","8":""}],"_links":{"self":[{"href":"https:\/\/www.couchbase.com\/blog\/wp-json\/wp\/v2\/posts\/6111","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.couchbase.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.couchbase.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.couchbase.com\/blog\/wp-json\/wp\/v2\/users\/55"}],"replies":[{"embeddable":true,"href":"https:\/\/www.couchbase.com\/blog\/wp-json\/wp\/v2\/comments?post=6111"}],"version-history":[{"count":0,"href":"https:\/\/www.couchbase.com\/blog\/wp-json\/wp\/v2\/posts\/6111\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.couchbase.com\/blog\/wp-json\/wp\/v2\/media\/7451"}],"wp:attachment":[{"href":"https:\/\/www.couchbase.com\/blog\/wp-json\/wp\/v2\/media?parent=6111"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.couchbase.com\/blog\/wp-json\/wp\/v2\/categories?post=6111"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.couchbase.com\/blog\/wp-json\/wp\/v2\/tags?post=6111"},{"taxonomy":"author","embeddable":true,"href":"https:\/\/www.couchbase.com\/blog\/wp-json\/wp\/v2\/ppma_author?post=6111"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}