{"id":5266,"date":"2021-05-26T18:03:29","date_gmt":"2021-05-26T23:03:29","guid":{"rendered":"https:\/\/beta.pewresearch.org\/pewresearch-org\/data-labs\/"},"modified":"2026-05-06T13:50:10","modified_gmt":"2026-05-06T17:50:10","slug":"data-labs","status":"publish","type":"page","link":"https:\/\/beta.pewresearch.org\/pewresearch-org\/data-labs\/","title":{"rendered":"Data Labs"},"content":{"rendered":"<div style=\"--grid-gutter: 3.5rem;--grid-row-gap: 3.5rem;--divider-color: var(--wp--preset--color--ui-gray-light)\" class=\"has-divider has-ui-gray-light-divider-color is-vertically-aligned-top wp-block-prc-block-grid-controller\">\n<div style=\"--desktop-span:8;--tablet-span:8;--mobile-span:4\" class=\"is-vertically-aligned-top wp-block-prc-block-grid-column is-layout-flow wp-block-prc-block-grid-column-is-layout-flow\" data-desktop-span=\"8\" data-tablet-span=\"8\" data-mobile-span=\"4\">\n\n<p class=\"wp-block-paragraph\">Pew Research Center\u2019s&nbsp;Data Labs&nbsp;team uses computational methods to complement and expand on the Center\u2019s existing research agenda. The team collects text, audiovisual and behavioral datasets; uses innovative computational techniques and empirical strategies for analysis; and generates original research. Data Labs also explores the limitations of these data and methods and works toward establishing standards for use and analysis.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">The Data Labs project both produces its own reports and collaborates with other research groups at the Center, applying new computational approaches to existing research questions. Past research has explored&nbsp;<a href=\"https:\/\/beta.pewresearch.org\/pewresearch-org\/politics\/2018\/07\/18\/taking-sides-on-facebook-how-congressional-outreach-changed-under-president-trump\">congressional communication<\/a>, looked at the ways Americans use&nbsp;<a href=\"https:\/\/beta.pewresearch.org\/pewresearch-org\/politics\/2019\/10\/23\/national-politics-on-twitter-small-share-of-u-s-adults-produce-majority-of-tweets\/\">social media<\/a>, and analyzed everything from&nbsp;<a href=\"https:\/\/beta.pewresearch.org\/pewresearch-org\/internet\/2019\/07\/25\/a-week-in-the-life-of-popular-youtube-channels\/\">videos<\/a> and&nbsp;<a href=\"https:\/\/beta.pewresearch.org\/pewresearch-org\/journalism\/2019\/05\/23\/men-appear-twice-as-often-as-women-in-news-photos-on-facebook\/\">images<\/a>&nbsp;to&nbsp;<a href=\"https:\/\/beta.pewresearch.org\/pewresearch-org\/social-trends\/2018\/12\/17\/gender-and-jobs-in-online-image-searches\/\">algorithmic bias<\/a>&nbsp;and&nbsp;<a href=\"https:\/\/beta.pewresearch.org\/pewresearch-org\/religion\/2019\/12\/16\/the-digital-pulpit-a-nationwide-analysis-of-online-sermons\/\">religious rhetoric<\/a>. The Data Labs team also writes about the process of computational social science research on&nbsp;<a href=\"http:\/\/pewresearch.org\/pewresearch-org\/decoded\">Decoded<\/a>, the Center\u2019s behind-the-scenes blog about research methods.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">In addition, Data Labs manages the Center\u2019s computing infrastructure. That includes building high-performance computing systems and databases that facilitate web data collection and processing; deploying platforms that facilitate collaborative, replicable analysis in R and Python; and developing systems to automate research tasks such as content classification for machine learning.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">As is true for Pew Research Center as a whole, Data Labs is nonpartisan and nonadvocacy. The team values independence, objectivity, accuracy, rigor, humility, transparency and innovation.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">[<a href=\"https:\/\/beta.pewresearch.org\/pewresearch-org\/publications\/?_research_teams=data-labs\">View the latest research from Data Labs<\/a>]<\/p>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"why-did-pew-research-center-create-data-labs\"><strong>Why did Pew Research Center create Data Labs?<\/strong><\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">Data Labs was created as a response to the changing nature of data on human behaviors and attitudes. The public is expressing views online and leaving behind electronic trails of behavior in unprecedented ways. We can now learn about whom people connect with on social networks, what they search for, and what content they post. At the same time, institutions and groups are using the internet to convey information to diverse audiences, inviting researchers to observe what they post and how people react.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">While some of these digital traces of communication and behavior are unstructured and not amenable to analysis in raw form, a number of new technologies are making it easier to collect and process these data. These technologies include:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Internet data collection:<\/strong> This includes harvesting web page content and parsing out fields (e.g., dates, names, links and tables) for analysis as well as querying APIs online to obtain formatted data.<\/li>\n\n\n\n<li><strong>Natural language processing<\/strong> <strong>(NLP):<\/strong> This includes processing text to measure concepts and extract patterns.<\/li>\n\n\n\n<li><strong>Machine vision:<\/strong> This refers to analyzing images using computational models that estimate what the images depict.<\/li>\n\n\n\n<li><strong>Online distributed labor platforms:<\/strong> These platforms allow major data collection efforts to be divided into a series of small tasks that can then be completed by external individuals. This is sometimes referred to as \u201ccrowdsourcing.\u201d<\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\">Data Labs is a testing ground for these data sources and the different approaches to analyzing them, with the goal of extracting meaning from the data through creative design, innovative methods, thoughtful measurement and sound deployment.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">The Data Labs team also employs methodologies honed across the Center, such as&nbsp;<a href=\"https:\/\/www.pewinternet.org\/2019\/07\/25\/a-week-in-the-life-of-popular-youtube-channels\/\">content analysis<\/a>,&nbsp;<a href=\"https:\/\/beta.pewresearch.org\/pewresearch-org\/short-reads\/2019\/01\/29\/good-jobs-vs-jobs-survey-experiments-can-measure-the-effects-of-question-wording-and-more\/\">survey experiments<\/a>, and the analysis of&nbsp;<a href=\"https:\/\/beta.pewresearch.org\/pewresearch-org\/religion\/\/2018\/11\/20\/where-americans-find-meaning-in-life\/\">open-ended survey responses<\/a>.<\/p>\n\n<\/div>\n\n<div style=\"--desktop-span:4;--tablet-span:4;--mobile-span:4\" class=\"is-vertically-aligned-top wp-block-prc-block-grid-column is-layout-flow wp-block-prc-block-grid-column-is-layout-flow has-desktop-divider has-tablet-divider has-mobile-divider\" data-desktop-span=\"4\" data-tablet-span=\"4\" data-mobile-span=\"4\">\n\n<div class=\"wp-block-group align-to__column-divider has-global-padding is-layout-constrained wp-container-core-group-is-layout-2bc0fe50 wp-block-group-is-layout-constrained\" style=\"padding-top:0;padding-right:0;padding-bottom:0;padding-left:0\">\n<div style=\"--custom-heading-text-color: #FFF; --custom-heading-background-color: #000;\" class=\"wp-block-prc-block-card\"><h2 class=\"prc-card__heading\">OTHER RESEARCH METHODS<\/h2><div class=\"prc-card__content\" style=\"padding-left:var(--wp--preset--spacing--30);padding-right:var(--wp--preset--spacing--30);padding-top:var(--wp--preset--spacing--30);padding-bottom:var(--wp--preset--spacing--30)\"><nav style=\"font-size:clamp(14px, 0.875rem + ((1vw - 3.2px) * 0.114), 15px);line-height:2\" class=\"items-justified-left is-vertical no-wrap wp-block-navigation is-content-justification-left is-nowrap is-layout-flex wp-container-core-navigation-is-layout-e4e95721 wp-block-navigation-is-layout-flex\" aria-label=\"Data Science Methods Page Menu\"><ul style=\"font-size:clamp(14px, 0.875rem + ((1vw - 3.2px) * 0.114), 15px);line-height:2\" class=\"wp-block-navigation__container items-justified-left is-vertical no-wrap wp-block-navigation\"><li class=\"wp-block-navigation-item wp-block-navigation-link\"><a class=\"wp-block-navigation-item__content\"  href=\"https:\/\/beta.pewresearch.org\/pewresearch-org\/u-s-surveys\/\"><span class=\"wp-block-navigation-item__label\">U.S Surveys<\/span><\/a><\/li><li class=\"wp-block-navigation-item wp-block-navigation-link\"><a class=\"wp-block-navigation-item__content\"  href=\"https:\/\/beta.pewresearch.org\/pewresearch-org\/international-surveys\/\"><span class=\"wp-block-navigation-item__label\">International Surveys<\/span><\/a><\/li><li class=\"wp-block-navigation-item wp-block-navigation-link\"><a class=\"wp-block-navigation-item__content\"  href=\"https:\/\/beta.pewresearch.org\/pewresearch-org\/data-sources-for-demographic-research\/\"><span class=\"wp-block-navigation-item__label\">Demographic Analysis<\/span><\/a><\/li><li class=\"wp-block-navigation-item wp-block-navigation-link\"><a class=\"wp-block-navigation-item__content\"  href=\"https:\/\/beta.pewresearch.org\/pewresearch-org\/data-labs\/\"><span class=\"wp-block-navigation-item__label\">Data Science<\/span><\/a><\/li><\/ul><\/nav><\/div><\/div>\n<\/div>\n\n<\/div>\n<\/div>\n\n\n<p class=\"wp-block-paragraph\"><\/p>\n","protected":false},"excerpt":{"rendered":"","protected":false},"author":329,"featured_media":0,"parent":0,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"sub_headline":"","sub_title":"","_prc_public_revisions":[],"_ppp_expiration_hours":0,"_ppp_enabled":false,"ai_generated_summary":"","datacite_doi":"","datacite_doi_citation":"","_prc_seo_qr_attachment_id":0,"footnotes":"","_prc_fork_parent":0,"_prc_fork_status":"","_prc_active_fork":0},"level_of_effort":[],"primary_audience":[],"information_type":[],"class_list":["post-5266","page","type-page","status-publish","hentry"],"_embeds":[],"table_of_contents":[],"datacite_doi":"","prc_seo_data":{"title":"Data Labs","description":"Pew Research Center\u2019s&nbsp;Data Labs&nbsp;team uses computational methods to complement and expand on the Center\u2019s existing research agenda. The team collects text, audiovisual and behavioral datasets; uses innovative computational techniques and&hellip;","og_title":"Data Labs","og_description":"","schema_type":"WebPage","noindex":false,"canonical_url":"","primary_terms":[],"custom_schema":[],"og_image":0,"indexnow_submitted_at":null,"gsc_index_status":null},"prepublish_checks":{"prc-image-alt-text":{"status":"complete","message":"No image blocks in content.","data":null}},"jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/beta.pewresearch.org\/pewresearch-org\/wp-json\/wp\/v2\/pages\/5266","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/beta.pewresearch.org\/pewresearch-org\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/beta.pewresearch.org\/pewresearch-org\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/beta.pewresearch.org\/pewresearch-org\/wp-json\/wp\/v2\/users\/329"}],"replies":[{"embeddable":true,"href":"https:\/\/beta.pewresearch.org\/pewresearch-org\/wp-json\/wp\/v2\/comments?post=5266"}],"version-history":[{"count":9,"href":"https:\/\/beta.pewresearch.org\/pewresearch-org\/wp-json\/wp\/v2\/pages\/5266\/revisions"}],"predecessor-version":[{"id":302686,"href":"https:\/\/beta.pewresearch.org\/pewresearch-org\/wp-json\/wp\/v2\/pages\/5266\/revisions\/302686"}],"wp:attachment":[{"href":"https:\/\/beta.pewresearch.org\/pewresearch-org\/wp-json\/wp\/v2\/media?parent=5266"}],"wp:term":[{"taxonomy":"level_of_effort","embeddable":true,"href":"https:\/\/beta.pewresearch.org\/pewresearch-org\/wp-json\/wp\/v2\/level_of_effort?post=5266"},{"taxonomy":"primary_audience","embeddable":true,"href":"https:\/\/beta.pewresearch.org\/pewresearch-org\/wp-json\/wp\/v2\/primary_audience?post=5266"},{"taxonomy":"information_type","embeddable":true,"href":"https:\/\/beta.pewresearch.org\/pewresearch-org\/wp-json\/wp\/v2\/information_type?post=5266"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}