SlideShare una empresa de Scribd logo
1 de 39
LinkedIn Endorsements: Reputation, Virality, and
Social Tagging
O‟Reilly Strata - February 28, 2013
Sam Shah @sam_shah
Pete Skomoroch @peteskomoroch


©2012 LinkedIn Corporation. All Rights Reserved.
Sam Shah
                          Principal Engineer and Engineering Manager
                          @sam_shah
                          www.linkedin.com/in/shahsam




                          Peter Skomoroch
                          Principal Data Scientist
                          @peteskomoroch
                          www.linkedin.com/in/peterskomoroch




©2012 LinkedIn Corporation. All Rights Reserved.
LinkedIn: The Professional Profile of Record




   200+M            Members                      200M                  Member
                                                                       Profiles


                    ©2012 LinkedIn Corporation. All Rights Reserved.              3
LinkedIn‟s Latest Data Product: Skill Endorsements




                                                     4
Viral Growth: 800M Endorsements in 4 Months




                                              5
Data Amplifies Desire

1. Desire + Social Proof
2. Viral Loops + Network Effects
3. Data Foundation + Recommendation Algorithms




                                                 6
1) Desire & Social Proof




                           7
Email   News Feed   Notification
2) Viral Loops & Network Effects
       A
                           B                          B “accepts”
    endorses
                        notified                     endorsement
       B




                                                     Endorsement
                                                   recommendations




                                                     B           B
                                                  endorses    endorses
                                                     C           D
3) Data Foundation: Skills & Suggested Skills




                                                9
Data Foundation: LinkedIn Skills




                                   10
Social Tagging Accelerates Adoption



     Skill marketing
Skill recommendations
       Virality only


         Suggested
      endorsements




                        ©2012 LinkedIn Cororation. All Rights Reserved.
Outline


              Skill discovery


               Skill tagging


           Skill recommendations


          Suggested endorsements




                                   12
Unsupervised Topic Discovery from Profiles




    Extract




                                             13
Profile
Building the Skills Dictionary                                 (specialties)



   What is the skills dictionary?

     – A growing taxonomy of skills
                                                              Tokenization


                                                               Clustering
     – Generated by mining profiles and maintained by the
       Skills team at LinkedIn
                                                             Crowdsourcing
     – Created using clustering and crowdsourcing.

     – Multiple phrases, acronyms, and misspellings map to
       a single standardized skill.

         250+ different phrases map to “Microsoft Office”
                                                                Taxonomy



                                                                               14
Topic Clustering & Phrase Sense Disambiguation




                                                 15
Skills Dictionary: Microsoft Office


   –   ms office
   –   ms office suite
   –   computer skills including ms office
   –   office 97
   –   microsoft office user
                                             Microsoft Office
   –   mac office
   –   microsoft office 2003 & 2007          (Skill ID = 366)
   –   microsoft office suits
   –   microsoft ofice
   –   microsoft ofiice
   –   ms office certified
   –   office 98
   –   …




                                                                16
Deduplication Signals from Mechanical Turk




                                             17
Sample Task for Mechanical Turk Workers




                                          18
Skill Phrase Deduplication




                             19
Outline


              Skill discovery


               Skill tagging


           Skill recommendations


          Suggested endorsements




                                   20
Skills Classification
   Use skill dictionary metadata to tag, standardize and infer skills
   Run classifiers for each skill on member profiles




                                             Public Speaking

                                             Ruby on Rails

                                             Entrepreneurship

                                             Microsoft Office

                                             AP Style




                                                                         21
Document
 Tagging Skill Phrases                                                        (ex: Profile)


 Tagging: Extract potential skill phrases from text
     Lead designer and engineer for the implementation of a user-
     centric, fully-configurable UI for data aggregation and reporting.
     Developed over 20 SaaS custom applications using Python,
     Javascript and RoR.                                                     Tokenization

                                                                                      Phrases
    JavaScript       RoR     SaaS                           Python
                                                                                      (up to 6 words)

 Standardize unambiguous phrase variants                                   Skills Tagger
      ror
      rubyonrails                                                                       Skills
      ruby on rails development           Ruby on Rails                                 (unordered)
      ruby rails
      ruby on rail                                                         Skills Classifier


                                                                                  Skills
                                                                          (ranked by relevance)


                                                                                              22
Outline


              Skill discovery


               Skill tagging


          Skill recommendations


          Suggested endorsements




                                   23
Skills Classification on Member Profiles

    The skills classifier computes the likelihood of a member to have a skill based on
    the member’s profile, other profiles which share common attributes and their
    connections.




                 Tagging               Standardization                Inference
Profile
                Tokenize free             Transform tags            Rank skills by
  text
            text into phrase tags       into potential skills        likelihood


                        Profile attributes & network signals




                                                                                         24
Profile
Skill Inference

   How suggested/inferred skills work:
                                                                         Extract
     – Profiles with skills help build a massive dataset of             attributes
       (attribute: skills).
                                                              Feature
                                                                              - Company ID
         Example with a title:                                Vectors
                                                                              - Title ID
                                                                              - Groups ID
         Software Engineer         Java      100 000
                                                                              - Industry ID
         Software Engineer         C++        88 000                          -…
         …
                                                                 Skills Classifier
                 Title             Skill      Occurrences



                                                                       Skills
                                                               (ranked by likelihood)


                                                                                     25
Profile
Skill Inference

   How suggested/inferred skills work:
                                                                    Extract
     – The skill likelihood is a conditional model                 attributes

                                                         Feature
                                                                         - Company ID
     – Probabilities are combined using a Naïve Bayes    Vectors
                                                                         - Title ID
       Classifier                                                        - Groups ID
                                                                         - Industry ID
                                                                         -…

                                                            Skills Classifier
   If you are an engineer at Apple, you probably know
    about iPhone Development.


                                                                  Skills
                                                          (ranked by likelihood)


                                                                                26
Skill Suggestions for Your LinkedIn Profile



                                  4% Conversion




                                  49% Conversion


                                              29
Outline


               Skill discovery


               Skill tagging


           Skill recommendations


          Suggested endorsements




                                   30
Social Tagging via Skill Endorsements




                                        31
Suggesting Endorsements
                                                          Candidate
 People-skill combinations in a member‟s network         generation
 Binary classification
                                                    Feature
                                                                   - Company
 Features                                          Vectors
                                                                   - Title
   –   Skill inference score                                       - Groups
   –   Company overlap                                             - Industry
   –   School overlap                                              -…
   –   Group overlap
   –   Industry and functional area similarity                Classifier
   –   Title similarity
   –   Site interactions
   –   Co-interactions
                                                    Suggested Endorsements
                                                      (ranked by likelihood)




                                                                           32
Social Tagging Accelerates Adoption



     Skill marketing

Skill recommendations



  Skill endorsements




                        ©2012 LinkedIn Cororation. All Rights Reserved.
Can We Find Influencers In Venture Capital?




                                              34
Which Skills Are Important for a Data Scientist?




                                                   35
What Technologies are Professionals Adopting?




                                                36
Data Amplifies Desire

1. Desire + Social Proof
2. Viral Loops + Network Effects
3. Data Catalyst + Recommendation Algorithms




                                               37
Infrastructure




•   Apache Hadoop: Parallel processing architecture
•   Apache Kafka: Ingress pipes
•   Azkaban: Hadoop scheduler
•   Voldemort: Egress database
•   Apache Pig: High-level MR language
•   DataFu: Convenience routines

http://data.linkedin.com


R. Sumbaly, J. Kreps, and S. Shah. “The „Big Data‟ ecosystem at LinkedIn”. In SIGMOD 2013 (to appear).

                                          ©2012 LinkedIn Corporation. All Rights Reserved.               38
Learning More
data.linkedin.com

Más contenido relacionado

Destacado

Insanity: The Chase
Insanity: The ChaseInsanity: The Chase
Insanity: The ChaseDavid Sr.
 
Hiring Hacks For Founders
Hiring Hacks For FoundersHiring Hacks For Founders
Hiring Hacks For Foundersdanarkind
 
App secforum2014 andrivet-cplusplus11-metaprogramming_applied_to_software_obf...
App secforum2014 andrivet-cplusplus11-metaprogramming_applied_to_software_obf...App secforum2014 andrivet-cplusplus11-metaprogramming_applied_to_software_obf...
App secforum2014 andrivet-cplusplus11-metaprogramming_applied_to_software_obf...Cyber Security Alliance
 
ENTREPRENEURIAL OPPORTUNITY E X P L O I T A T I O N A N D T H E F A M I L Y
ENTREPRENEURIAL OPPORTUNITY E X P L O I T A T I O N  A N D  T H E  F A M I L YENTREPRENEURIAL OPPORTUNITY E X P L O I T A T I O N  A N D  T H E  F A M I L Y
ENTREPRENEURIAL OPPORTUNITY E X P L O I T A T I O N A N D T H E F A M I L YFabulyzer
 
iPods & Podcasting for Whole Language Instruction
iPods & Podcasting for Whole Language InstructioniPods & Podcasting for Whole Language Instruction
iPods & Podcasting for Whole Language InstructionUnion City High School
 
Herramientas para crear videotutoriales
Herramientas para crear videotutorialesHerramientas para crear videotutoriales
Herramientas para crear videotutorialesAlexander Guzman
 
A Primer - Comparing Japanese, Australian, Dutch and UK RMBS and Mortgage Mar...
A Primer - Comparing Japanese, Australian, Dutch and UK RMBS and Mortgage Mar...A Primer - Comparing Japanese, Australian, Dutch and UK RMBS and Mortgage Mar...
A Primer - Comparing Japanese, Australian, Dutch and UK RMBS and Mortgage Mar...Arthur Karabatsos
 
3 d pie chart circular with hole in center 12 stages powerpoint presentation ...
3 d pie chart circular with hole in center 12 stages powerpoint presentation ...3 d pie chart circular with hole in center 12 stages powerpoint presentation ...
3 d pie chart circular with hole in center 12 stages powerpoint presentation ...SlideTeam.net
 
Sebi compliances and penalty 13.11.05 jaipur
Sebi compliances and penalty               13.11.05 jaipurSebi compliances and penalty               13.11.05 jaipur
Sebi compliances and penalty 13.11.05 jaipurPavan Kumar Vijay
 

Destacado (11)

Insanity: The Chase
Insanity: The ChaseInsanity: The Chase
Insanity: The Chase
 
Your big idea (1)
Your big idea (1)Your big idea (1)
Your big idea (1)
 
Hiring Hacks For Founders
Hiring Hacks For FoundersHiring Hacks For Founders
Hiring Hacks For Founders
 
VDC-Newsletter 2016-08
VDC-Newsletter 2016-08VDC-Newsletter 2016-08
VDC-Newsletter 2016-08
 
App secforum2014 andrivet-cplusplus11-metaprogramming_applied_to_software_obf...
App secforum2014 andrivet-cplusplus11-metaprogramming_applied_to_software_obf...App secforum2014 andrivet-cplusplus11-metaprogramming_applied_to_software_obf...
App secforum2014 andrivet-cplusplus11-metaprogramming_applied_to_software_obf...
 
ENTREPRENEURIAL OPPORTUNITY E X P L O I T A T I O N A N D T H E F A M I L Y
ENTREPRENEURIAL OPPORTUNITY E X P L O I T A T I O N  A N D  T H E  F A M I L YENTREPRENEURIAL OPPORTUNITY E X P L O I T A T I O N  A N D  T H E  F A M I L Y
ENTREPRENEURIAL OPPORTUNITY E X P L O I T A T I O N A N D T H E F A M I L Y
 
iPods & Podcasting for Whole Language Instruction
iPods & Podcasting for Whole Language InstructioniPods & Podcasting for Whole Language Instruction
iPods & Podcasting for Whole Language Instruction
 
Herramientas para crear videotutoriales
Herramientas para crear videotutorialesHerramientas para crear videotutoriales
Herramientas para crear videotutoriales
 
A Primer - Comparing Japanese, Australian, Dutch and UK RMBS and Mortgage Mar...
A Primer - Comparing Japanese, Australian, Dutch and UK RMBS and Mortgage Mar...A Primer - Comparing Japanese, Australian, Dutch and UK RMBS and Mortgage Mar...
A Primer - Comparing Japanese, Australian, Dutch and UK RMBS and Mortgage Mar...
 
3 d pie chart circular with hole in center 12 stages powerpoint presentation ...
3 d pie chart circular with hole in center 12 stages powerpoint presentation ...3 d pie chart circular with hole in center 12 stages powerpoint presentation ...
3 d pie chart circular with hole in center 12 stages powerpoint presentation ...
 
Sebi compliances and penalty 13.11.05 jaipur
Sebi compliances and penalty               13.11.05 jaipurSebi compliances and penalty               13.11.05 jaipur
Sebi compliances and penalty 13.11.05 jaipur
 

Similar a Strata 2013 - LinkedIn Endorsements: Reputation, Virality, and Social Tagging

Keynote Peter Skomoroch - skills, reputation, and search
Keynote   Peter Skomoroch - skills, reputation, and searchKeynote   Peter Skomoroch - skills, reputation, and search
Keynote Peter Skomoroch - skills, reputation, and searchlucenerevolution
 
KEYNOTE: Skills, Reputation and Search
KEYNOTE: Skills, Reputation and SearchKEYNOTE: Skills, Reputation and Search
KEYNOTE: Skills, Reputation and Searchlucenerevolution
 
Skills, Reputation, and Search
Skills, Reputation, and SearchSkills, Reputation, and Search
Skills, Reputation, and SearchPeter Skomoroch
 
LinkedIn Skills: RecSys Conference 2014
LinkedIn Skills: RecSys Conference 2014LinkedIn Skills: RecSys Conference 2014
LinkedIn Skills: RecSys Conference 2014Mathieu Bastian
 
Big Data and Data Standardization at LinkedIn
Big Data and Data Standardization at LinkedInBig Data and Data Standardization at LinkedIn
Big Data and Data Standardization at LinkedInAlexis Baird
 
Fall 2012 Info Session Slides
Fall 2012 Info Session SlidesFall 2012 Info Session Slides
Fall 2012 Info Session SlidesJamie Thai
 
Linked In Corporate Presentation 050312
Linked In Corporate Presentation 050312Linked In Corporate Presentation 050312
Linked In Corporate Presentation 050312Andy Solty
 
Aras PLM Software Visualization
Aras PLM Software VisualizationAras PLM Software Visualization
Aras PLM Software VisualizationAras
 
Mind Click Services
Mind Click ServicesMind Click Services
Mind Click ServicesMind Click
 
Resume: Web Development and Design
Resume: Web Development and DesignResume: Web Development and Design
Resume: Web Development and Designcthuliz
 
Northridge Webinar Share Point 2010 Public Web
Northridge Webinar Share Point 2010 Public WebNorthridge Webinar Share Point 2010 Public Web
Northridge Webinar Share Point 2010 Public Webjfarq
 
Mesh Labs Introduction June 2012
Mesh Labs Introduction June 2012Mesh Labs Introduction June 2012
Mesh Labs Introduction June 2012Umesh Ramalingachar
 
Porque se certificar e mudanças nas provas de SQL Server 2012.
Porque se certificar e mudanças nas provas de SQL Server 2012.Porque se certificar e mudanças nas provas de SQL Server 2012.
Porque se certificar e mudanças nas provas de SQL Server 2012.Marcos Freccia
 
My Intro and Resume - Cameron T Brown
My Intro and Resume - Cameron T BrownMy Intro and Resume - Cameron T Brown
My Intro and Resume - Cameron T Brownself employed
 
LinkedIn API Possibilities
LinkedIn API PossibilitiesLinkedIn API Possibilities
LinkedIn API PossibilitiesKim Beinborn
 
LinkedIn API's
LinkedIn API'sLinkedIn API's
LinkedIn API'sTim Deegan
 
LinkedIn API Possibilities
LinkedIn API PossibilitiesLinkedIn API Possibilities
LinkedIn API PossibilitiesRachel Romba
 
LinkedIn API Possibilities
LinkedIn API PossibilitiesLinkedIn API Possibilities
LinkedIn API PossibilitiesLinkedIn
 

Similar a Strata 2013 - LinkedIn Endorsements: Reputation, Virality, and Social Tagging (20)

Keynote Peter Skomoroch - skills, reputation, and search
Keynote   Peter Skomoroch - skills, reputation, and searchKeynote   Peter Skomoroch - skills, reputation, and search
Keynote Peter Skomoroch - skills, reputation, and search
 
KEYNOTE: Skills, Reputation and Search
KEYNOTE: Skills, Reputation and SearchKEYNOTE: Skills, Reputation and Search
KEYNOTE: Skills, Reputation and Search
 
Skills, Reputation, and Search
Skills, Reputation, and SearchSkills, Reputation, and Search
Skills, Reputation, and Search
 
LinkedIn Skills: RecSys Conference 2014
LinkedIn Skills: RecSys Conference 2014LinkedIn Skills: RecSys Conference 2014
LinkedIn Skills: RecSys Conference 2014
 
Big Data and Data Standardization at LinkedIn
Big Data and Data Standardization at LinkedInBig Data and Data Standardization at LinkedIn
Big Data and Data Standardization at LinkedIn
 
Fall 2012 Info Session Slides
Fall 2012 Info Session SlidesFall 2012 Info Session Slides
Fall 2012 Info Session Slides
 
Linked In Corporate Presentation 050312
Linked In Corporate Presentation 050312Linked In Corporate Presentation 050312
Linked In Corporate Presentation 050312
 
Aras PLM Software Visualization
Aras PLM Software VisualizationAras PLM Software Visualization
Aras PLM Software Visualization
 
SFIAplus V3
SFIAplus V3SFIAplus V3
SFIAplus V3
 
Mind Click Services
Mind Click ServicesMind Click Services
Mind Click Services
 
Resume: Web Development and Design
Resume: Web Development and DesignResume: Web Development and Design
Resume: Web Development and Design
 
Northridge Webinar Share Point 2010 Public Web
Northridge Webinar Share Point 2010 Public WebNorthridge Webinar Share Point 2010 Public Web
Northridge Webinar Share Point 2010 Public Web
 
Mesh Labs Introduction June 2012
Mesh Labs Introduction June 2012Mesh Labs Introduction June 2012
Mesh Labs Introduction June 2012
 
Fomax Corporate Profile
Fomax   Corporate Profile Fomax   Corporate Profile
Fomax Corporate Profile
 
Porque se certificar e mudanças nas provas de SQL Server 2012.
Porque se certificar e mudanças nas provas de SQL Server 2012.Porque se certificar e mudanças nas provas de SQL Server 2012.
Porque se certificar e mudanças nas provas de SQL Server 2012.
 
My Intro and Resume - Cameron T Brown
My Intro and Resume - Cameron T BrownMy Intro and Resume - Cameron T Brown
My Intro and Resume - Cameron T Brown
 
LinkedIn API Possibilities
LinkedIn API PossibilitiesLinkedIn API Possibilities
LinkedIn API Possibilities
 
LinkedIn API's
LinkedIn API'sLinkedIn API's
LinkedIn API's
 
LinkedIn API Possibilities
LinkedIn API PossibilitiesLinkedIn API Possibilities
LinkedIn API Possibilities
 
LinkedIn API Possibilities
LinkedIn API PossibilitiesLinkedIn API Possibilities
LinkedIn API Possibilities
 

Último

Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Mark Goldstein
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesThousandEyes
 
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotesMuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotesManik S Magar
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfpanagenda
 
React Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App FrameworkReact Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App FrameworkPixlogix Infotech
 
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS:  6 Ways to Automate Your Data IntegrationBridging Between CAD & GIS:  6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integrationmarketing932765
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfIngrid Airi González
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Nikki Chapple
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterMydbops
 
Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Hiroshi SHIBATA
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Farhan Tariq
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Alkin Tezuysal
 
Glenn Lazarus- Why Your Observability Strategy Needs Security Observability
Glenn Lazarus- Why Your Observability Strategy Needs Security ObservabilityGlenn Lazarus- Why Your Observability Strategy Needs Security Observability
Glenn Lazarus- Why Your Observability Strategy Needs Security Observabilityitnewsafrica
 
Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)Kaya Weers
 
Generative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptxGenerative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptxfnnc6jmgwh
 
Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024TopCSSGallery
 

Último (20)

Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
 
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotesMuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
 
React Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App FrameworkReact Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App Framework
 
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS:  6 Ways to Automate Your Data IntegrationBridging Between CAD & GIS:  6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdf
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL Router
 
Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
 
Glenn Lazarus- Why Your Observability Strategy Needs Security Observability
Glenn Lazarus- Why Your Observability Strategy Needs Security ObservabilityGlenn Lazarus- Why Your Observability Strategy Needs Security Observability
Glenn Lazarus- Why Your Observability Strategy Needs Security Observability
 
Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)
 
Generative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptxGenerative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptx
 
Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024
 

Strata 2013 - LinkedIn Endorsements: Reputation, Virality, and Social Tagging

  • 1. LinkedIn Endorsements: Reputation, Virality, and Social Tagging O‟Reilly Strata - February 28, 2013 Sam Shah @sam_shah Pete Skomoroch @peteskomoroch ©2012 LinkedIn Corporation. All Rights Reserved.
  • 2. Sam Shah Principal Engineer and Engineering Manager @sam_shah www.linkedin.com/in/shahsam Peter Skomoroch Principal Data Scientist @peteskomoroch www.linkedin.com/in/peterskomoroch ©2012 LinkedIn Corporation. All Rights Reserved.
  • 3. LinkedIn: The Professional Profile of Record 200+M Members 200M Member Profiles ©2012 LinkedIn Corporation. All Rights Reserved. 3
  • 4. LinkedIn‟s Latest Data Product: Skill Endorsements 4
  • 5. Viral Growth: 800M Endorsements in 4 Months 5
  • 6. Data Amplifies Desire 1. Desire + Social Proof 2. Viral Loops + Network Effects 3. Data Foundation + Recommendation Algorithms 6
  • 7. 1) Desire & Social Proof 7
  • 8. Email News Feed Notification 2) Viral Loops & Network Effects A B B “accepts” endorses notified endorsement B Endorsement recommendations B B endorses endorses C D
  • 9. 3) Data Foundation: Skills & Suggested Skills 9
  • 11. Social Tagging Accelerates Adoption Skill marketing Skill recommendations Virality only Suggested endorsements ©2012 LinkedIn Cororation. All Rights Reserved.
  • 12. Outline Skill discovery Skill tagging Skill recommendations Suggested endorsements 12
  • 13. Unsupervised Topic Discovery from Profiles Extract 13
  • 14. Profile Building the Skills Dictionary (specialties)  What is the skills dictionary? – A growing taxonomy of skills Tokenization Clustering – Generated by mining profiles and maintained by the Skills team at LinkedIn Crowdsourcing – Created using clustering and crowdsourcing. – Multiple phrases, acronyms, and misspellings map to a single standardized skill. 250+ different phrases map to “Microsoft Office” Taxonomy 14
  • 15. Topic Clustering & Phrase Sense Disambiguation 15
  • 16. Skills Dictionary: Microsoft Office – ms office – ms office suite – computer skills including ms office – office 97 – microsoft office user Microsoft Office – mac office – microsoft office 2003 & 2007 (Skill ID = 366) – microsoft office suits – microsoft ofice – microsoft ofiice – ms office certified – office 98 – … 16
  • 17. Deduplication Signals from Mechanical Turk 17
  • 18. Sample Task for Mechanical Turk Workers 18
  • 20. Outline Skill discovery Skill tagging Skill recommendations Suggested endorsements 20
  • 21. Skills Classification  Use skill dictionary metadata to tag, standardize and infer skills  Run classifiers for each skill on member profiles Public Speaking Ruby on Rails Entrepreneurship Microsoft Office AP Style 21
  • 22. Document Tagging Skill Phrases (ex: Profile)  Tagging: Extract potential skill phrases from text Lead designer and engineer for the implementation of a user- centric, fully-configurable UI for data aggregation and reporting. Developed over 20 SaaS custom applications using Python, Javascript and RoR. Tokenization Phrases JavaScript RoR SaaS Python (up to 6 words)  Standardize unambiguous phrase variants Skills Tagger ror rubyonrails Skills ruby on rails development Ruby on Rails (unordered) ruby rails ruby on rail Skills Classifier Skills (ranked by relevance) 22
  • 23. Outline Skill discovery Skill tagging Skill recommendations Suggested endorsements 23
  • 24. Skills Classification on Member Profiles The skills classifier computes the likelihood of a member to have a skill based on the member’s profile, other profiles which share common attributes and their connections. Tagging Standardization Inference Profile Tokenize free Transform tags Rank skills by text text into phrase tags into potential skills likelihood Profile attributes & network signals 24
  • 25. Profile Skill Inference  How suggested/inferred skills work: Extract – Profiles with skills help build a massive dataset of attributes (attribute: skills). Feature - Company ID Example with a title: Vectors - Title ID - Groups ID Software Engineer Java 100 000 - Industry ID Software Engineer C++ 88 000 -… … Skills Classifier Title Skill Occurrences Skills (ranked by likelihood) 25
  • 26. Profile Skill Inference  How suggested/inferred skills work: Extract – The skill likelihood is a conditional model attributes Feature - Company ID – Probabilities are combined using a Naïve Bayes Vectors - Title ID Classifier - Groups ID - Industry ID -… Skills Classifier  If you are an engineer at Apple, you probably know about iPhone Development. Skills (ranked by likelihood) 26
  • 27.
  • 28.
  • 29. Skill Suggestions for Your LinkedIn Profile 4% Conversion 49% Conversion 29
  • 30. Outline Skill discovery Skill tagging Skill recommendations Suggested endorsements 30
  • 31. Social Tagging via Skill Endorsements 31
  • 32. Suggesting Endorsements Candidate  People-skill combinations in a member‟s network generation  Binary classification Feature - Company  Features Vectors - Title – Skill inference score - Groups – Company overlap - Industry – School overlap -… – Group overlap – Industry and functional area similarity Classifier – Title similarity – Site interactions – Co-interactions Suggested Endorsements (ranked by likelihood) 32
  • 33. Social Tagging Accelerates Adoption Skill marketing Skill recommendations Skill endorsements ©2012 LinkedIn Cororation. All Rights Reserved.
  • 34. Can We Find Influencers In Venture Capital? 34
  • 35. Which Skills Are Important for a Data Scientist? 35
  • 36. What Technologies are Professionals Adopting? 36
  • 37. Data Amplifies Desire 1. Desire + Social Proof 2. Viral Loops + Network Effects 3. Data Catalyst + Recommendation Algorithms 37
  • 38. Infrastructure • Apache Hadoop: Parallel processing architecture • Apache Kafka: Ingress pipes • Azkaban: Hadoop scheduler • Voldemort: Egress database • Apache Pig: High-level MR language • DataFu: Convenience routines http://data.linkedin.com R. Sumbaly, J. Kreps, and S. Shah. “The „Big Data‟ ecosystem at LinkedIn”. In SIGMOD 2013 (to appear). ©2012 LinkedIn Corporation. All Rights Reserved. 38