Subscribe
Logo
Logo
  • Topics Icon Topics
    • AI Icon AI
    • Banking Icon Banking
    • Blockchain/DeFi Icon Blockchain/DeFi
    • Embedded Finance Icon Embedded Finance
    • Fraud/Identity Icon Fraud/Identity
    • Investing Icon Investing
    • Lending Icon Lending
    • Payments Icon Payments
    • Regulation Icon Regulation
    • Startups Icon Startups
  • Podcasts Icon Podcasts
  • Products Icon Products
    • Webinars Icon Webinars
    • White Papers Icon White Papers
  • TechWire Icon TechWire
  • Search
  • Subscribe
Reading
AI Strategies for Fintech Firms: Data Scientist Sumedha Rai Explains How to Power Up
ShareTweet
Home
AI
AI Strategies for Fintech Firms: Data Scientist Sumedha Rai Explains How to Power Up

AI Strategies for Fintech Firms: Data Scientist Sumedha Rai Explains How to Power Up

Katherine Heires·
AI
·May. 8, 2024·6 min read

If a fintech firm with text data at their disposal is not using it to employ natural language processing models – a branch of artificial intelligence that teaches machines to understand, analyze, and generate human language – they are missing out.

Natural language processing models or NLP can and should be employed regularly to assess a firm’s internal and external text material to understand the sentiments of the customers as well as those of employees.  It can also be used to identify important themes or business trends for the company to assess and integrate into their business strategy.

This is particularly so with the emergence of generative AI, making natural language processing capabilities more powerful than ever.

That is the clear message from data scientist Sumedha Rai in an interview with Fintech Nexus as well as in presentations at two recent conferences in New York City this spring – the AI in Finance Summit and MLConf 2024 gathering of AI and machine learning experts. 

However, these are just two of the results that firms can get out of ongoing text analysis via NLP models.

Rai adds that such NLP tools, used together with other machine learning and AI solutions, can also be used to rapidly summarize and translate documents, understand important tags in text data, personalize interactions with customers, and catch fraudsters by picking up anomalies in their communications.

Sumedha Rai, Senior Data Scientist
Sumedha Rai, Senior Data Scientist

Rai is a senior data scientist at a micro-investment firm in New York City, where she spends a great deal of time analyzing user sentiment and themes, reviewing data to assist in investment decisions, and creating fraud prevention models. She also researches with the Center for Data Science and other affiliated departments at New York University.

She notes that perhaps the most important benefit that comes with regular text analysis via NLP – aside from greater efficiency — is that “people (employees) will have far more time to think about the creative stuff,” related to product development and one’s business strategy, which is a distinct competitive advantage.

Text relevant for NLP analysis or summarization includes everything from customer feedback, postings, complaints, social media comments, emails and survey results to transaction data, company website and internal data, employee communications, claims calls, agent feedback, regulatory, compliance, and legal data.

The benefits of quarterly or ongoing assessment of such texts via NLP, Rai says, is that fintech firms can more easily customize services, build better chatbots,  detect fraud, summarize and translate global compliance and regulatory documents, and gain a better understanding of employee satisfaction levels.

One type of text analysis – using NLP for topic modeling – can be used to track the topics that are uppermost in the minds of one’s customers – including what they like or don’t like about a product — and is an activity that Rai believes may be underutilized by many fintech firms.

Using this technique, “Fintech firms should consider all of their problems and challenges and see how much signal they have received for these problems in the form of text. They should then leverage NLP analysis of text data to help solve many of these issues,” Rai says.

NLP models that can assist with this exercise include Latent Semantic Analysis (LSA), Latent Dirichlet Allocation (LDA), LDA2vec, and BERTopic and its different variations though, for fintech firms in particular, using FinBERT, a transformer model that was specifically pretrained on financial text, is also a great choice.

Among these model choices, however, Rai is particularly partial to the BERT models because they are bi-directional in design and capture context based on this bi-directionality.

“They (BERT models) also have contextual embeddings, which enable the models to understand a word by considering all other words around it and take into account the context for each occurrence of a given word,” Rai says.

She adds:  “Additionally, we now have access to powerful word embeddings from GenAI models, some of which are freely downloadable. However, BERT is a great choice for establishing a baseline when working with LLMs, particularly when working with financial text.”

Rai also highlighted the importance of making full use of Named Entity Recognition (NER), a subfield of NLP that pertains to tagging text so that named entities – individual words, phrases, or sequences of words – can be easily categorized.

“NER is a base technology that is very underused but, in fact, can be employed in multiple ways to better understand what entities customers are most interested in, allowing you to better tailor your communications with them,” Rai says.

She notes that NER analysis gives us a way to extract all critical information a lot faster from a large body of text and it can be used to flag risky interactions or anomalies that may indicate potential fraud. In this way, it plays a pivotal role in one’s ongoing sentiment analysis and text classification.

 One particularly helpful feature, says Rai, is NER’s ability to help one “eyeball compliance documents really fast,” so that one can quickly extract key information from lengthy documents and review it later in an efficient manner.

With the introduction of Generative AI models, Rai says, fintech firms now have access to a powerful tool for text analysis where minimal coding is involved, when using the out-of-box solution directly. However, the tradeoff may be in the level of accuracy that may be lost in using out-of-the-box Gen AI models versus fine tuning a model for specific tasks.

“Generative AI models are pre-trained and so, for a simple text analysis, a pre-trained model can often do the job,” Rai says, adding that with multiple generative AI models to choose from, she favors the ease of use of Chat GPT which continues to improve in accuracy and also has easily accessible APIs to integrate the GPT models into code.

She also finds Meta’s LLAMA models – LLAMA 3 in particular – to be powerful and helpful and it is free to use.

However, Rai warns that fintech firms do have to keep in mind that there are risks in using out-of-the-box generative AI models.

“No sensitive or customer data should be fed to these models. These are hosted systems and the data goes out of your local machines and to a server where the model resides,” Rai says noting that the data from interactions can be analyzed by the companies making the LLMs to improve performance and reliability of their systems.

“Even if you are using the enterprise version of these models, I would still make sure that your data has been stripped of all personally identifiable information (PII) before it is fed into a model or used to query the model,” Rai says.

Evaluating models for bias, discrimination, data security, data privacy, hallucinations, and respectful content creation is also key, Rai says, and starts with looking at what sort of data you are ingesting into the model, making sure all classes, genders, and geographies are represented and also by employing a diverse team of people to work on models as opposed to only one person.

Increasingly, Rai says, some fintech firms are hiring red teams from the outside of their company to conduct a thorough assessment and to ensure that a firm’s working models have been “de-biased.” are not generating biased results that can result in discriminatory practices.

One Gen AI time saver that Rai particularly liked involved asking Chat GPT to create a logo, tagline, and launch press release for a fantasy fintech firm.

“The results were impressive,” Rai said, noting that on an ongoing basis, Chat GPT continues to improve and to impress.

  • Katherine Heires
    Katherine Heires

    Katherine Heires is a business & technology journalist and founder of MediaKat llc. As a freelance journalist, she covers a range of topics including the growing impact on business of AI and machine learning developments and trends related to fintech startups, embedded banking, open banking, behavioral finance, cybersecurity, and fraud prevention technology. Her reporting on financial and fintech topics has appeared in Businessweek Online, Institutional Investor, Risk Intelligence, Risk Management Magazine and Venture Capital Journal.

    View all posts
Tags
AILLMsnatural language processing
Related

Lawsuit, subpoena, have fintechs revisiting AI

Rising digital spending among findings of Alkami, Cornerstone Advisors report

Editorial Cartoon for April 25, 2024

Incognia’s location-based solutions offer fraud antidote

Popular Posts

Today:

  • Stylizedhouse-with-EKGFintech x the One Big Beautiful Bill Jun. 26, 2025
  • Paraform Founders, Jeffrey Li and John KimFunded: Paraform raises $20M to put top recruiters, not AI, in the driver’s seat Jun. 27, 2025
  • Globe-money-symbolsOPINION: Why Brazil and India are leading the global digital shift through payment innovation Jun. 24, 2025
  • WP UmbrellaTo Bank or Not to Bank: The ILC Question Jun. 5, 2025
  • GreenliteAI-Alex-WillGreenlite AI is on a mission to revolutionize banking compliance Jun. 10, 2025
  • Email-AI-pieceAvatar CEOs Have Entered the Meeting Jun. 18, 2025
  • Revised-AI-InvoiceAI Faces Skepticism. Startups Say: OK, Pay When it Works Jun. 25, 2025
  • PayabliFunded: Payments infrastructure co Payabli lands $28M Series B to AI-ify Jun. 20, 2025
  • TechNexus The AI IssueThe AI Paradox Jun. 18, 2025
  • Jon StonaTips from Airwallex x McLaren on Making the Best of a Fintech Sponsorship  Jun. 18, 2025

This month:

  • WP UmbrellaTo Bank or Not to Bank: The ILC Question Jun. 5, 2025
  • DanMurphy-FN-headshotCFPB’s Next Open Banking Battle Begins Jun. 3, 2025
  • GreenliteAI-Alex-WillGreenlite AI is on a mission to revolutionize banking compliance Jun. 10, 2025
  • Current stablecoin adoptionWhy Banks (and Fintechs) Need to Embrace Stablecoins Today Jun. 12, 2025
  • ai-work-nexusWalkMe Vets Declare War on SaaS Bloat with $10M Seed for Autonomous Agents Jun. 10, 2025
  • Ben Hemani, Founding Partner at Bison VenturesThe Risk and Reward of Betting Big on AI’s Next Frontier Jun. 4, 2025
  • Jon StonaTips from Airwallex x McLaren on Making the Best of a Fintech Sponsorship  Jun. 18, 2025
  • Ironclad State of AI ReportThe Economics of AI Trust Jun. 11, 2025
  • Email-AI-pieceAvatar CEOs Have Entered the Meeting Jun. 18, 2025
  • TechNexus The AI IssueMeeker’s AI Bombshell + The VC Betting on AI Reshaping The Physical World  Jun. 4, 2025

  • About
  • Contact
  • Disclaimer
  • Privacy Policy
  • Terms
Subscribe
Copyright © 2025 Fintech Nexus
  • Topics
    • AI
    • Banking
    • Blockchain/DeFi
    • Embedded Finance
    • Fraud/Identity
    • Investing
    • Lending
    • Payments
    • Regulation
    • Startups
  • Podcasts
  • Products
    • Webinars
    • White Papers
  • TechWire
  • Contact Us
Start typing to see results or hit ESC to close
lis digital banking USA Lending Club UK
See all results