利用生成式 AI 减少信用评分的偏差

Wei You Pan, Ashwin Gangadhar, and Jack Yallop
February 20, 2024 | Updated: March 5, 2024

信用评分在确定谁获得信贷以及以何种条件获得信贷方面发挥着关键作用。然而，尽管这一点很重要，但传统的信用评分系统长期以来一直受到一系列关键问题的困扰 — 从偏见和歧视到有限的数据考虑和可扩展性挑战。例如，一项针对美国贷款的研究表明，与来自特权群体的借款人相比，少数族裔借款人被收取的利率更高 (+8%)，被拒绝贷款的频率也更高 (+14%)。

僵化的信贷系统反应迟缓，无法快速适应不断变化的经济形势和消费者行为，这会导致一些人得不到充分服务并被忽视。为了解决这一问题，银行和其他贷款机构正在寻求采用人工智能来开发日益复杂的信用风险评分模型。'

在本文中，我们将了解信用评分的基础知识、当前系统面临的挑战，并深入探讨如何利用人工智能 (AI)，特别是生成式 AI (genAI) 来减少偏差并提高准确性。从替代数据源的整合到机器学习 (ML) 模型的开发，我们将揭示 AI 在重塑信用评分未来方面的变革潜力。

请查看我们的 AI 资源页面，了解有关使用 MongoDB 构建 AI 驱动的应用的更多信息。

什么是信用评分？

信用评分是金融领域不可或缺的一个方面，是衡量个人信用状况的一个数字标准。贷方利用这一重要指标来评估与向个人或企业提供信贷或贷款相关的潜在风险。

传统上，银行依赖于通常使用线性回归或逻辑回归构建的预定义规则和统计模型。这些模型以历史信用数据为基础，重点关注支付历史、信用利用率和信用历史长度等因素。

但是，评估新的信用申请人是一项挑战，因此需要更准确的分析评估。为了满足传统上受到歧视的、得不到充分服务或服务不足的群体的需求，金融科技公司和数字银行正越来越多地将传统信用记录以外的信息与其他数据结合起来，以便更全面地了解个人的金融行为。

传统信用评分面临的挑战

信用评分是现代生活中不可或缺的一部分，因为它在各种金融交易（包括获得贷款、租房、购买保险，甚至是就业筛选）中起着至关重要的决定性作用。追求信用可能是一段迷宫般的旅程，传统信用评分模型存在一些挑战或限制，这些挑战或限制通常会阻碍信用申请的批准。

有限的信用记录：许多人，尤其是那些刚接触信用游戏的人，都会遇到一个重大障碍 — 信用记录有限或根本不存在。传统的信用评分模型严重依赖于过去的信用行为，这使得没有良好信用记录的个人很难证明自己的信用度。大约有 4,500 万美国人缺乏信用评分，仅仅是因为他们没有这些数据点。
收入不稳定：非经常性收入（这在兼职工作或自由职业中很常见）对传统的信用评分模型提出了挑战，可能会给个人贴上更高风险的标签，并导致其申请被拒绝或信用额度受到限制。关于 2023 年美国有多少人从事个体经营，数据来源各不相同。一个数据来源显示，有超过 2,700 万美国人提交了附表 C 纳税文件，其中涵盖了来自一项业务的净收入或损失 — 这突显了那些个体经营者对于不同信用评分方法的需求。
现有信用利用率高：对现有信用的严重依赖往往被视为潜在财务压力的信号，从而影响信用决策。信用申请可能会面临拒绝或以不太有利的条件获得批准，这反映出对申请人明智地管理额外信用能力的担忧。
拒绝原因不明确：即使了解申请被拒的原因也无法让申请人从根本上解决问题 — 在英国，2022 年 4 月至 2023 年 4 月期间的一项研究显示，申请被拒的主要原因包括“信用记录不良”(38%)、“无力偿还贷款”(38%)、“有太多其他信贷”(19%)，还有 10% 的人表示没有被告知原因。即使给出了原因，往往也太模糊，让申请人一筹莫展，难以解决根本问题并提高他们未来申请的信用度。缺乏透明度不仅会给客户带来麻烦，还可能导致银行受到处罚。例如，2023 年，柏林一家银行因在拒绝信用卡申请时缺乏透明度而被罚款 30 万欧元。
缺乏灵活性：消费者行为的转变，尤其是年轻一代对数字交易的青睐，对传统模式提出了挑战。零工经济的兴起、非传统就业、学生贷款债务和高昂的生活成本等因素使评估收入稳定性和财务健康状况变得更加复杂。在像新冠疫情这样前所未有的破坏事件中，传统的信用风险预测是有限的，在评分模型中没有考虑到这一点。

认识到这些挑战，就需要有替代的信用评分模型，以适应不断变化的金融行为，处理非传统的数据来源，并在当今动态变化的金融环境中提供更具包容性和更准确的信用度评估。

使用替代数据进行信用评分

替代信用评分是指使用非传统数据源（又名替代数据）和方法来评估个人信用度。传统的信用评分在很大程度上依赖于主要征信机构的信用记录，而替代信用评分则纳入了更广泛的因素，以更全面地反映个人的金融行为。以下是一些常用的替代数据源：

公用事业付款：除信用记录外，持续支付水电等公用事业费用也是衡量财务责任的有力指标，显示了履行财务义务的决心，提供了传统指标之外的重要见解。
租赁记录：对于没有抵押贷款的人来说，租金支付历史记录是一个重要的替代数据来源。持续、及时支付租金的表现全面反映了对财务纪律的遵守和可靠性。
手机使用模式：手机的普及解锁了大量的替代数据。通过分析通话和短信模式，可以深入了解个人的网络、稳定性和社交关系，为信用评估提供有价值的信息。
网上购物行为：对网购的频率、类型和金额进行研究，为了解消费行为提供了宝贵的信息，有助于对财务习惯有更细致的了解。
教育和就业背景：替代信用评分考虑了个人的教育和就业经历。教育成就和稳定就业等积极指标在评估金融稳定性方面发挥着至关重要的作用。

这些替代数据源代表着向更具包容性、更细致入微、更全面的信用评估方法的转变。随着金融技术的不断进步，利用这些替代数据集可确保对信用度进行更全面的评估，标志着信用评分模型的发展迈出了变革性的一步。

使用人工智能进行替代信用评分

除了使用替代数据外，作为一种替代方法，人工智能已成为应对传统信用评分挑战的变革力量，原因有很多：

减少偏见的能力：与传统的统计模型一样，人工智能模型（包括大型语言模型）在有偏见的历史数据上进行训练后，也会继承这些数据中存在的偏见，从而导致歧视性的结果。大型语言模型可能更关注某些特征而忽略其他一些特征，或者不能从更广泛的背景去理解个人财务状况，从而导致决策存在偏见。但是，有多种技术可以减少 AI 模型的偏见：

缓解策略：从使用多样化和有代表性的培训数据开始，避免强化现有的偏见。不充分或无效的缓解策略可能会导致 AI 信用评分模型中持续出现有偏见的结果。细心关注收集的数据和模型开发对于减少这种偏见至关重要。将替代数据纳入信用评分在减少偏见方面发挥着关键作用。
在训练过程中，严格的偏见检测工具、公平性约束和正则化技术可增强模型的问责性：平衡特征表示并采用后处理技术和专门算法有助于减少偏见。对模型进行全面评估、持续监控和迭代改进，同时结合对道德准则和管理规范的遵守，可以从多个层面减少人工智能模型中的偏见。这对于解决与历史信用数据中可能存在的人口或社会经济偏见有关的问题尤为重要。
定期进行偏见审查：定期进行审查以识别并减少大型语言模型中的偏见。这可能涉及分析模型输出结果，以发现不同人口群体之间的差异，并相应调整算法。
透明度和可解释性：提高大型语言模型的透明度和可解释性，以了解决策是如何做出的。这可以帮助识别和解决有偏见的决策过程。Trade Ledger 是一种贷款软件即服务 (SaaS) 工具，它使用数据驱动的方法，通过将具有不同模式的多个来源的数据整合到单个数据源中，以更高的透明度和可追溯性做出明智的决策。

能够分析海量且多样化的数据集：与依赖预定义规则和历史信用数据的传统模型不同，AI 模型可以处理大量信息，包括非传统数据源，以对个人信用度进行更全面的评估，确保考虑到更广泛的金融行为。
AI 带来了无与伦比的适应性：随着经济条件的变化和消费者行为的演变，AI 驱动的模型可以快速调整并从新数据中学习。持续学习可确保信用评分在瞬息万变的金融环境中保持相关性和有效性。

对于在信用评分中使用 AI，银行最常见的反对意见与信用决策的透明度和可解释性相关。一些 AI 模型，尤其是深度学习算法，其本身的复杂性可能会导致难以为信用决策提供清晰的解释。幸运的是，AI 模型的透明度和可解释性已经取得了显著的进步。现在，SHAPley Additive exPlanations (SHAP) 值和 Local Interpretable Model-Agnostic Explanations (LIME) 图等技术以及可解释 AI (XAI) 领域的其他一些进步，让我们能够了解模型是如何做出具体信用决策的。这不仅增强了对信用评分过程的信任，还解决了 AI 模型是“黑匣子”的普遍批评。

了解利用通常以半结构化或非结构化格式出现的替代数据的重要性后，金融机构与 MongoDB 合作，以更快、更简单、更灵活的方式进行付款和提供信用，以增强其信用申请流程：

作为印度尼西亚领先的一家数字银行，阿马尔银行正在为无法从传统银行获得金融服务（无银行账户和服务支持不足）的人群提供小额贷款，从而消除偏见。由于传统的承保流程不足以涵盖缺乏信用记录或抵押品的客户，因此该银行利用非结构化数据简化了贷款决策。该银行利用 MongoDB Atlas 开发了一个集成结构化和非结构化数据的预测性分析模型，用于评估借款人的信用水平。MongoDB 具备强大的可扩展性和多样化数据类型的管理能力，从而助力该银行扩展和优化贷款业务。
对于绝大多数印度人来说，由于严格的监管和缺乏信用数据，获得信贷批准通常困难重重。通过使用现代承保系统，印度金融科技生态系统的领先创新者 Slice 正在简化其 KYC 流程，以提供更顺畅的信贷体验，从而拓宽印度人获得信贷的渠道。通过在不同的使用案例中使用 MongoDB Atlas（包括作为实时 ML 特征存储），slice 改变了他们的引导流程，将处理时间缩短至不到一分钟。slice 使用具有 MongoDB 和 ML 模型的实时功能存储来即时计算 100 多个变量，从而可以在不到 30 秒的时间内确定信贷资格。

使用生成式 AI 改变信用评分

在信用评分中除了使用替代数据和 AI 外，还有 GenAI，GenAI 具有创建合成数据和理解复杂模式的能力，提供更细致、更具适应性和预测性的方法，因此有可能彻底改变信用评分和评估。

GenAI 综合不同数据集的能力解决了传统信用评分的主要限制之一 — 对历史信用数据的依赖。通过创建反映现实世界金融行为的合成数据，GenAI 模型可以对信用度进行更具包容性的评估。这一变革性转变促进了金融包容性，为更广泛的人群获得信贷机会打开了大门。

适应性在驾驭动态发展的经济条件和不断变化的消费行为方面发挥着举足轻重的作用。传统模型难以适应不可预见的干扰，与之不同的是，GenAI 的持续学习和适应能力可确保信用评分保持实时有效，提供了一个更具弹性和响应能力的信用风险评估工具。除了预测能力之外，GenAI 还可以提高信用评分的透明度和可解释性。模型可以为其决策提供解释，为信用评估提供更清晰的见解，并增强消费者、监管机构和金融机构之间的信任。

然而，在使用 GenAI 的过程中，一个关键问题是幻觉问题，即模型提供的信息可能是毫无意义或完全错误的。有几种技术可以降低这种风险，其中一种是使用检索增强生成 (RAG) 方法。RAG 通过将模型的响应建立在最新来源的事实信息基础上，确保模型的响应反映最新、最准确的信息，从而最大限度地减少幻觉。

例如，Patronus AI 利用 RAG 和 MongoDB Atlas，使工程师能够在现实场景中对大型语言模型 (LLM) 性能进行评分和基准测试，大规模生成对抗性测试用例，并监控幻觉及其他意外和不安全的行为。这有助于大规模检测 LLM 错误，并安全、自信地部署 AI 产品。

MongoDB 的另一个技术合作伙伴是 Robust Intelligence。该公司的 AI 防火墙通过实时验证输入和输出来保护生产中的 LLM。它可以评估并降低幻觉等操作风险、包括模型偏见和有毒输出在内的道德风险，以及提示词注入和个人身份信息 (PII) 提取等安全风险。

随着生成式 AI 的不断成熟，将其融入信用评分和更广泛的信贷申请系统有望带来的不仅仅是技术进步，而是我们评估和发放信贷方式的根本性转变。

信贷史上的关键时刻

替代数据、人工智能和生成式 AI 的融合正在重塑信用评分的基础，标志着金融业进入了一个关键时刻。通过采用替代信用评分方法，提供更具包容性和更细致的评估，传统模式所面临的挑战正在被克服。生成式 AI 虽然会带来幻觉的潜在挑战，但它站在创新的前沿，不仅彻底改变了技术能力，而且从根本上重新定义了信用评估方式，开创了具有金融包容性、效率和公平的新时代。

如果您想了解有关使用 MongoDB 构建 AI 密集型应用程序的更多信息，请查看以下资源：

了解 slice 如何在不到一分钟的时间内为数百万人完成信贷审批

← Previous
Reducing Bias in Credit Scoring with Generative AI
This post is also available in: Deutsch , Français , Español , Português , Italiano , 한국어 , 简体中文 . Credit scoring plays a pivotal role in determining who gets access to credit and on what terms. Despite its importance, however, traditional credit scoring systems have long been plagued by a series of critical issues, from biases and discrimination, to limited data consideration and scalability challenges. For example, a study of US loans showed that minority borrowers were charged higher interest rates (+8%) and rejected loans more often (+14%) than borrowers from more privileged groups. The rigid nature of credit systems means that they can be slow to adapt to changing economic landscapes and evolving consumer behaviors, leaving some individuals underserved and overlooked. To overcome this, banks and other lenders are looking to adopt artificial intelligence to develop increasingly sophisticated models for scoring credit risk. In this article, we'll explore the fundamentals of credit scoring, the challenges current systems present, and delve into how artificial intelligence (AI), in particular, generative AI (genAI) can be leveraged to mitigate bias and improve accuracy. From the incorporation of alternative data sources to the development of machine learning (ML) models, we'll uncover the transformative potential of AI in reshaping the future of credit scoring. Check out our AI resource page to learn more about building AI-powered apps with MongoDB. What is credit scoring? Credit scoring is an integral aspect of the financial landscape, serving as a numerical gauge of an individual's creditworthiness. This vital metric is employed by lenders to evaluate the potential risk associated with extending credit or lending money to individuals or businesses. Traditionally, banks rely on predefined rules and statistical models often built using linear regression or logistic regression. The models are based on historical credit data, focusing on factors such as payment history, credit utilization, and length of credit history. However, assessing new credit applicants poses a challenge, leading to the need for more accurate profiling. To cater to the underserved or unserved segments traditionally discriminated against, fintechs and digital banks are increasingly incorporating information beyond traditional credit history with alternative data to create a more comprehensive view of an individual's financial behavior. Challenges with traditional credit scoring Credit scores are integral to modern life because they serve as a crucial determinant in various financial transactions, including securing loans, renting an apartment, obtaining insurance, and even sometimes in employment screenings. Because the pursuit of credit can be a labyrinthine journey, here are some of the challenges or limitations with traditional credit scoring models that often cloud the path to credit application approval. Limited credit history: Many individuals, especially those new to the credit game, encounter a significant hurdle – limited or non-existent credit history. Traditional credit scoring models heavily rely on past credit behavior, making it difficult for individuals without a robust credit history to prove their creditworthiness. Roughly 45 million Americans lack credit scores simply because those data points do not exist for them. Inconsistent income: Irregular income, typical in part-time work or freelancing, poses a challenge for traditional credit scoring models, potentially labeling individuals as higher risk and leading to application denials or restrictive credit limits. In 2023 in the United States , data sources differ on how many people are self-employed. One source shows more than 27 million Americans filed Schedule C tax documents, which cover net income or loss from a business – highlighting the need for different methods of credit scoring for those self-employed. High utilization of existing credit: Heavy reliance on existing credit is often perceived as a signal of potential financial strain, influencing credit decisions. Credit applications may face rejection or approval with less favorable terms, reflecting concerns about the applicant's ability to judiciously manage additional credit. Lack of clarity in rejection reasons: Understanding the reasons behind rejections hinders applicants from addressing the root causes – in the UK, a study between April 2022 and April 2023 showed the main reasons for rejection included “poor credit history” (38%), “couldn’t afford the repayments” (28%), “having too much other credit" (19%) and 10% said they weren’t told why. The reasons even when given are often too vague which leaves applicants in the dark, making it difficult for them to address the root cause and enhance their creditworthiness for future applications. The lack of transparency is not only a trouble for customers, it can also lead to a penalty for banks. For example, a Berlin bank was fined €300k in 2023 for lacking transparency in declining a credit card application. Lack of flexibility: Shifts in consumer behavior, especially among younger generations preferring digital transactions, challenge traditional models. Factors like the rise of the gig economy, non-traditional employment, student loan debt, and high living costs complicate assessing income stability and financial health. Traditional credit risk predictions are limited during unprecedented disruptions like COVID-19, not taking this into account in scoring models. Recognizing these challenges highlights the need for alternative credit scoring models that can adapt to evolving financial behaviors, handle non-traditional data sources, and provide a more inclusive and accurate assessment of creditworthiness in today's dynamic financial landscape. Credit scoring with alternative data Alternative credit scoring refers to the use of non-traditional data sources (aka. alternative data) and methods to assess an individual's creditworthiness. While traditional credit scoring relies heavily on credit history from major credit bureaus, alternative credit scoring incorporates a broader range of factors to create a more comprehensive picture of a person's financial behavior. Below are some of the popular alternative data sources: Utility payments: Beyond credit history, consistent payments for utilities like electricity and water offer a powerful indicator of financial responsibility and reveal a commitment to meeting financial obligations, providing crucial insights beyond traditional metrics. Rental history: For those without a mortgage, rental payment history emerges as a key alternative data source. Demonstrating consistent and timely rent payments paints a comprehensive picture of financial discipline and reliability. Mobile phone usage patterns: The ubiquity of mobile phones unlocks a wealth of alternative data. Analyzing call and text patterns provides insights into an individual's network, stability, and social connections, contributing valuable information for credit assessments. Online shopping behavior: Examining the frequency, type, and amount spent on online purchases offers valuable insights into spending behaviors, contributing to a more nuanced understanding of financial habits. Educational and employment background: Alternative credit scoring considers an individual's educational and employment history. Positive indicators, such as educational achievements and stable employment, play a crucial role in assessing financial stability. These alternative data sources represent a shift towards a more inclusive, nuanced, and holistic approach to credit assessments. As financial technology continues to advance, leveraging these alternative data sets ensures a more comprehensive evaluation of creditworthiness, marking a transformative step in the evolution of credit scoring models. Alternative credit scoring with artificial intelligence Besides the use of alternative data, the use of AI as an alternative method has emerged as a transformative force to address the challenges of traditional credit scoring for a number of reasons: Ability to mitigate bias: Like traditional statistical models, AI models, including LLMs, trained on historical data that are biased will inherit biases present in that data, leading to discriminatory outcomes. LLMs might focus on certain features more than others or may lack the ability to understand the broader context of an individual's financial situation leading to biased decision-making. However, there are various techniques to mitigate the bias of AI models: Mitigation strategies: Initiatives begin with the use of diverse and representative training data to avoid reinforcing existing biases. Inadequate or ineffective mitigation strategies can result in biased outcomes persisting in AI credit scoring models. Careful attention to the data collected and model development is crucial in mitigating this bias. Incorporating alternative data for credit scoring plays a critical role in reducing biases. Rigorous bias detection tools, fairness constraints, and regularization techniques during training enhance model accountability: Balancing feature representation and employing post-processing techniques and specialized algorithms contribute to bias mitigation. Inclusive model evaluation, continuous monitoring, and iterative improvement, coupled with adherence to ethical guidelines and governance practices, complete a multifaceted approach to reducing bias in AI models. This is particularly significant in addressing concerns related to demographic or socioeconomic biases that may be present in historical credit data. Regular bias audits: Conduct regular audits to identify and mitigate biases in LLMs. This may involve analyzing model outputs for disparities across demographic groups and adjusting the algorithms accordingly. Transparency and explainability: Increase transparency and explainability in LLMs to understand how decisions are made. This can help identify and address biased decision-making processes. Trade Ledger , a lending software as a service (SaaS) tool, uses a data-driven approach to make informed decisions with greater transparency and traceability by bringing data from multiple sources with different schemas into a single data source. Ability to analyze vast and diverse datasets: Unlike traditional models that rely on predefined rules and historical credit data, AI models can process a myriad of information, including non-traditional data sources, to create a more comprehensive assessment of an individual's creditworthiness, ensuring that a broader range of financial behaviors is considered. AI brings unparalleled adaptability to the table: As economic conditions change and consumer behaviors evolve, AI-powered models can quickly adjust and learn from new data. The continuous learning aspect ensures that credit scoring remains relevant and effective in the face of ever-changing financial landscapes. The most common objections from banks to not using AI in credit scoring are transparency and explainability in credit decisions. The inherent complexity of some AI models, especially deep learning algorithms, may lead to challenges in providing clear explanations for credit decisions. Fortunately, the transparency and interpretability of AI models have seen significant advancements. Techniques like SHapley Additive exPlanations (SHAP) values and Local Interpretable Model-Agnostic Explanations (LIME) plots</a,> and several other advancements in the domain of Explainable AI (XAI) now allow us to understand how the model arrives at specific credit decisions. This not only enhances trust in the credit scoring process but also addresses the common critique that AI models are "black boxes." Understanding the criticality of leveraging alternative data that often comes in a semi or unstructured format, financial institutions work with MongoDB to enhance their credit application processes with a faster, simpler, and more flexible way to make payments and offer credit: Amar Bank, Indonesia's leading digital bank , is combatting bias by providing microloans to people who wouldn’t be able to get financial services from traditional banks (unbanked and underserved). Traditional underwriting processes were inadequate for customers lacking credit history or collateral so they have streamlined lending decisions by harnessing unstructured data. Leveraging MongoDB Atlas, they developed a predictive analytics model integrating structured and unstructured data to assess borrower creditworthiness. MongoDB's scalability and capability to manage diverse data types were instrumental in expanding and optimizing their lending operations. For the vast majority of Indians, getting credit is typically challenging due to stringent regulations and a lack of credit data. Through the use of modern underwriting systems, Slice, a leading innovator in India’s fintech ecosystem , is helping broaden the accessibility to credit in India by streamlining their KYC process for a smoother credit experience. By utilizing MongoDB Atlas across different use cases, including as a real-time ML feature store, slice transformed their onboarding process, slashing processing times to under a minute. slice uses the real-time feature store with MongoDB and ML models to compute over 100 variables instantly, enabling credit eligibility determination in less than 30 seconds. Transforming credit scoring with generative AI Besides the use of alternative data and AI in credit scoring, GenAI has the potential to revolutionize credit scoring and assessment with its ability to create synthetic data and understand intricate patterns, offering a more nuanced, adaptive, and predictive approach. GenAI’s capability to synthesize diverse data sets addresses one of the key limitations of traditional credit scoring – the reliance on historical credit data. By creating synthetic data that mirrors real-world financial behaviors, GenAI models enable a more inclusive assessment of creditworthiness. This transformative shift promotes financial inclusivity, opening doors for a broader demographic to access credit opportunities. Adaptability plays a crucial role in navigating the dynamic nature of economic conditions and changing consumer behaviors. Unlike traditional models, which struggle to adjust to unforeseen disruptions, GenAI’s ability to continuously learn and adapt ensures that credit scoring remains effective in real-time, offering a more resilient and responsive tool for assessing credit risk. In addition to its predictive prowess, GenAI can contribute to transparency and interpretability in credit scoring. Models can generate explanations for their decisions, providing clearer insights into credit assessments, and enhancing trust among consumers, regulators, and financial institutions. One key concern however in making use of GenAI is the problem of hallucination, where the model may present information that is either nonsensical or outright false. There are several techniques to mitigate this risk and one approach is using the Retrieval Augment Generation (RAG) approach. RAG minimizes hallucinations by grounding the model’s responses in factual information from up-to-date sources, ensuring the model’s responses reflect the most current and accurate information available. Patronus AI , for example, leverages RAG with MongoDB Atlas to enable engineers to score and benchmark large language models (LLMs) performance on real-world scenarios, generate adversarial test cases at scale, and monitor hallucinations and other unexpected and unsafe behavior. This can help to detect LLM mistakes at scale and deploy AI products safely and confidently. Another technology partner of MongoDB is Robust Intelligence . The firm’s AI Firewall protects LLMs in production by validating inputs and outputs in real-time. It assesses and mitigates operational risks such as hallucinations, ethical risks including model bias and toxic outputs, and security risks such as prompt injections and personally identifiable information (PII) extractions. As generative AI continues to mature, its integration into credit scoring and the broader credit application systems promises not just a technological advancement, but a fundamental transformation in how we evaluate and extend credit. A pivotal moment in the history of credit The convergence of alternative data, artificial intelligence, and generative AI is reshaping the foundations of credit scoring, marking a pivotal moment in the financial industry. The challenges of traditional models are being overcome through the adoption of alternative credit scoring methods, offering a more inclusive and nuanced assessment. Generative AI, while introducing the potential challenge of hallucination, represents the forefront of innovation, not only revolutionizing technological capabilities but fundamentally redefining how credit is evaluated, fostering a new era of financial inclusivity, efficiency, and fairness. If you would like to discover more about building AI-enriched applications with MongoDB, take a look at the following resources: Digitizing the lending and leasing experience with MongoDB Deliver AI-enriched apps with the right security controls in place, and at the scale and performance users expect Discover how slice enables credit approval in less than a minute for millions Solution: Credit card application with generative AI
February 20, 2024

Next →
Building Gen AI with MongoDB & AI Partners | February 2025
February was big for MongoDB—and, more importantly, for anyone looking to build AI applications that deliver highly accurate, relevant information (in other words, for everyone building AI apps). MongoDB announced the acquisition of Voyage AI , a pioneer in state-of-the-art embedding and reranking models that power next-generation AI applications. Because generative AI is by nature probabilistic, models can “hallucinate”, and generate false or misleading information. This can lead to serious risks, especially in cases or industries (e.g., financial services) where accurate information is paramount. To address this, organizations building AI apps need high-quality retrieval; they need to trust that the most relevant information is extracted from their data with precision. Voyage AI’s advanced embedding and reranking models enable applications to extract meaning from highly specialized and domain-specific text and unstructured data. With roots at Stanford and MIT, Voyage AI’s world-class team is trusted by AI innovators like Anthropic, LangChain, Harvey, and Replit. Integrating Voyage AI’s technology with MongoDB will enable organizations to easily build trustworthy, AI-powered applications by offering highly accurate and relevant information retrieval deeply integrated with operational data. For more, check out MongoDB CEO Dev Ittycheria’s blog post about Voyage AI , and what this means for developers and businesses (in short, delivering high-quality results at scale). Onward! P.S. If you’re in Vegas for HumanX this week, stop by booth 412 to say hi to MongoDB! Welcoming new AI and tech partners The Voyage AI news was hardly the only exciting development last month. In February 2025, MongoDB welcomed three new AI and tech partners that offer product integrations with MongoDB. Read on to learn more about each great new partner! CopilotKit Seattle-based CopilotKit provides open source infrastructure for in-app AI copilots. CopilotKit helps organizations build production-ready copilots and agents effortlessly. “We’re excited to be partnering with MongoDB to help companies build best-in-class copilots that leverage RAG & take action based on internal data,” said Uli Barkai, Co-Founder and Chief Marketing Officer at CopilotKit. “MongoDB made it dead simple to build a scalable vector database with operational data. This collaboration enables developers to easily ship production-grade RAG applications.” Varonis Varonis is the leader in data security, protecting data wherever it lives—across SaaS, IaaS, and hybrid cloud environments. Varonis’ cloud-native Data Security Platform continuously discovers and classifies critical data, removes exposures, and detects advanced threats with AI-powered automation. “Varonis’s mission is to protect data wherever it lives,” said David Bass, Executive Vice President of Engineering and Chief Technology Officer at Varonis. “We are thrilled to further advance our mission by offering AI-powered data security and compliance for MongoDB, the database of choice for high-performance application and AI development. With this integration, joint customers can automatically discover and classify sensitive data, detect abnormal activities, secure AI data pipelines, and prevent data leaks.” Xlrt Xlrt is an automated insight-generation platform that enables financial institutions to create innovative financial credit products at scale by simplifying the financial spreading process. “We are excited to partner with MongoDB Atlas to transform AI-driven financial workflows,” said Rupesh Chaudhuri, Chief Operating Officer and Co-Founder of Xlrt. “XLRT.ai leverages agentic AI, combining graph-based contextualization, vector search, and LLMs to redefine data-driven decision-making. With MongoDB's robust NoSQL and vector search capabilities, we’re delivering unparalleled efficiency, accuracy, and scalability in automating financial processes.” To learn more about building AI-powered apps with MongoDB, check out our AI Learning Hub and stop by our Partner Ecosystem Catalog to read about our integrations with MongoDB’s ever-evolving AI partner ecosystem. And visit the MongoDB AI Applications Program (MAAP) page to learn how MongoDB and the MAAP ecosystem helps organizations build applications with advanced AI capabilities.
March 12, 2025