Panlingo.LanguageIdentification.FastText 0.2.0

dotnet add package Panlingo.LanguageIdentification.FastText --version 0.2.0                
NuGet\Install-Package Panlingo.LanguageIdentification.FastText -Version 0.2.0                
This command is intended to be used within the Package Manager Console in Visual Studio, as it uses the NuGet module's version of Install-Package.
<PackageReference Include="Panlingo.LanguageIdentification.FastText" Version="0.2.0" />                
For projects that support PackageReference, copy this XML node into the project file to reference the package.
paket add Panlingo.LanguageIdentification.FastText --version 0.2.0                
#r "nuget: Panlingo.LanguageIdentification.FastText, 0.2.0"                
#r directive can be used in F# Interactive and Polyglot Notebooks. Copy this into the interactive tool or source code of the script to reference the package.
// Install Panlingo.LanguageIdentification.FastText as a Cake Addin
#addin nuget:?package=Panlingo.LanguageIdentification.FastText&version=0.2.0

// Install Panlingo.LanguageIdentification.FastText as a Cake Tool
#tool nuget:?package=Panlingo.LanguageIdentification.FastText&version=0.2.0                

Panlingo.LanguageIdentification.FastText

Welcome to Panlingo.LanguageIdentification.FastText, a .NET wrapper for the FastText library by Facebook AI Research (FAIR). This package seamlessly integrates language identification capabilities into .NET applications, leveraging the powerful and efficient FastText library to recognize and classify texts in multiple languages. Ideal for applications that require fast and accurate language detection.

Requirements

  • Runtime: .NET >= 5.0
  • OS: Linux (Ubuntu, Debian), Windows 10+ or Windows Server 2019+, macOS
  • Arch: AMD64 (or ARM for macOS)

Installation

To integrate the FastText functionality, follow these steps:

Install the NuGet package:

dotnet add package Panlingo.LanguageIdentification.FastText

Usage

Integrating the FastText library into your .NET application is straightforward. Here’s a quick guide to get you started:

  1. Install the Package: Ensure you have added the Panlingo.LanguageIdentification.FastText package to your project using the provided installation command.
  2. Download the Models: Follow the provided commands to download the pretrained language identification models.
  3. Initialize the Library: Follow the example snippet to initialize and use the FastText library for detecting languages.
using Panlingo.LanguageIdentification.FastText;

class Program
{
    static void Main()
    {
        using var fastText = new FastTextDetector();
        fastText.LoadDefaultModel();

        var predictions = fastText.Predict(
            text: "Привіт, як справи?", 
            count: 10
        );

        foreach (var prediction in predictions)
        {
            Console.WriteLine($"{prediction.Label}: {prediction.Probability}");
        }

        var dimensions = fastText.GetModelDimensions();
        var labels = fastText.GetLabels();
    }
}

Custom models

Download the Pretrained Models

Depending on your needs, download one of the pretrained language identification (LID) models provided by Facebook:

  • For the LID model with 176 languages:

    curl --location -o /models/fasttext176.bin https://dl.fbaipublicfiles.com/fasttext/supervised-models/lid.176.bin
    
  • For the LID model with 217 languages:

    curl --location -o /models/fasttext217.bin https://huggingface.co/facebook/fasttext-language-identification/resolve/main/model.bin?download=true
    

Learn more about these models here:

Use custom model in code
using var fastText = new FastTextDetector();

var modelPath = "/models/fasttext176.bin";
fastText.LoadModel(modelPath);

OR

using var fastText = new FastTextDetector();

var modelPath = "/models/fasttext176.bin";
using var stream = File.Open(modelPath, FileMode.Open);
fastText.LoadModel(stream);

Alternatives

If you are exploring other options, here are some alternatives to consider:

Sources

  1. Original FastText Repository
  2. Enriching Word Vectors with Subword Information
  3. Language identification at Wikipedia

We value your feedback. Feel free to open issues or contribute to the repository. Let’s make language detection in .NET even more powerful and versatile! 🌍📝

Happy coding! 👩‍💻👨‍💻


Stay updated by following our repository. For any inquiries or support, reach out through the issues page.

Product Compatible and additional computed target framework versions.
.NET net5.0 is compatible.  net5.0-windows was computed.  net6.0 is compatible.  net6.0-android was computed.  net6.0-ios was computed.  net6.0-maccatalyst was computed.  net6.0-macos was computed.  net6.0-tvos was computed.  net6.0-windows was computed.  net7.0 is compatible.  net7.0-android was computed.  net7.0-ios was computed.  net7.0-maccatalyst was computed.  net7.0-macos was computed.  net7.0-tvos was computed.  net7.0-windows was computed.  net8.0 is compatible.  net8.0-android was computed.  net8.0-browser was computed.  net8.0-ios was computed.  net8.0-maccatalyst was computed.  net8.0-macos was computed.  net8.0-tvos was computed.  net8.0-windows was computed.  net9.0 is compatible. 
Compatible target framework(s)
Included target framework(s) (in package)
Learn more about Target Frameworks and .NET Standard.

NuGet packages

This package is not used by any NuGet packages.

GitHub repositories

This package is not used by any popular GitHub repositories.

Version Downloads Last updated
0.2.0 75 11/10/2024
0.1.1 240 10/22/2024
0.1.0 110 10/7/2024
0.0.0.22 159 9/22/2024
0.0.0.21 116 9/10/2024
0.0.0.20 99 9/8/2024
0.0.0.19 120 9/1/2024
0.0.0.18 106 8/26/2024
0.0.0.17 141 8/21/2024
0.0.0.16 136 8/9/2024
0.0.0.15 109 8/8/2024
0.0.0.14 66 8/3/2024
0.0.0.13 76 8/1/2024
0.0.0.12 97 7/20/2024
0.0.0.10 99 7/14/2024

0.1.0.0
- Windows and MacOS support
     
0.0.0.21:
- Default FastText model is included in NuGet package

0.0.0.10:
- Initial release