Julia and R: A Deep Dive into Two Data Powerhouses – IT Exams Training

In the growing world of data analysis and scientific computing, programming languages act as essential tools. Among the most specialized languages in this space are Julia and R. Both are built with data in mind but serve their purposes in distinct ways. R, with its roots in statistics, has been a trusted language for decades. Julia, a more recent addition to the programming scene, offers cutting-edge performance and flexibility.

While R dominates academic and statistical circles, Julia is carving out a niche in technical computing where speed and scalability matter most. Understanding how each of these languages works—and where they shine—can help developers, analysts, and data scientists make more strategic choices.

The Importance of Choosing the Right Language

The programming language you start with can influence how you approach problems, build solutions, and integrate into various fields. Not every language is suited to every task, and your selection can depend on several factors—such as industry norms, project requirements, or your learning background.

Both Julia and R provide tools to analyze, manipulate, and visualize data. Yet, how they achieve this differs in implementation and philosophy. Picking a language isn’t necessarily about finding the best overall but rather about choosing the best for your specific context. That said, learning either Julia or R can be an excellent first step into the world of data.

A Look into the Origins

R was created in the early 1990s, inspired by the S programming language. It quickly became a go-to tool for statisticians, supported by its open-source nature and a thriving community. Over the years, it expanded into a versatile language used in academia, healthcare, marketing, and social sciences.

Julia entered the scene in 2012. Its goal was ambitious—to combine the speed of low-level languages with the simplicity of high-level ones. Designed from scratch for numerical computing, it found immediate adoption in fields like physics, economics, and machine learning. Though younger, Julia is growing rapidly, and its design solves many of the performance issues found in older languages.

Programming Philosophy and Goals

R was developed primarily for statistical computing. Its focus remains on enabling users to perform complex statistical operations with minimal code. Most R functions are tailored for data modeling, visualization, and transformation.

Julia was created with performance as a top priority. Its just-in-time compilation makes it suitable for tasks that demand high speed. Unlike R, which leans heavily into statistical workflows, Julia aims to be a general-purpose scientific language. This gives Julia the versatility to handle diverse challenges, from simulations to numerical analysis and beyond.

Learning Curve and Syntax Design

Ease of learning often determines how quickly someone can become productive. R was built for statisticians, so its syntax and functions may feel unusual to those with a traditional programming background. It has several idiosyncrasies, especially around object types, formula notation, and model fitting.

Julia’s syntax is cleaner and closer to mainstream languages like Python, MATLAB, or even C. It allows programmers to write readable code without sacrificing performance. For beginners familiar with other languages, Julia tends to be more intuitive and requires less adjustment.

A defining feature of Julia is multiple dispatch, which selects function behavior based on all argument types rather than just one. This enables powerful and elegant code reuse, especially in mathematical operations. R does have some form of multiple dispatch through its S4 system, but it is neither as fast nor as central to the language.

Development Environments

The right development environment can boost productivity, especially during learning and experimentation.

R integrates seamlessly with RStudio, a feature-rich and purpose-built IDE. RStudio includes an editor, data viewer, visualization pane, and integration with R Markdown for creating interactive documents. It’s the standard environment for most R users.

Julia doesn’t have a single dominant IDE, but it supports several. Visual Studio Code, with Julia plugins, is a popular choice. Julia also offers its own reactive notebook environment called Pluto. Pluto notebooks allow live interactivity, code updates, and dynamic user input. This makes them suitable for teaching, presentations, and live demonstrations.

Both R and Julia can be used within Jupyter notebooks as well. These notebooks allow the blending of narrative text and executable code, supporting exploratory workflows in a reproducible format.

Speed and Performance Considerations

Performance is a key area where Julia clearly leads. Because it compiles code just before execution using the LLVM framework, Julia code runs extremely fast—often rivaling C or Fortran. This makes Julia well suited for heavy computation tasks like simulations, modeling, and optimization.

R, being an interpreted language, generally performs slower. It relies on optimized packages or integration with other compiled languages like C++ or Fortran for speed. While packages like data.table or libraries written in C can improve performance, they don’t eliminate the inherent limitations of R’s interpreted nature.

Julia does not require vectorization to speed up computations, thanks to its compiler optimizations. R, on the other hand, often needs code to be vectorized—processing entire arrays at once—to run efficiently.

Package Ecosystem and Community Contributions

One of R’s major strengths lies in its rich ecosystem. With nearly 19,000 packages available on its main repository, there is likely an R package for every conceivable statistical task. This vast library makes it easy to implement models, draw visualizations, and run diagnostics without building tools from scratch.

Julia has a growing but smaller ecosystem with around 7,400 registered packages. While this number may seem modest compared to R, it’s expanding steadily. The Julia community is active, and the language’s structure encourages package development. Julia’s packages are often high quality and optimized for performance.

Both languages support interoperability. Julia can call R and Python functions, and R can likewise interface with Julia and Python. This cross-language integration helps leverage the strengths of each language while minimizing their weaknesses.

Package Management Tools

Managing dependencies and creating reproducible environments is essential in collaborative projects. R uses tools like Packrat and renv to isolate project environments and track package versions. These tools allow teams to reproduce analyses across different machines without breaking workflows.

Julia comes with a built-in package manager called Pkg. It manages environments directly in each project folder, storing a manifest file with all package information. Julia’s stackable environment system also allows packages from one project to be reused in another, improving efficiency across projects.

Availability of Learning Resources

R, with its long-standing presence, offers an abundance of learning materials. From textbooks and MOOCs to user forums and conferences, R learners have access to decades of shared knowledge. Online communities like Stack Overflow are filled with active discussions, and most questions about R have already been answered.

Julia’s community, though newer, is also thriving. It may not yet have the same volume of tutorials and documentation as R, but it is growing quickly. The user base is highly collaborative, and the percentage of unanswered Julia questions is lower than R, showing that users receive quick help from the community.

Notebook Interfaces and Interactivity

Both R and Julia support notebook-style coding for exploratory analysis. Jupyter notebooks can run both R and Julia kernels, letting users interleave code, text, and visualizations. However, each language also has its own native solution.

R uses R Markdown, a notebook-like format within RStudio. This allows users to mix prose and code, knit reports, and generate visualizations in real-time. It’s ideal for sharing insights and generating automated reports.

Julia’s Pluto notebooks offer reactivity—any change to a variable updates all dependent cells automatically. This feature makes it easier to test changes and build interactive applications within the notebook environment. Pluto also supports input controls like sliders and buttons, which add user interactivity to code execution.

Use Cases and Target Audience

R’s dominance in academia and public health stems from its statistical capabilities. Many published studies, government reports, and institutional dashboards are built using R. It remains a critical tool for biostatisticians, epidemiologists, social scientists, and market researchers.

Julia is making waves in domains that require performance. Engineers, physicists, financial modelers, and AI researchers often prefer Julia for its ability to process large datasets and run simulations without bottlenecks. It’s particularly useful in scenarios where performance can translate to real-time decision-making or reduced operational costs.

Adoption in Industry and Academia

R has widespread adoption across universities, non-profits, consulting firms, and government bodies. Instructors use it to teach statistics, while analysts use it for reporting and modeling. The large base of R users also ensures that businesses can hire from a deep talent pool.

Julia is gaining adoption in specialized domains. Though not yet as widely used in industry, it is making inroads where computational performance is a must. Julia is also starting to appear in academic curricula focused on scientific computing and engineering applications.

Flexibility and Future Potential

R has matured and stabilized, offering reliable performance and an extensive ecosystem. Its strength is consistency in statistical processing and its long track record of success. However, its performance ceiling is a limiting factor for some modern applications.

Julia, with its design focused on future computing needs, is poised for expansion. As data volumes grow and computational demands increase, Julia’s architecture makes it well positioned for adoption in data-intensive industries. Its potential lies not just in replacing R or Python, but in offering an entirely new model for scientific programming.

Learning either Julia or R opens doors to different opportunities. R offers immediate access to a mature ecosystem, job opportunities, and community support. Julia delivers on performance and scalability, making it a compelling choice for cutting-edge applications in machine learning and scientific computing.

Choosing between them is less about finding a winner and more about aligning with your goals. For beginners in data analysis, R might be the easier and more practical first step. For professionals looking to extend capabilities or solve high-performance problems, Julia could be the key to unlocking new potential.

Key Technical Differences Between Julia and R

When evaluating Julia and R for data science, it’s important to understand how they differ under the hood. While they are both high-level languages tailored for working with data, their approaches to programming paradigms, performance, and flexibility are rooted in different philosophies. This section explores how these languages compare in terms of language design, computational features, and performance optimization.

Dispatch Systems and Functional Behavior

One of the most significant distinctions between Julia and R lies in how they handle function execution based on input types. R uses a method called single dispatch in its older object system, S3. In this system, the function’s behavior depends primarily on the class of the first argument. While R also supports the S4 system with multiple dispatch, it is less commonly used and introduces complexity that many users avoid.

Julia, however, was designed from the outset with multiple dispatch as a core principle. This means that functions can respond differently based on all the input types passed to them, not just the first. This leads to more adaptable and cleaner code, particularly in numerical applications and simulations. Multiple dispatch is one of Julia’s defining strengths, making it ideal for scientific computing where function flexibility and performance are critical.

Vectorization and Execution Strategy

In many high-level languages, developers often rely on vectorization—processing entire data sets or arrays at once—to optimize performance. This is especially true for R, where loops and iterative structures can be slower unless optimized by external libraries. R users are therefore encouraged to write code that operates on whole vectors or matrices simultaneously.

Julia, on the other hand, is designed to perform fast computations even when using simple loops. Its JIT (just-in-time) compiler translates code into efficient machine instructions, so there’s less need for developers to reshape their code for speed. This makes Julia not only faster but also more intuitive for those coming from traditional programming backgrounds who prefer using loops and procedural styles.

Data Handling and Manipulation

Data manipulation is a fundamental part of data science. R has been optimized for this with tools like dplyr, tidyr, and data.table. These libraries provide streamlined ways to reshape, filter, and summarize data. Combined with R’s native support for data frames, users can efficiently perform even complex transformations with a few lines of code.

Julia supports similar workflows using packages like DataFrames.jl. While it may not yet offer the same breadth of capabilities found in R’s mature ecosystem, it covers most essential operations like grouping, filtering, and joining datasets. Moreover, Julia’s syntax for data manipulation is readable and flexible, especially for users who prefer chaining operations in a clear sequence.

Visualization Capabilities

Visualization is where R truly excels. Its graphics libraries such as ggplot2, lattice, and base plotting tools allow users to produce publication-quality visuals with ease. The grammar of graphics approach used by ggplot2 has set a benchmark for data visualization in programming languages, enabling users to construct layered and informative plots with detailed control.

Julia also has visualization tools, such as Plots.jl, Gadfly, and Makie. These packages provide interactive and static plotting features, although the ecosystem is still growing. While Julia’s plots can be highly performant and beautiful, they may require more setup or exploration compared to R’s more plug-and-play approach.

Notebook Interfaces and Reactive Coding

Both Julia and R are notebook-compatible, and this is a key advantage in data science, where exploration and documentation go hand in hand. Jupyter notebooks support kernels for both languages and allow users to combine narrative text with code and output, making them ideal for analysis and collaboration.

R takes this a step further with R Markdown, a format that integrates code, results, and narrative into a single reproducible report. These documents are widely used in academia and business for generating reports, dashboards, and interactive documents.

Julia’s Pluto notebook is an innovation in reactive programming. Pluto notebooks automatically update results when inputs change, ensuring that output is always in sync. This makes them especially useful for developing interactive reports, tutorials, and simulations. The live-updating nature of Pluto adds a modern dynamic to the notebook experience.

Package Management and Environment Isolation

Maintaining consistent environments is crucial when sharing projects or working on teams. R manages this through tools like Packrat and renv. These create isolated environments where specific package versions are locked, avoiding the risk of conflicts or version mismatches across projects.

Julia includes Pkg as a built-in environment manager. Pkg handles installation, updates, and project-specific package manifests. Julia environments are local to each project and can be stacked or shared, providing excellent flexibility for managing complex software dependencies.

Julia’s environment system is particularly valuable for users working on multiple concurrent projects or collaborating in teams, as it reduces the friction of switching contexts and dependencies.

Community Support and Documentation

R’s longevity has resulted in a massive online community. It is widely taught in universities, and support is available through countless tutorials, documentation pages, forums, and books. Stack Overflow hosts hundreds of thousands of R-related questions, many of which have comprehensive answers.

Julia’s community, though smaller, is growing quickly. Its users are enthusiastic and often deeply involved in scientific computing, contributing actively to development and support. Julia’s official documentation is well-maintained, and there are growing numbers of tutorials, courses, and learning materials emerging regularly.

One of the distinguishing features of the Julia community is its openness to newcomers. Community forums, online discussions, and meetups are often welcoming and supportive, even though the user base is more specialized than R’s.

Interoperability with Other Languages

No single language can do everything. Often, combining tools from multiple languages provides the best results. Both Julia and R support this kind of interoperability.

R can interface with C++, Python, and Julia through specific packages and APIs. This allows R users to call external libraries when high performance or additional capabilities are needed.

Julia also supports calling Python, C, and R libraries, enabling users to extend Julia scripts with familiar tools. For instance, if a specific machine learning package exists only in Python, Julia users can import and use it directly through PyCall or similar tools. This compatibility increases flexibility but comes with trade-offs in performance or integration complexity.

Application Domains and Practical Use

R is dominant in industries that require intensive statistical modeling and data reporting. Public health, social sciences, academic research, and government agencies frequently rely on R for its comprehensive toolset and documentation. R is also heavily used in education, with many universities incorporating it into data science curricula.

Julia is rising in fields where heavy computation is a requirement. These include physics, astronomy, economics, engineering, and artificial intelligence. Its ability to handle demanding mathematical workloads makes it particularly appealing to researchers, scientists, and engineers working with simulations, optimization, and real-time data analysis.

Integration in Enterprise Systems

R has matured into a reliable enterprise tool. It integrates well with databases, APIs, cloud platforms, and business intelligence systems. Enterprise-level reporting tools and dashboards can be built using R Shiny, enabling non-technical users to interact with data models and analytics in real-time.

Julia’s enterprise adoption is not as widespread, but it is increasing. More businesses are recognizing Julia’s ability to handle real-time analytics and process large-scale computations efficiently. While it may require more engineering effort to integrate into traditional enterprise systems, the benefits can be significant in performance-critical applications.

Performance Benchmarks and Real-World Scenarios

Performance comparisons between Julia and R often show Julia outperforming R in tasks like numerical simulation, optimization problems, and matrix operations. This is not surprising, as Julia was designed for speed. For example, machine learning models written in Julia can often run faster without requiring external libraries.

R, however, still performs well for statistical modeling and standard data analysis tasks. Its well-optimized packages and integration with fast back-end languages help maintain acceptable performance for most applications. When speed becomes a bottleneck, users often turn to Rcpp to integrate C++ code, though this increases complexity.

Julia removes the need for this workaround by allowing high-speed performance without leaving the language.

Learning Pathways and Educational Use

R has been widely adopted in academic institutions, especially in statistics and social sciences. Its usage in teaching introductory data analysis is prevalent due to its intuitive statistical functions and vast support materials.

Julia is making its way into technical and scientific education programs. Its clean syntax, speed, and scientific libraries make it suitable for teaching numerical methods, optimization, and computational modeling. It’s also gaining ground in summer schools, workshops, and online platforms focusing on scientific computing.

Considerations for Career Development

Choosing which language to learn can also depend on your professional goals. If you’re targeting jobs in traditional data science roles, public policy, health, or marketing analytics, learning R provides immediate value and wide-ranging job opportunities.

If your focus is high-performance computing, systems modeling, or machine learning at scale, Julia offers strong advantages. As more businesses look to optimize performance and reduce operational costs, Julia’s relevance in the job market is likely to expand.

Choosing the Best Fit for Your Workflow

Neither language is inherently better. It depends on your goals, the nature of your work, and your team’s needs. If you value a mature environment with extensive documentation and tools for data reporting, R fits the bill. If you need rapid computation, reusable function designs, and clean syntax, Julia stands out.

Both languages can also be used together, depending on the project. Many professionals integrate them into multi-language pipelines to benefit from the strengths of each.

Understanding Real-World Usage of Julia and R

Languages like Julia and R aren’t just academic tools; they are solving real-world problems in business, science, finance, and more. While each language has its roots in distinct disciplines, both are evolving to meet the complex demands of today’s data-driven world.

Organizations are increasingly turning to data science to drive decision-making, and the tools they choose can significantly impact performance, collaboration, and outcomes. This section explores how Julia and R are used in various sectors and the environments where each language truly excels.

Statistical Analysis and Reporting in Practice

R is synonymous with statistical analysis. Organizations that deal heavily with data reporting, statistical testing, or forecasting often rely on R. Its mature libraries and graphical capabilities enable professionals to create clean, well-documented statistical models and dashboards.

In healthcare, for instance, researchers use R to monitor trends, predict disease outbreaks, and evaluate clinical trials. Similarly, in social sciences, R facilitates survey analysis, regression modeling, and hypothesis testing. Tools like R Markdown and Shiny allow users to generate reproducible reports and interactive applications that support transparency and communication.

Julia, while not as entrenched in reporting workflows, is gaining popularity among researchers who want both statistical analysis and computational performance. Its ability to handle large datasets and perform real-time analysis makes it suitable for environments where speed is critical, like live experiments or simulations.

Scientific Computing and High-Performance Applications

Julia was built for numerical and scientific computing. It shines in domains that require processing complex equations, simulations, or models across massive datasets. This includes fields like climate modeling, fluid dynamics, computational biology, and astrophysics.

Because Julia performs nearly at the level of compiled languages like C or Fortran, it removes the traditional trade-off between usability and speed. Engineers and scientists no longer need to write performance-critical code in a separate language.

One clear example of Julia’s impact is in optimization problems. Industries that rely on operations research or predictive modeling benefit from Julia’s fast solvers and optimization libraries. Whether it’s for routing delivery trucks, allocating energy resources, or calibrating economic models, Julia handles intensive computation with efficiency.

Data Visualization and Presentation

Visualizing data is an essential step in any analytical process. R’s strengths in this area are widely acknowledged. Through libraries like ggplot2 and base plotting tools, R enables analysts to produce layered, expressive charts. These visualizations are useful not only for analysis but also for storytelling and presenting results to stakeholders.

In business settings, such as marketing analytics or customer segmentation, R’s charts can help explain complex insights in a digestible format. It also supports the creation of interactive dashboards that update in real time based on data input.

Julia’s visualization capabilities are growing steadily. Tools like Makie and Plots.jl provide a range of options from static charts to animated graphics. While these packages may not yet rival R’s visualization ecosystem in terms of variety and simplicity, they offer significant flexibility and performance—especially for 3D and real-time graphics.

Machine Learning and Artificial Intelligence

Both R and Julia support machine learning, but their approaches differ. R focuses on ease of use and access to pre-built models. With packages like caret, randomForest, and xgboost, data scientists can rapidly build predictive models. These packages simplify model training, evaluation, and tuning, making them well-suited for standard classification or regression tasks.

Julia, however, is carving out a place in more complex and large-scale machine learning projects. Libraries like Flux.jl and MLJ.jl allow for the design of custom models with high performance and flexibility. Julia’s speed makes it ideal for deep learning and optimization-heavy tasks, especially when working with custom layers or non-standard architectures.

For teams building proprietary models or working with high-frequency data, Julia allows end-to-end model development in a single language without sacrificing execution speed.

Finance and Economic Modeling

Financial analysts and economists often require languages that combine modeling precision with statistical tools. R has a long-standing presence in finance, with packages that support time series analysis, portfolio optimization, risk modeling, and econometrics. Institutions use R to evaluate markets, simulate trading strategies, and model economic indicators.

Julia is increasingly being used in financial modeling for tasks that demand performance, such as real-time pricing, algorithmic trading, and Monte Carlo simulations. With its focus on numerical stability and speed, Julia can efficiently process thousands of scenarios to support risk evaluation or asset pricing.

Quantitative analysts appreciate Julia’s ability to handle complex models and large matrices, making it an attractive option in high-frequency trading environments.

Adoption Across Industries

R’s widespread adoption makes it a safe choice for most data science roles. It’s embedded in many enterprise analytics systems and is part of standard workflows in sectors like public health, academia, consulting, and policy analysis. Because of its longevity, businesses also have access to a large talent pool of R users.

Julia, while newer, is gaining momentum in specialized industries where performance translates directly into cost savings or competitive advantage. These include aerospace, robotics, renewable energy, pharmaceuticals, and academic research. Although the number of organizations using Julia is still growing, those that do report significant gains in performance and productivity.

Job Market and Career Opportunities

Career prospects often influence language selection. R remains one of the top languages listed in job descriptions for data analysts, biostatisticians, and academic researchers. Its prevalence in job boards and its use in education make it a reliable skill for those entering the workforce.

In contrast, Julia jobs are fewer in number but often concentrated in cutting-edge fields. Positions requiring Julia are likely to involve simulation, modeling, or computational research. These roles often offer more technical depth, and while they are fewer, they can command higher pay due to the specialized nature of the work.

Those considering long-term career growth may find that learning R opens more doors in the short term, while picking up Julia later can position them for emerging opportunities in high-tech sectors.

Learning Path and Skill Development

For individuals new to data science, R provides a structured and supportive environment. The documentation is robust, online courses are plentiful, and the community is responsive. R is ideal for mastering statistics, learning data wrangling, and building reproducible research projects.

Julia is a great second language for those who want to build on their data science skills with performance-focused tools. It’s a natural step for anyone interested in engineering applications, modeling, or large-scale analysis. Julia also introduces more advanced programming concepts like multiple dispatch and memory management, which are useful for deeper understanding of computing.

Both languages encourage good practices like reproducible research and modular programming, which are essential for collaboration and scaling.

Choosing Based on Project Needs

It is not always necessary to choose one language over the other. In fact, many professionals use both in different stages of a project. For instance, they might use R for quick data exploration and visualization, and then switch to Julia for simulation or large-scale processing.

This dual approach allows users to take advantage of R’s vast package ecosystem and Julia’s performance. Integration tools make it easier than ever to move between languages or call one from within the other.

Understanding which language suits your current project requirements is more useful than sticking to a single tool. If you’re reporting to business stakeholders or generating standardized models, R is ideal. If you’re solving performance-critical problems or building simulation engines, Julia offers clear advantages.

Looking Ahead: The Future of Julia and R

R is not going away anytime soon. Its deep integration into research, education, and reporting keeps it relevant and essential. As the world becomes more data-driven, R’s role in foundational analysis remains important.

Julia, however, is positioned to grow rapidly. Its design caters to future computing demands, including high-throughput applications and real-time analytics. As more organizations encounter performance limits with traditional tools, Julia is likely to be adopted in scenarios that demand responsiveness and computational power.

For learners and professionals alike, being familiar with both tools can future-proof their careers. Flexibility, rather than allegiance to one language, is the hallmark of a skilled data practitioner.

Strategic Advice for Learners and Teams

If you’re just starting out in data science or statistics, begin with R. It provides strong fundamentals, a wide range of educational resources, and practical tools for analysis and reporting. As you grow more confident and face new challenges, adding Julia to your skill set can open doors to high-performance computing and advanced modeling.

For teams, adopting both languages allows for a broader range of projects and improved productivity. Building hybrid systems or multi-language pipelines lets each tool shine where it is strongest.

The future belongs to those who can navigate diverse tools, integrate knowledge across platforms, and adapt to the evolving landscape of data science.

Conclusion

Choosing the right programming language is one of the first and most crucial decisions for anyone stepping into the world of data science and analytics. Both Julia and R have unique advantages and are suited to different types of tasks, users, and environments.

R stands out as a mature, feature-rich language deeply integrated into statistics, academic research, and business reporting. It offers a massive ecosystem of packages, excellent visualization tools, and a strong foundation for anyone starting in data-related careers. Whether for data wrangling, building dashboards, or conducting rigorous statistical testing, R provides the necessary tools and community support to get the job done effectively.

Julia, on the other hand, is designed for speed and performance. Its ability to handle complex computations and large-scale data operations without sacrificing usability makes it a compelling choice for scientific computing, machine learning, optimization, and simulations. Its growing ecosystem and increasing presence in technical industries suggest that Julia will play a significant role in the future of high-performance data science.

There is no universal answer to which language is better. The choice depends on the goals of the individual or organization. R may be ideal for data analysis in social sciences or health research, while Julia may be better suited for simulation-heavy tasks in engineering or finance.

Ultimately, the best approach is not to treat this as an either-or decision. Mastering both R and Julia allows data professionals to expand their toolkit, become more versatile, and solve a broader range of problems. As industries become more data-intensive, the ability to adapt and work across different programming environments will be increasingly valuable.

Start with the language that aligns with your immediate needs, but stay open to exploring the other as your projects evolve. Embracing both tools will ensure you stay relevant and effective in the dynamic and ever-growing field of data science.