[RFC] Range Integer Types #109

JoseRuizAdaCore · 2023-11-10T16:40:41Z

First draft for removing the need for a symmetric base range for signed integer types.

raph-amiard · 2023-11-23T09:52:49Z

@dkm @jklmnn Guys could you review this RFC and tell me if you find any glaring holes in it ? It seems sound and simple enough to me but I would like a second read before handing that to the front-end team :) Thanks in advance!

clairedross · 2023-11-23T10:14:49Z

Hi Jose, I am a bit surprised that you say this RFC has no drawbacks and is backward compatible. Users can access the underlying type used by the compiler using T'Base. They could safely assume that this type was symmetrical before. I am not entirely sure of why you need this, but if it is only for unsigned type and the issue is that you do not want the wrap-around semantics, I believe it would be simpler and more backward compatible to introduce a new annotation No_Wrap_Around on modular types. We have something similar in SPARK to instruct GNATprove that we want overflow checks instead of wraparound semantics.

JoseRuizAdaCore · 2023-11-23T14:58:52Z

Hi Claire. What I meant when I say no drawbacks and backward compatible is that compilers can choose to do what they do today (symmetric base type) and they would respect the proposed wording.

The reason why I want it is to simplify the life of embedded developers. Forcing a larger base type has implications in the run-time capabilities needed.

Now your suggestion of No_Wrap_Around on modular types is very interesting. I think it gives me what I want and it is better for compatibility. Thanks!

raph-amiard · 2023-11-23T17:05:41Z

@dkm @jklmnn Never mind then :)

jklmnn · 2023-11-27T08:45:20Z

While I agree that for full range integer types modular types with No_Wrap_Around could be a solution I still think this change would be a good improvement. The forced symmetry is something that I have seen being a problem many times and e.g. (@treiher correct me if I'm wrong) is one of the reasons we currently only support 32 bit types in RecordFlux.

At least to me adding No_Wrap_Around to modular types look a bit like a workaround. We're changing the semantic of one type to make it work like another type because that other type has the correct semantic but a limitation that doesn't allow us to use its full range.

Additionally I think modular types don't fill all the use cases, even with No_Wrap_Around. I still cannot define a 64 bit positive type:

type Positive_64 is range 1 .. 2 ** 64 - 1;

clairedross · 2023-11-27T09:00:59Z

The proposed modification seems small, but I would expect it to be a big change in the compiler and not backward compatible for users if the compiler was to use this opportunity in general. So I think it might be a good idea to get an opinion from compiler/language experts before going in this direction.

yannickmoy · 2023-11-27T09:36:18Z

@jklmnn I think you can define the type you want:

type Natural_64 is mode 2**64 with No_Wrap_Around;
type Positive_64 is Natural_64 range 1 .. 2**64-1;

clairedross · 2023-11-27T09:50:20Z

If you define it as Yannick proposed, you have the advantage of having the 0 in the base type, so for example you will be able to use it to index an array without worrying about invalid empty array aggregates...

treiher · 2023-11-27T12:54:32Z

While I agree that for full range integer types modular types with No_Wrap_Around could be a solution I still think this change would be a good improvement. The forced symmetry is something that I have seen being a problem many times and e.g. (@treiher correct me if I'm wrong) is one of the reasons we currently only support 32 bit types in RecordFlux.

We only support up to 63-bit integer types in RecordFlux at the moment. The reason is that SPARK uses another representation for modular integer types during proofs, even with No_Wrap_Around, which makes the generated code much more complicated to prove.

@jklmnn I think you can define the type you want:

type Natural_64 is mod 2**64 with No_Wrap_Around;
type Positive_64 is Natural_64 range 1 .. 2**64-1;

@yannickmoy Interesting. So we could use this approach to define a 64-bit integer type that is handled like other signed integers in SPARK proofs?

jklmnn · 2023-11-27T13:09:44Z

@jklmnn I think you can define the type you want:

type Natural_64 is mode 2**64 with No_Wrap_Around;
type Positive_64 is Natural_64 range 1 .. 2**64-1;

To be honest I think that is really confusing and I don't think this is a good solution. If you look at it as someone without any prior knowledge of Ada what would you expect? You would think that modular types have a wrap around and are to be used e.g. for opaque integers or calculations where you want to have that behavior. And you have range types that have overflow checks and whose range you can define. This was the expectation I had when I started with Ada and I don't find it unreasonable to have this expectation coming from other languages. With @JoseRuizAdaCore's RFC this would be it.

Now if you use No_Wrap_Around for that you get a different situation. You can still use modular types as is, except if you add this aspect then they're overflow checked. That's fine so far.
Now range types are different. They behave like integers with overflow checks unless you want them to be unsigned and fit into the size you'd expect (say they're not symmetrical). In this special case you can't use a range type. Instead you need a modular type with an aspect and additionally define yet another type from that modular type that contains the range you want to have. I don't see how this is an improvement over the first situation.

If you define it as Yannick proposed, you have the advantage of having the 0 in the base type, so for example you will be able to use it to index an array without worrying about invalid empty array aggregates...

Don't I have this problem already with symmetric integers if I don't include 0? As from my understanding you can't freely define your range (@JoseRuizAdaCore please correct me if I'm wrong), e.g. you can't define

type Pos_32 is range 1 .. 2 ** 32 with Size => 32;

~~but you can define~~

type Pos_32 is range 1 .. 2 ** 32 - 1 with Size => 32;

in which case 0 would still be 0 if mapped to Universal_Integer. Also it's not clear to me what invalid empty array aggregates are. Does the compiler try to use 0 as a default value in this case and fails if 0 is not within the type?

I got a bit confused. One can already define an integer with the size aspect and with an arbitrary range as long as it doesn't contain more values than can fit into the given size. But I still think my point stands and that this RFC is an improvement for two reasons:

It could simply be a compiler optimization
Using the approach with a modular type is still confusing. Even more so unless you know about this specific behavior of the compiler I'd say people will simply define their 64 bit range type without noticing that it's going to use 128 bit arithmetic (unless your platform doesn't support that and it breaks and nobody knows why).

clairedross · 2023-11-27T13:41:57Z

If you define it as Yannick proposed, you have the advantage of having the 0 in the base type, so for example you will be able to use it to index an array without worrying about invalid empty array aggregates... Don't I have this problem already with symmetric integers if I don't include 0? As from my understanding you can't freely define your range ( @JoseRuizAdaCore <https://github.com/JoseRuizAdaCore> please correct me if I'm wrong), e.g. you can't define type Pos_32 is range 1 .. 2 ** 32 with Size => 32; but you can define type Pos_32 is range 1 .. 2 ** 32 - 1 with Size => 32; in which case 0 would still be 0 if mapped to Universal_Integer. Also it's not clear to me what invalid empty array aggregates are. Does the compiler try to use 0 as a default value in this case and fails if 0 is not within the type?

You can test it already (using modular types so that you do not have a symmetric base type):

…

— Reply to this email directly, view it on GitHub <#109 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AB6NCMPD3RXFCKYVDIJCGLLYGSGKJAVCNFSM6AAAAAA7GNZH6WVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMYTQMRXHAYDKMBYGE> . You are receiving this because you commented.Message ID: ***@***.***>

-- Claire Dross (she/her) Software Engineer, AdaCore

clairedross · 2023-11-27T13:42:19Z

On Mon, Nov 27, 2023 at 2:41 PM Claire Dross ***@***.***> wrote: If you define it as Yannick proposed, you have the advantage of having the > 0 in the base type, so for example you will be able to use it to index an > array without worrying about invalid empty array aggregates... > > Don't I have this problem already with symmetric integers if I don't > include 0? As from my understanding you can't freely define your range ( > @JoseRuizAdaCore <https://github.com/JoseRuizAdaCore> please correct me > if I'm wrong), e.g. you can't define > > type Pos_32 is range 1 .. 2 ** 32 with Size => 32; > > but you can define > > type Pos_32 is range 1 .. 2 ** 32 - 1 with Size => 32; > > in which case 0 would still be 0 if mapped to Universal_Integer. Also > it's not clear to me what invalid empty array aggregates are. Does the > compiler try to use 0 as a default value in this case and fails if 0 is not > within the type? > You can test it already (using modular types so that you do not have a symmetric base type):

procedure Main with SPARK_Mode is type Nat_32 is mod 2 ** 32; type My_Array is array (Nat_32 range <>) of Integer; X : My_Array := []; begin null; end Main; raised CONSTRAINT_ERROR : main.adb:6 range check failed -- Claire Dross (she/her) Software Engineer, AdaCore

JoseRuizAdaCore · 2023-12-05T11:54:06Z

To me, the main point behind this RFC is that imposing a symmetric base range seems like an overlook to me. I don't see any advantage.

So the situation we have is that:

either we implement the No_Wrap_Around attribute for modular types. Then we can define:
type Natural_64 is mod 2**64 with No_Wrap_Around, Size => 64;
type US_64 is new Natural_64 range 0 .. 2**64-1 with Size => 64;
or we implement this RFC and we can define:
type US_64 is range 0 .. 2**64-1 with Size => 64;

I think the semantics of both options would be equivalent, but the second is less confusing.

Implementing this RFC changes the semantics of the base type, but we could keep the symmetric base type as the default and the non-symmetric base type controlled by a compiler option.

yannickmoy · 2023-12-05T12:01:57Z

@JoseRuizAdaCore you can already define with GNAT a 64-bits signed type from 0 to 2**64-1, so GNAT accepts:

type US_64 is range 0 .. 2**64-1 with Size => 64;

This is what the GNAT frontend calls an "unsigned" type. But its base type is still the 128-bits signed integer type.

yannickmoy · 2023-12-05T12:07:45Z

So if you want to be able to specify that its base type is only 64-bits, you need a new annotation, like:

type US_64 is range 0 .. 2**64-1 with Size => 64, Base'Size => 64;

Now, what are the consequences for compilation, in particular for bound checking?

ebotcazou · 2023-12-05T12:39:23Z

Note that overflow checking is implemented differently for signed and unsigned types: the former uses the Overflow flag on most processors, whereas the latter uses the Carry flag for addition but is potentially more involved for multiplication.

yannickmoy · 2023-12-05T13:03:44Z

@ebotcazou so it would not be an issue to have a type like US_64 above whose base type has also 64-bits?

JoseRuizAdaCore · 2023-12-05T13:04:11Z

So if you want to be able to specify that its base type is only 64-bits, you need a new annotation, like:
type US_64 is range 0 .. 2**64-1 with Size => 64, Base'Size => 64;
Now, what are the consequences for compilation, in particular for bound checking?

But if you don't change the Ada definition of signed integer types (i.e., if you still force a symmetric base range) the Base'Size => 64 would be rejected.

sttaft · 2023-12-05T13:44:29Z

I am not entirely sure what José means by an overlook but there are pretty good reasons why Ada has always required symmetric signed integer types. It will require significantly more care to use an asymmetric signed integer type properly. For example, simple things like "if X - Y in -5 .. +5" will fail if Y is greater than X. Even "if abs(X - Y) in 0 .. +5" will fail if Y > X, since you can get Constraint_Error if an intermediate in some computation goes outside the base range of the type, and for such an asymmetric type, there would be no negative values in the base range of the type, so any intermediate that is negative could cause a Constraint_Error.

sttaft · 2023-12-05T13:50:58Z

But if you don't change the Ada definition of signed integer types (i.e., if you still force a symmetric base range) the Base'Size => 64 would be rejected.

I would agree with José that specifying a size is perhaps hiding the issue. The fundamental issue is that you are changing the effective base range of the type. As indicated in my prior comment, that has implications for the ease of use of the type, or at least, the ease of use of the subtraction operator of the type. One might almost want to say such types don't have a subtraction operator because it is so likely to result in trouble, and clearly unary minus is useless except on the zero value ;-) So I would say this deserves more than just a subtle Base'Size clause. I would say it deserves an aspect such as No_Negative_Intermediates => True.

JoseRuizAdaCore · 2023-12-05T14:11:30Z

I see Tuck, thanks, very enlightening.

ebotcazou · 2023-12-05T17:51:40Z

@yannickmoy Overflow checking would change, but I think that both GCC and LLVM have primitives to do it whatever the signedness now, so this would be minimal.

raph-amiard · 2023-12-08T10:21:44Z

@JoseRuizAdaCore what's the status in your opinion ? Is this close to converging or does this need a rewrite ?

JoseRuizAdaCore · 2023-12-08T17:39:58Z

@raph-amiard we are not converging. There are big questions about backward incompatibilities.

florianschanda · 2024-09-27T16:27:09Z

I like this proposal, because as you know I think the current No_Wrap_Around extension is extremely confusing.

Allowing the compiler to choose an unsigned base type for existing code is not a good idea, because something people use 'Base and extend the type in the front, and this would break some existing use-cases.

I would propose simple something like:

type T is range 0 .. 255 with Unsigned_Base_Type;

As @sttaft pointed out intermediates underflowing is possibly annoying, but this makes it pretty clear that it's what the user asked for.

For annotations, with -gnato13 you can still go outside the range in a useful way so I don't think it'll too big deal for SPARK. Once this is implemented in Ada and SPARK, the GNAT extension No_Wrap_Around can be removed.

sttaft · 2024-09-27T16:58:06Z

I would propose simple something like:
type T is range 0 .. 255 with Unsigned_Base_Type;

That would be fine with me, though I would prefer "Unsigned_Base_Range" to better match existing Ada terminology ("base type" is not an Ada term). The key thing is that it requires an explicit aspect on the type declaration, similar to the "No_Negative_Intermediates" suggestion above, so it is upward compatible.

florianschanda · 2024-09-27T17:03:39Z

I like Unsigned_Base_Range. No_Negative_Intermediates I think is confusing a bit for two reasons:

Negatives statements are less clear positive statements
No_Negative_Intermediates doesn't imply that the actual base type will be unsigned

jklmnn

Two minor comments, looks good otherwise.

I agree with @florianschanda here, using Unsigned_Base_Range makes it explicit and clear without breaking existing code.

considered/rfc-range-integer-types.rst

rfc-template.md

First draft for removing the need for a symmetric base range for signed integer types.

sttaft · 2024-12-21T00:17:29Z

considered/rfc-range-integer-types.rst

+Prior art
+=========
+
+This is the way unsigned types are defined in other languages, like C.


This might be misleading. In C, unsigned types are generally guaranteed to wrap around, and the notion of an overflow exception doesn't really exist.

Yes, thanks, I just reworded it to make it clearer

JoseRuizAdaCore · 2025-04-02T14:20:14Z

Hi @clairedross. This proposal has been updated, taking into account the discussion we had with @florianschanda.
Can you please review it to check whether it matches your expectations?
Thanks!

clairedross · 2025-04-02T14:55:01Z

It matches what I recall from the call indeed. Thanks for the update.

JoseRuizAdaCore · 2025-04-02T15:09:49Z

Thanks Claire!

raph-amiard requested review from dkm and jklmnn November 23, 2023 09:49

jklmnn requested changes Sep 30, 2024

View reviewed changes

considered/rfc-range-integer-types.rst Outdated Show resolved Hide resolved

rfc-template.md Outdated Show resolved Hide resolved

JoseRuizAdaCore added 2 commits December 19, 2024 17:28

Create RFC for range integer types

6978080

First draft for removing the need for a symmetric base range for signed integer types.

Clarify the proposal with the addition of an aspect

9cf7460

JoseRuizAdaCore force-pushed the patch-1 branch from e6b1fd0 to 9cf7460 Compare December 19, 2024 16:29

sttaft reviewed Dec 21, 2024

View reviewed changes

Clarify relation with C unsigned types

e2d2440

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[RFC] Range Integer Types #109

[RFC] Range Integer Types #109

JoseRuizAdaCore commented Nov 10, 2023

raph-amiard commented Nov 23, 2023

clairedross commented Nov 23, 2023 •

edited

Loading

JoseRuizAdaCore commented Nov 23, 2023

raph-amiard commented Nov 23, 2023

jklmnn commented Nov 27, 2023 •

edited

Loading

clairedross commented Nov 27, 2023

yannickmoy commented Nov 27, 2023

clairedross commented Nov 27, 2023

treiher commented Nov 27, 2023

jklmnn commented Nov 27, 2023 •

edited

Loading

clairedross commented Nov 27, 2023 via email

clairedross commented Nov 27, 2023 via email

JoseRuizAdaCore commented Dec 5, 2023 •

edited

Loading

yannickmoy commented Dec 5, 2023 •

edited

Loading

yannickmoy commented Dec 5, 2023

ebotcazou commented Dec 5, 2023

yannickmoy commented Dec 5, 2023

JoseRuizAdaCore commented Dec 5, 2023

sttaft commented Dec 5, 2023

sttaft commented Dec 5, 2023

JoseRuizAdaCore commented Dec 5, 2023

ebotcazou commented Dec 5, 2023

raph-amiard commented Dec 8, 2023

JoseRuizAdaCore commented Dec 8, 2023

florianschanda commented Sep 27, 2024 •

edited

Loading

sttaft commented Sep 27, 2024

florianschanda commented Sep 27, 2024 •

edited

Loading

jklmnn left a comment

sttaft Dec 21, 2024

JoseRuizAdaCore Dec 23, 2024

JoseRuizAdaCore commented Apr 2, 2025

clairedross commented Apr 2, 2025

JoseRuizAdaCore commented Apr 2, 2025

[RFC] Range Integer Types #109

Are you sure you want to change the base?

[RFC] Range Integer Types #109

Conversation

JoseRuizAdaCore commented Nov 10, 2023

raph-amiard commented Nov 23, 2023

clairedross commented Nov 23, 2023 • edited Loading

JoseRuizAdaCore commented Nov 23, 2023

raph-amiard commented Nov 23, 2023

jklmnn commented Nov 27, 2023 • edited Loading

clairedross commented Nov 27, 2023

yannickmoy commented Nov 27, 2023

clairedross commented Nov 27, 2023

treiher commented Nov 27, 2023

jklmnn commented Nov 27, 2023 • edited Loading

clairedross commented Nov 27, 2023 via email

clairedross commented Nov 27, 2023 via email

JoseRuizAdaCore commented Dec 5, 2023 • edited Loading

yannickmoy commented Dec 5, 2023 • edited Loading

yannickmoy commented Dec 5, 2023

ebotcazou commented Dec 5, 2023

yannickmoy commented Dec 5, 2023

JoseRuizAdaCore commented Dec 5, 2023

sttaft commented Dec 5, 2023

sttaft commented Dec 5, 2023

JoseRuizAdaCore commented Dec 5, 2023

ebotcazou commented Dec 5, 2023

raph-amiard commented Dec 8, 2023

JoseRuizAdaCore commented Dec 8, 2023

florianschanda commented Sep 27, 2024 • edited Loading

sttaft commented Sep 27, 2024

florianschanda commented Sep 27, 2024 • edited Loading

jklmnn left a comment

Choose a reason for hiding this comment

sttaft Dec 21, 2024

Choose a reason for hiding this comment

JoseRuizAdaCore Dec 23, 2024

Choose a reason for hiding this comment

JoseRuizAdaCore commented Apr 2, 2025

clairedross commented Apr 2, 2025

JoseRuizAdaCore commented Apr 2, 2025

clairedross commented Nov 23, 2023 •

edited

Loading

jklmnn commented Nov 27, 2023 •

edited

Loading

jklmnn commented Nov 27, 2023 •

edited

Loading

JoseRuizAdaCore commented Dec 5, 2023 •

edited

Loading

yannickmoy commented Dec 5, 2023 •

edited

Loading

florianschanda commented Sep 27, 2024 •

edited

Loading

florianschanda commented Sep 27, 2024 •

edited

Loading