Skip to content
Snippets Groups Projects
  1. Sep 24, 2024
    • Robin Dapp's avatar
      RISC-V: Fix effective target check. · e45537f5
      Robin Dapp authored
      The return value is inverted in check_effective_target_rvv_zvl256b_ok
      and check_effective_target_rvv_zvl512b_ok.  Fix this and also just use
      the current march.
      
      gcc/testsuite/ChangeLog:
      
      	* lib/target-supports.exp: Fix effective target check.
      e45537f5
    • Andre Vehreschild's avatar
      Fortran: Allow to nullify caf token when not in ultimate component. [PR101100] · 0c0d79c7
      Andre Vehreschild authored
      gcc/fortran/ChangeLog:
      
      	PR fortran/101100
      
      	* trans-expr.cc (trans_caf_token_assign): Take caf-token from
      	decl for non ultimate coarray components.
      
      gcc/testsuite/ChangeLog:
      
      	* gfortran.dg/coarray/proc_pointer_assign_1.f90: New test.
      0c0d79c7
    • Jason Merrill's avatar
      build: enable C++11 narrowing warnings · 2249c3b4
      Jason Merrill authored
      We've been using -Wno-narrowing since gcc 4.7, but at this point narrowing
      diagnostics seem like a stable part of C++ and we should adjust.
      
      This patch changes -Wno-narrowing to -Wno-error=narrowing so that narrowing
      issues will still not break bootstrap, but we can see them.
      
      The rest of the patch fixes the narrowing warnings I see in an
      x86_64-pc-linux-gnu bootstrap.  In most of the cases, by adjusting the types
      of various declarations so that we store the values in the same types we
      compute them in, which seems worthwhile anyway.  This also allowed us to
      remove a few -Wsign-compare casts.
      
      gcc/ChangeLog:
      
      	* configure.ac (CXX_WARNING_OPTS): Change -Wno-narrowing
      	to -Wno-error=narrowing.
      	* configure: Regenerate.
      	* config/i386/i386.h (debugger_register_map)
      	(debugger64_register_map)
      	(svr4_debugger_register_map): Make unsigned.
      	* config/i386/i386.cc: Likewise.
      	* diagnostic-event-id.h (diagnostic_thread_id_t): Make int.
      	* vec.h (vec::size): Make unsigned int.
      	* ipa-modref.cc (escape_point::arg): Make unsigned.
      	(modref_lattice::add_escape_point): Use eaf_flags_t.
      	(update_escape_summary_1): Use eaf_flags_t, && for bool.
      	* pair-fusion.cc (pair_fusion_bb_info::track_access):
      	Make mem_size unsigned int.
      	* pretty-print.cc (format_phase_2): Cast va_arg to char.
      	* tree-ssa-loop-ch.cc (ch_base::copy_headers): Make nheaders
      	unsigned, remove cast.
      	* tree-ssa-structalias.cc (bitpos_of_field): Return unsigned.
      	(push_fields_onto_fieldstack):Make offset unsigned, remove cast.
      	* tree-vect-slp.cc (vect_prologue_cost_for_slp): Use nelt_limit.
      	* tree-vect-stmts.cc (vect_truncate_gather_scatter_offset):
      	Make scale unsigned.
      	(vectorizable_operation): Make ncopies unsigned.
      	* rtl-ssa/member-fns.inl: Make num_accesses unsigned int.
      2249c3b4
    • Andre Vehreschild's avatar
      Fortran: Assign allocated caf-memory to scalar members [PR84870] · f5035d7d
      Andre Vehreschild authored
      Allocating a coarray required an array-descriptor.  For scalars a
      temporary descriptor was created.  Assigning the allocated memory from
      the temporary descriptor back to the scalar is now added.
      
      gcc/fortran/ChangeLog:
      
      	PR fortran/84870
      
      	* trans-array.cc (duplicate_allocatable_coarray): For scalar
      	allocatable components the memory allocated is now assigned to
      	the component's pointer.
      
      gcc/testsuite/ChangeLog:
      
      	* gfortran.dg/coarray/alloc_comp_10.f90: New test.
      f5035d7d
    • Richard Biener's avatar
      tree-optimization/114855 - more update_ssa speedup · 9a795b3a
      Richard Biener authored
      The following tackles another source of slow bitmap operations,
      namely populating blocks_to_update.  We already have that in
      tree view around PHI insertion but also the initial population is
      slow.  There's unfortunately a conditional inbetween list view
      requirement and the bitmap API doesn't allow opportunistic
      switching but rejects tree -> tree or list -> list transitions.
      So the following patch wraps the early population in a tree view
      section with possibly one redundant tree -> list -> tree view
      transition.
      
      This cuts tree SSA incremental from 228.25s (21%) to 65.05s (7%).
      
      	PR tree-optimization/114855
      	* tree-into-ssa.cc (update_ssa): Use tree view for the
      	initial population of blocks_to_update.
      9a795b3a
    • Aldy Hernandez's avatar
      Alphabetize my entry in MAINTAINER's DCO list. · 34366176
      Aldy Hernandez authored
      ChangeLog:
      
      	* MAINTAINERS: Move my entry in DCO list into alphabetical order.
      34366176
    • Tobias Burnus's avatar
      OpenMP: Add support for 'self_maps' to the 'require' directive · b752eed3
      Tobias Burnus authored
      'self_maps' implies 'unified_shared_memory', except that the latter
      also permits that explicit maps copy data to device memory while
      self_maps does not. In GCC, currently, both are handled identical.
      
      gcc/c/ChangeLog:
      
      	* c-parser.cc (c_parser_omp_requires): Handle self_maps clause.
      
      gcc/cp/ChangeLog:
      
      	* parser.cc (cp_parser_omp_requires): Handle self_maps clause.
      
      gcc/fortran/ChangeLog:
      
      	* gfortran.h (enum gfc_omp_requires_kind): Add OMP_REQ_SELF_MAPS.
      	(gfc_namespace): Enlarge omp_requires bitfield.
      	* module.cc (enum ab_attribute, attr_bits): Add AB_OMP_REQ_SELF_MAPS.
      	(mio_symbol_attribute): Handle it.
      	* openmp.cc (gfc_check_omp_requires, gfc_match_omp_requires): Handle
      	self_maps clause.
      	* parse.cc (gfc_parse_file): Handle self_maps clause.
      
      gcc/ChangeLog:
      
      	* lto-cgraph.cc (output_offload_tables, omp_requires_to_name): Handle
      	self_maps clause.
      	* omp-general.cc (struct omp_ts_info, omp_context_selector_matches):
      	Likewise for the associated trait.
      	* omp-general.h (enum omp_requires): Add OMP_REQUIRES_SELF_MAPS.
      	* omp-selectors.h (enum omp_ts_code): Add
      	OMP_TRAIT_IMPLEMENTATION_SELF_MAPS.
      
      include/ChangeLog:
      
      	* gomp-constants.h (GOMP_REQUIRES_SELF_MAPS): #define.
      
      libgomp/ChangeLog:
      
      	* plugin/plugin-gcn.c (GOMP_OFFLOAD_get_num_devices):
      	Accept self_maps clause.
      	* plugin/plugin-nvptx.c (GOMP_OFFLOAD_get_num_devices):
      	Likewise.
      	* libgomp.texi (TR13 Impl. Status): Set to 'Y'.
      	* target.c (gomp_requires_to_name, GOMP_offload_register_ver,
      	gomp_target_init): Handle self_maps clause.
      	* testsuite/libgomp.fortran/self_maps.f90: New test.
      
      gcc/testsuite/ChangeLog:
      
      	* c-c++-common/gomp/declare-variant-1.c: Add self_maps test.
      	* c-c++-common/gomp/requires-4.c: Likewise.
      	* gfortran.dg/gomp/declare-variant-3.f90:  Likewise.
      	* c-c++-common/gomp/requires-2.c: Update dg-error msg.
      	* gfortran.dg/gomp/requires-2.f90: Likewise.
      	* gfortran.dg/gomp/requires-self-maps-aux.f90: New.
      	* gfortran.dg/gomp/requires-self-maps.f90: New.
      b752eed3
    • Francois-Xavier Coudert's avatar
      Testsuite, darwin: account for macOS 15 · 7e560ffd
      Francois-Xavier Coudert authored
      gcc/testsuite/ChangeLog:
      
      	* gcc.dg/darwin-minversion-link.c: Account for macOS 15.
      7e560ffd
    • Richard Biener's avatar
      tree-optimization/115372 - failed store-lanes in some cases · f594008d
      Richard Biener authored
      The gcc.target/riscv/rvv/autovec/struct/struct_vect-4.c testcase shows
      that we sometimes fail to use store-lanes even though it should be
      profitable.  We're currently relying on vect_slp_prefer_store_lanes_p
      at the point we run into the first SLP discovery mismatch with obviously
      limited information.  For the case at hand we have 3, 5 or 7 lanes
      of VnDImode [2, 2] vectors with the first mismatch at lane 2 so the
      new group size is 1.  The heuristic says that might be an OK split
      given the rest is a multiple of the vector lanes.  Now we continue
      discovery but in the end mismatches result in uniformly single-lane
      SLP instances which we can handle via interleaving but of course are
      prime candidates for store-lanes.  The following patch re-assesses
      with the extra knowledge now just relying on the fact whether the
      target supports store-lanes for the given group size.
      
      	PR tree-optimization/115372
      	* tree-vect-slp.cc (vect_build_slp_instance): Compute the
      	uniform, if, number of lanes of the RHS sub-graphs feeding
      	the store and if uniformly one, use store-lanes if the target
      	supports that.
      f594008d
    • Nathaniel Shead's avatar
      libstdc++: Remove unnecessary 'static' from __is_specialization_of · 618871ff
      Nathaniel Shead authored
      
      This makes the declarations internal linkage, which is an ODR issue, and
      causes a future modules patch to fail regtest as it now detects attempted
      uses of TU-local entities in module CMIs.
      
      libstdc++-v3/ChangeLog:
      
      	* include/std/format: Remove unnecessary 'static'.
      
      Signed-off-by: default avatarNathaniel Shead <nathanieloshead@gmail.com>
      618871ff
    • Richard Biener's avatar
      tree-optimization/114855 - high update_ssa time · f9dfe8de
      Richard Biener authored
      Part of the problem in PR114855 is high update_ssa time.  When one fixes
      the backward jump threading issue tree SSA incremental is at
      439.91s ( 26%), mostly doing bitmap element searches for
      blocks_with_phis_to_rewrite.  The following turns that bitmap to tree
      view noticing the two-dimensional vector of PHIs it guards is excessive
      compared to what we actually save with it - walking all PHI nodes
      in a block, something we already do once to initialize stmt flags.
      So instead of optimizing that walk we use the stmt flag, saving
      allocations and global state that lives throughout the whole
      compilation.
      
      This reduces the tree SSA incremental time to 203.13 ( 14%)
      
      The array was added in r0-74758-g2ce798794df8e1 when we still possibly
      had gazillion virtual operands for PR26830, I checked the testcase
      still behaves OK.
      
      	PR tree-optimization/114855
      	* tree-into-ssa.cc (phis_to_rewrite): Remove global var.
      	(mark_phi_for_rewrite): Simplify.
      	(rewrite_update_phi_arguments): Walk all PHIs, process
      	those satisfying rewrite_uses_p.
      	(delete_update_ssa): Simplify.
      	(update_ssa): Likewise.  Switch blocks_with_phis_to_rewrite
      	to tree view.
      f9dfe8de
    • Yangyu Chen's avatar
      hosthooks.h: Fix GCC_HOST_HOOKS_H typo · 824229e3
      Yangyu Chen authored
      
      The comment of the final endif in hosthooks.h is wrong, it should be
      GCC_HOST_HOOKS_H instead of GCC_LANG_HOOKS_H.
      
      gcc/ChangeLog:
      
      	* hosthooks.h (struct host_hooks): Fix GCC_HOST_HOOKS_H typo.
      
      Signed-off-by: default avatarYangyu Chen <chenyangyu@isrc.iscas.ac.cn>
      824229e3
    • Prathamesh Kulkarni's avatar
      nvptx: Partial support for aliases to aliases. · f5ee372b
      Prathamesh Kulkarni authored
      
      For the following test (adapted from pr96390.c):
      
      __attribute__((noipa)) int foo () { return 42; }
      int bar () __attribute__((alias ("foo")));
      int baz () __attribute__((alias ("bar")));
      
      int main ()
      {
        int n;
        #pragma omp target map(from:n)
          n = baz ();
        return n;
      }
      
      gcc emits following ptx for baz:
      .visible .func (.param.u32 %value_out) bar;
      .alias bar,foo;
      .visible .func (.param.u32 %value_out) baz;
      .alias baz,bar;
      
      which is incorrect since PTX requires aliasee to be a defined function.
      The patch instead uses cgraph_node::get(name)->ultimate_alias_target,
      which generates the following PTX:
      
      .visible .func (.param.u32 %value_out) baz;
      .alias baz,foo;
      
      gcc/ChangeLog:
      	PR target/104957
      	* config/nvptx/nvptx.cc (nvptx_asm_output_def_from_decls): Use
      	cgraph_node::get(name)->ultimate_alias_target instead of value.
      
      gcc/testsuite/ChangeLog:
      	PR target/104957
      	* gcc.target/nvptx/alias-to-alias-1.c: Adjust.
      
      Signed-off-by: default avatarPrathamesh Kulkarni <prathameshk@nvidia.com>
      Co-authored-by: default avatarThomas Schwinge <tschwinge@baylibre.com>
      f5ee372b
    • GCC Administrator's avatar
      Daily bump. · 4d6fa5b7
      GCC Administrator authored
      4d6fa5b7
  2. Sep 23, 2024
    • Gaius Mulley's avatar
      modula2: Add noreturn attribute to m2/gm2-libs/M2RTS.mod · 5ef52ec4
      Gaius Mulley authored
      
      This patch removes a build warning by adding a noreturn attribute
      to the M2RTS.mod:HaltC procedure.  Also add an infinite loop to
      gm2-libs-min/M2RTS.mod.
      
      gcc/m2/ChangeLog:
      
      	* Make-lang.in (m2/gm2-libs-boot/M2RTS.o): Remove
      	--suppress-noreturn.
      	* gm2-libs/M2RTS.mod (HaltC): Add noreturn attribute.
      	* gm2-libs-min/M2RTS.mod (HALT): Add LOOP END.
      
      Signed-off-by: default avatarGaius Mulley <gaiusmod2@gmail.com>
      5ef52ec4
    • Andrew Kreimer's avatar
      libobjc: Fix typos · 0121b852
      Andrew Kreimer authored
      
      Fix typos in comments.
      
      libobjc/ChangeLog:
      
      	* Makefile.in: s/overrridden/overridden.
      	* encoding.c (_darwin_rs6000_special_round_type_align): Fix typo
      	in comment.
      	(rs6000_special_round_type_align): Likewise.
      	* exception.c (is_kind_of_exception_matcher): Likewise.
      	(PERSONALITY_FUNCTION): Likewise.
      	* hash.c (objc_hash_next): Likewise.
      	* init.c (__objc_create_classes_tree): Likewise.
      	* objc-private/objc-list.h (list_remove_head): Likewise.
      	* sendmsg.c (__objc_install_dtable_for_class): Likewise.
      	* thr.c (objc_thread_yield): Likewise.
      
      Signed-off-by: default avatarAndrew Kreimer <algonell@gmail.com>
      0121b852
    • Marek Polacek's avatar
      c++: diagnose this specifier in requires expr [PR116798] · 4700ad1c
      Marek Polacek authored
      
      We don't detect an explicit object parameter in a requires expression.
      We can get there by way of requires-expression -> requirement-parameter-list
      -> parameter-declaration-clause -> ... -> parameter-declaration with
      this[opt].  But [dcl.fct]/5 doesn't allow an explicit object parameter
      in this context.  So let's fix it like r14-9033 and not like r14-8832.
      
      	PR c++/116798
      
      gcc/cp/ChangeLog:
      
      	* parser.cc (cp_parser_parameter_declaration): Detect an explicit
      	object parameter in a requires expression.
      
      gcc/testsuite/ChangeLog:
      
      	* g++.dg/cpp23/explicit-obj-diagnostics12.C: New test.
      
      Reviewed-by: default avatarJason Merrill <jason@redhat.com>
      4700ad1c
    • Saurabh Jha's avatar
      aarch64: Add codegen support for AdvSIMD faminmax · c1fb78fb
      Saurabh Jha authored
      The AArch64 FEAT_FAMINMAX extension is optional from Armv9.2-a and
      mandatory from Armv9.5-a. It introduces instructions for computing the
      floating point absolute maximum and minimum of the two vectors
      element-wise.
      
      This patch adds code generation support for famax and famin in terms of
      existing RTL operators.
      
      famax/famin is equivalent to first taking abs of the operands and then
      taking smax/smin on the results of abs.
      
      	famax/famin (a, b) = smax/smin (abs (a), abs (b))
      
      This fusion of operators is only possible when -march=armv9-a+faminmax
      flags are passed. We also need to pass -ffast-math flag; if we don't,
      then a statement like
      
      	c[i] = __builtin_fmaxf16 (a[i], b[i]);
      
      is RTL expanded to UNSPEC_FMAXNM instead of smax (likewise for smin).
      
      This code generation is only available on -O2 or -O3 as that is when
      auto-vectorization is enabled.
      
      gcc/ChangeLog:
      
      	* config/aarch64/aarch64-simd.md
      	(*aarch64_faminmax_fused): Instruction pattern for faminmax
      	codegen.
      	* config/aarch64/iterators.md: Attribute for faminmax codegen.
      
      gcc/testsuite/ChangeLog:
      
      	* gcc.target/aarch64/simd/faminmax-codegen-no-flag.c: New test.
      	* gcc.target/aarch64/simd/faminmax-codegen.c: New test.
      	* gcc.target/aarch64/simd/faminmax-no-codegen.c: New test.
      c1fb78fb
    • Saurabh Jha's avatar
      aarch64: Add AdvSIMD faminmax intrinsics · bfefed6c
      Saurabh Jha authored
      The AArch64 FEAT_FAMINMAX extension is optional from Armv9.2-a and
      mandatory from Armv9.5-a. It introduces instructions for computing the
      floating point absolute maximum and minimum of the two vectors element-wise.
      
      This patch introduces AdvSIMD faminmax intrinsics. The intrinsics of
      this extension are implemented as the following builtin functions:
      * vamax_f16
      * vamaxq_f16
      * vamax_f32
      * vamaxq_f32
      * vamaxq_f64
      * vamin_f16
      * vaminq_f16
      * vamin_f32
      * vaminq_f32
      * vaminq_f64
      
      We are defining a new way to add AArch64 AdvSIMD intrinsics by listing
      all the intrinsics in a .def file and then using that .def file to
      initialise various data structures. This would lead to more concise code
      and easier addition of the new AdvSIMD intrinsics in future.
      
      The faminmax intrinsics are defined using the new approach.
      
      gcc/ChangeLog:
      
      	* config/aarch64/aarch64-builtins.cc
      	(ENTRY): Macro to parse the contents of
      	aarch64-simd-pragma-builtins.def.
      	(ENTRY_VHSDF): Macro to parse the contents of
      	aarch64-simd-pragma-builtins.def.
      	(enum aarch64_builtins): New enum values for faminmax builtins
      	via aarch64-simd-pragma-builtins.def.
      	(enum class aarch64_builtin_signatures): Enum class to specify
      	the number of operands a builtin will take.
      	(struct aarch64_pragma_builtins_data): Struct to hold data from
      	aarch64-simd-pragma-builtins.def.
      	(aarch64_fntype): New function to define function types of
      	intrinsics given an object of type aarch64_pragma_builtins_data.
      	(aarch64_init_pragma_builtins): New function to define pragma
      	builtins.
      	(aarch64_get_pragma_builtin): New function to get a row of
      	aarch64_pragma_builtins, given code.
      	(handle_arm_neon_h): Modify to call
      	aarch64_init_pragma_builtins.
      	(aarch64_general_check_builtin_call): Modify to check whether
      	required flag is being used for pragma builtins.
      	(aarch64_expand_pragma_builtin): New function to emit
      	instructions of pragma_builtin.
      	(aarch64_general_expand_builtin): Modify to call
      	aarch64_expand_pragma_builtin.
      	* config/aarch64/aarch64-option-extensions.def
      	(AARCH64_OPT_EXTENSION): Introduce new flag for this extension.
      	* config/aarch64/aarch64-simd.md
      	(@aarch64_<faminmax_uns_op><mode>): Instruction pattern for
      	faminmax intrinsics.
      	* config/aarch64/aarch64.h
      	(TARGET_FAMINMAX): Introduce new flag for this extension.
      	* config/aarch64/iterators.md: New iterators and unspecs.
      	* doc/invoke.texi: Document extension in AArch64 Options.
      	* config/aarch64/aarch64-simd-pragma-builtins.def: New file to
      	list pragma builtins.
      
      gcc/testsuite/ChangeLog:
      
      	* gcc.target/aarch64/simd/faminmax-builtins-no-flag.c: New test.
      	* gcc.target/aarch64/simd/faminmax-builtins.c: New test.
      bfefed6c
    • Saurabh Jha's avatar
      [MAINTAINERS] Fix myself in order and add username · 6141d0c9
      Saurabh Jha authored
      ChangeLog:
      
      	* MAINTAINERS: Fix sort order and add username.
      6141d0c9
    • Jason Merrill's avatar
      libstdc++: guard c++config pragmas · 358db2e3
      Jason Merrill authored
      c++config needs to be compilable as C, in which mode we complain about the
      -Wc++ pragmas.
      
      libstdc++-v3/ChangeLog:
      
      	* include/bits/c++config: Don't try to disable -Wc++??-extensions
      	when compiling as C.
      358db2e3
    • Jason Merrill's avatar
      libstdc++: operator new/delete are transaction_safe · 2620e372
      Jason Merrill authored
      With the changes to #pragma system_header, g++.dg/tm/pr46270.C was
      failing because <new> didn't implement the N4514 change to [new.delete] that
      says "The library versions of the global allocation and deallocation
      functions are declared transaction_safe (8.3.5 dcl.fct)."  We already have
      the _GLIBCXX_TXN_SAFE macro, just need to add it.
      
      libstdc++-v3/ChangeLog:
      
      	* libsupc++/new: Add _GLIBCXX_TXN_SAFE.
      2620e372
    • Matthieu Longo's avatar
      dwarf2: store the RA state in CFI row · 2b797144
      Matthieu Longo authored
      On AArch64, the RA state informs the unwinder whether the return address
      is mangled and how, or not. This information is encoded in a boolean in
      the CFI row. This binary approach prevents from expressing more complex
      configuration, as it is the case with PAuth_LR introduced in Armv9.5-A.
      
      This patch addresses this limitation by replacing the boolean by an enum.
      
      gcc/ChangeLog:
      
      	* dwarf2cfi.cc
      	(struct dw_cfi_row): Declare a new enum type to replace ra_mangled.
      	(cfi_row_equal_p): Use ra_state instead of ra_mangled.
      	(dwarf2out_frame_debug_cfa_negate_ra_state): Same.
      	(change_cfi_row): Same.
      2b797144
    • Matthieu Longo's avatar
      aarch64 testsuite: explain expectections for pr94515* tests · fb475d3f
      Matthieu Longo authored
      gcc/testsuite/ChangeLog:
      
      	* g++.target/aarch64/pr94515-1.C: Improve test documentation.
      	* g++.target/aarch64/pr94515-2.C: Same.
      fb475d3f
    • Matthieu Longo's avatar
      dwarf2: add hooks for architecture-specific CFIs · 9e1c71ba
      Matthieu Longo authored
      Architecture-specific CFI directives are currently declared an processed
      among others architecture-independent CFI directives in gcc/dwarf2* files.
      This approach creates confusion, specifically in the case of DWARF
      instructions in the vendor space and using the same instruction code.
      
      Such a clash currently happen between DW_CFA_GNU_window_save (used on
      SPARC) and DW_CFA_AARCH64_negate_ra_state (used on AArch64), and both
      having the same instruction code 0x2d.
      Then AArch64 compilers generates a SPARC CFI directive (.cfi_window_save)
      instead of .cfi_negate_ra_state, contrarilly to what is expected in
      [DWARF for the Arm 64-bit Architecture (AArch64)](https://github.com/
      ARM-software/abi-aa/blob/main/aadwarf64/aadwarf64.rst).
      
      This refactoring does not solve completely the problem, but improve the
      situation by moving some of the processing of those directives (more
      specifically their output in the assembly) to the backend via 2 target
      hooks:
      - DW_CFI_OPRND1_DESC: parse the first operand of the directive (if any).
      - OUTPUT_CFI_DIRECTIVE: output the CFI directive as a string.
      
      Additionally, this patch also contains a renaming of an enum used for
      return address mangling on AArch64.
      
      gcc/ChangeLog:
      
      	* config/aarch64/aarch64.cc
      	(aarch64_output_cfi_directive): New hook for CFI directives.
      	(aarch64_dw_cfi_oprnd1_desc): Same.
      	(TARGET_OUTPUT_CFI_DIRECTIVE): Hook for output_cfi_directive.
      	(TARGET_DW_CFI_OPRND1_DESC): Hook for dw_cfi_oprnd1_desc.
      	* config/sparc/sparc.cc
      	(sparc_output_cfi_directive): New hook for CFI directives.
      	(sparc_dw_cfi_oprnd1_desc): Same.
      	(TARGET_OUTPUT_CFI_DIRECTIVE): Hook for output_cfi_directive.
      	(TARGET_DW_CFI_OPRND1_DESC): Hook for dw_cfi_oprnd1_desc.
      	* coretypes.h
      	(struct dw_cfi_node): Forward declaration of CFI type from
      	gcc/dwarf2out.h.
      	(enum dw_cfi_oprnd_type): Same.
      	(enum dwarf_call_frame_info): Same.
      	* doc/tm.texi: Regenerated from doc/tm.texi.in.
      	* doc/tm.texi.in: Add doc for new target hooks.
      	type of enum to allow forward declaration.
      	* dwarf2cfi.cc
      	(struct dw_cfi_row): Update the description for window_save
      	and ra_mangled.
      	(dwarf2out_frame_debug_cfa_negate_ra_state): Use AArch64 CFI
      	directive instead of the SPARC one.
      	(change_cfi_row): Use the right CFI directive's name for RA
      	mangling.
      	(output_cfi): Remove explicit architecture-specific CFI
      	directive DW_CFA_GNU_window_save that falls into default case.
      	(output_cfi_directive): Use target hook as default.
      	* dwarf2out.cc (dw_cfi_oprnd1_desc): Use target hook as default.
      	* dwarf2out.h (enum dw_cfi_oprnd_type): specify underlying type
      	of enum to allow forward declaration.
      	(dw_cfi_oprnd1_desc): Call target hook.
      	(output_cfi_directive): Use dw_cfi_ref instead of struct
      	dw_cfi_node *.
      	* hooks.cc
      	(hook_bool_dwcfi_dwcfioprndtyperef_false): New.
      	(hook_bool_FILEptr_dwcfiptr_false): New.
      	* hooks.h
      	(hook_bool_dwcfi_dwcfioprndtyperef_false): New.
      	(hook_bool_FILEptr_dwcfiptr_false): New.
      	* target.def: Documentation for new hooks.
      
      include/ChangeLog:
      
      	* dwarf2.h (enum dwarf_call_frame_info): specify underlying
      
      libffi/ChangeLog:
      
      	* include/ffi_cfi.h (cfi_negate_ra_state): Declare AArch64 cfi
      	directive.
      
      libgcc/ChangeLog:
      
      	* config/aarch64/aarch64-asm.h (PACIASP): Replace SPARC CFI
      	directive by AArch64 one.
      	(AUTIASP): Same.
      
      libitm/ChangeLog:
      
      	* config/aarch64/sjlj.S: Replace SPARC CFI directive by
      	AArch64 one.
      
      gcc/testsuite/ChangeLog:
      
      	* g++.target/aarch64/pr94515-1.C: Replace SPARC CFI directive by
      	AArch64 one.
      	* g++.target/aarch64/pr94515-2.C: Same.
      9e1c71ba
    • Matthieu Longo's avatar
      Rename REG_CFA_TOGGLE_RA_MANGLE to REG_CFA_NEGATE_RA_STATE · 4068096f
      Matthieu Longo authored
      The current name REG_CFA_TOGGLE_RA_MANGLE is not representative of what
      it really is, i.e. a register to represent several states, not only a
      binary one. Same for dwarf2out_frame_debug_cfa_toggle_ra_mangle.
      
      gcc/ChangeLog:
      
      	* combine-stack-adj.cc
      	(no_unhandled_cfa): Rename.
      	* config/aarch64/aarch64.cc
      	(aarch64_expand_prologue): Rename.
      	(aarch64_expand_epilogue): Rename.
      	* dwarf2cfi.cc
      	(dwarf2out_frame_debug_cfa_toggle_ra_mangle): Rename this...
      	(dwarf2out_frame_debug_cfa_negate_ra_state): To this.
      	(dwarf2out_frame_debug): Rename.
      	* reg-notes.def (REG_CFA_NOTE): Rename REG_CFA_TOGGLE_RA_MANGLE.
      4068096f
    • Matthieu Longo's avatar
      libgcc: hide CIE and FDE data for DWARF architecture extensions behind a handler. · bdf41d62
      Matthieu Longo authored
      This patch provides a new handler MD_ARCH_FRAME_STATE_T to hide an
      architecture-specific structure containing CIE and FDE data related
      to DWARF architecture extensions.
      
      Hiding the architecture-specific attributes behind a handler has the
      following benefits:
      1. isolating those data from the generic ones in _Unwind_FrameState
      2. avoiding casts to custom types.
      3. preserving typing information when debugging with GDB, and so
         facilitating their printing.
      
      This approach required to add a new header md-unwind-def.h included at
      the top of libgcc/unwind-dw2.h, and redirecting to the corresponding
      architecture header via a symbolic link.
      
      An obvious drawback is the increase in complexity with macros, and
      headers. It also caused a split of architecture definitions between
      md-unwind-def.h (types definitions used in unwind-dw2.h) and
      md-unwind.h (local types definitions and handlers implementations).
      The naming of md-unwind.h with .h extension is a bit misleading as
      the file is only included in the middle of unwind-dw2.c. Changing
      this naming would require modification of others backends, which I
      prefered to abstain from. Overall the benefits are worth the added
      complexity from my perspective.
      
      libgcc/ChangeLog:
      
      	* Makefile.in: New target for symbolic link to md-unwind-def.h
      	* config.host: New parameter md_unwind_def_header. Set it to
      	aarch64/aarch64-unwind-def.h for AArch64 targets, or no-unwind.h
      	by default.
      	* config/aarch64/aarch64-unwind.h
      	(aarch64_pointer_auth_key): Move to aarch64-unwind-def.h
      	(aarch64_cie_aug_handler): Update.
      	(aarch64_arch_extension_frame_init): Update.
      	(aarch64_demangle_return_addr): Update.
      	* configure.ac: New substitute variable md_unwind_def_header.
      	* unwind-dw2.h (defined): MD_ARCH_FRAME_STATE_T.
      	* config/aarch64/aarch64-unwind-def.h: New file.
      	* configure: Regenerate.
      	* config/no-unwind.h: Updated comment
      bdf41d62
    • Matthieu Longo's avatar
      aarch64: skip copy of RA state register into target context · ba3e5976
      Matthieu Longo authored
      The RA state register is local to a frame, so it should not be copied to
      the target frame during the context installation.
      
      This patch adds a new backend handler that check whether a register
      needs to be skipped or not before its installation.
      
      libgcc/ChangeLog:
      
      	* config/aarch64/aarch64-unwind.h
      	(MD_FRAME_LOCAL_REGISTER_P): new handler checking whether a register
      	from the current context needs to be skipped before installation into
      	the target context.
      	(aarch64_frame_local_register): Likewise.
      	* unwind-dw2.c (uw_install_context_1): use MD_FRAME_LOCAL_REGISTER_P.
      ba3e5976
    • Matthieu Longo's avatar
      aarch64: store signing key and signing method in DWARF _Unwind_FrameState · f5316739
      Matthieu Longo authored
      This patch is only a refactoring of the existing implementation
      of PAuth and returned-address signing. The existing behavior is
      preserved.
      
      _Unwind_FrameState already contains several CIE and FDE information
      (see the attributes below the comment "The information we care
      about from the CIE/FDE" in libgcc/unwind-dw2.h).
      The patch aims at moving the information from DWARF CIE (signing
      key stored in the augmentation string) and FDE (the used signing
      method) into _Unwind_FrameState along the already-stored CIE and
      FDE information.
      Note: those information have to be saved in frame_state_reg_info
      instead of _Unwind_FrameState as they need to be savable by
      DW_CFA_remember_state and restorable by DW_CFA_restore_state, that
      both rely on the attribute "prev".
      
      Those new information in _Unwind_FrameState simplifies the look-up
      of the signing key when the return address is demangled. It also
      allows future signing methods to be easily added.
      
      _Unwind_FrameState is not a part of the public API of libunwind,
      so the change is backward compatible.
      
      A new architecture-specific handler MD_ARCH_EXTENSION_FRAME_INIT
      allows to reset values (if needed) in the frame state and unwind
      context before changing the frame state to the caller context.
      
      A new architecture-specific handler MD_ARCH_EXTENSION_CIE_AUG_HANDLER
      isolates the architecture-specific augmentation strings in AArch64
      backend, and allows others architectures to reuse augmentation
      strings that would have clashed with AArch64 DWARF extensions.
      
      aarch64_demangle_return_addr, DW_CFA_AARCH64_negate_ra_state and
      DW_CFA_val_expression cases in libgcc/unwind-dw2-execute_cfa.h
      were documented to clarify where the value of the RA state register
      is stored (FS and CONTEXT respectively).
      
      libgcc/ChangeLog:
      
      	* config/aarch64/aarch64-unwind.h
      	(AARCH64_DWARF_RA_STATE_MASK): The mask for RA state register.
      	(aarch64_ra_signing_method_t): The diversifiers used to sign a
      	function's return address.
      	(aarch64_pointer_auth_key): The key used to sign a function's
      	return address.
      	(aarch64_cie_signed_with_b_key): Deleted as the signing key is
      	available now in _Unwind_FrameState.
      	(MD_ARCH_EXTENSION_CIE_AUG_HANDLER): New CIE augmentation string
      	handler for architecture extensions.
      	(MD_ARCH_EXTENSION_FRAME_INIT): New architecture-extension
      	initialization routine for DWARF frame state and context before
      	execution of DWARF instructions.
      	(aarch64_context_ra_state_get): Read RA state register from CONTEXT.
      	(aarch64_ra_state_get): Read RA state register from FS.
      	(aarch64_ra_state_set): Write RA state register into FS.
      	(aarch64_ra_state_toggle): Toggle RA state register in FS.
      	(aarch64_cie_aug_handler): Handler AArch64 augmentation strings.
      	(aarch64_arch_extension_frame_init): Initialize defaults for the
      	signing key (PAUTH_KEY_A), and RA state register (RA_no_signing).
      	(aarch64_demangle_return_addr): Rely on the frame registers and
      	the signing_key attribute in _Unwind_FrameState.
      	* unwind-dw2-execute_cfa.h:
      	Use the right alias DW_CFA_AARCH64_negate_ra_state for __aarch64__
      	instead of DW_CFA_GNU_window_save.
      	(DW_CFA_AARCH64_negate_ra_state): Save the signing method in RA
      	state register. Toggle RA state register without resetting 'how'
      	to REG_UNSAVED.
      	* unwind-dw2.c:
      	(extract_cie_info): Save the signing key in the current
      	_Unwind_FrameState while parsing the augmentation data.
      	(uw_frame_state_for): Reset some attributes related to architecture
      	extensions in _Unwind_FrameState.
      	(uw_update_context): Move authentication code to AArch64 unwinding.
      	* unwind-dw2.h (enum register_rule): Give a name to the existing
      	enum for the register rules, and replace 'unsigned char' by 'enum
      	register_rule' to facilitate debugging in GDB.
      	(_Unwind_FrameState): Add a new architecture-extension attribute
      	to store the signing key.
      f5316739
    • Tobias Burnus's avatar
      OpenMP: Fix omp_get_device_from_uid, minor cleanup · cdb9aa0f
      Tobias Burnus authored
      In Fortran, omp_get_device_from_uid can also accept substrings, which are
      then not NUL terminated.  Fixed by introducing a fortran.c wrapper function.
      Additionally, in case of a fail the plugin functions now return NULL instead
      of failing fatally such that a fall-back UID is generated.
      
      gcc/ChangeLog:
      
      	* omp-general.cc (omp_runtime_api_procname): Strip "omp_" from
      	string; move get_device_from_uid as now a '_' suffix exists.
      
      libgomp/ChangeLog:
      
      	* fortran.c (omp_get_device_from_uid_): New function.
      	* libgomp.map (GOMP_6.0): Add it.
      	* oacc-host.c (host_dispatch): Init '.uid' and '.get_uid_func'.
      	* omp_lib.f90.in: Make it used by removing bind(C).
      	* omp_lib.h.in: Likewise.
      	* target.c (omp_get_device_from_uid): Ensure the device is initialized.
      	* plugin/plugin-gcn.c (GOMP_OFFLOAD_get_uid): Add function comment;
      	return NULL in case of an error.
      	* plugin/plugin-nvptx.c (GOMP_OFFLOAD_get_uid): Likewise.
      	* testsuite/libgomp.fortran/device_uid.f90: Update to test substrings.
      cdb9aa0f
    • Claudiu Zissulescu's avatar
      arc: Remove mlra option [PR113954] · ffd861c8
      Claudiu Zissulescu authored
      
      The target dependent mlra option was designed to be able to quickly
      switch between LRA and reload.  The reload register allocator step is
      scheduled for retirement, thus, remove the functionality of mlra,
      keeping it for backward compatibility.
      
      	PR target/113954
      
      gcc/ChangeLog:
      
      	* config/arc/arc.cc (TARGET_LRA_P): Always return true.
      	(arc_lra_p): Remove.
      	* config/arc/arc.h (TARGET_LRA): Remove.
      	* config/arc/arc.opt (mlra): Change it to do nothing.
      	* doc/invoke.texi (mlra): Update option description.
      
      Signed-off-by: default avatarClaudiu Zissulescu <claziss@gmail.com>
      ffd861c8
    • Simon Martin's avatar
      c++: Don't crash when mangling member with anonymous union or template type [PR100632, PR109790] · a030fcad
      Simon Martin authored
      We currently crash upon mangling members that have an anonymous union or
      a template operator type.
      
      The problem is that before calling write_unqualified_name,
      write_member_name asserts that it has a declaration whose DECL_NAME is
      an identifier node that is not that of an operator. This is wrong:
       - In PR100632, it's an anonymous union declaration, hence a 0 DECL_NAME
       - In PR109790, it's a legitimate template declaration for an operator
         (this was accepted up to GCC 10)
      
      This assert was added via r11-6301, to be sure that we do write the "on"
      marker for operator members.
      
      This patch removes that assert and instead
       - Lets members with an anonymous union type go through
       - For operators, adds the missing "on" marker for ABI versions greater
         than the highest usable with GCC 10
      
      	PR c++/109790
      	PR c++/100632
      
      gcc/cp/ChangeLog:
      
      	* mangle.cc (write_member_name): Handle members whose type is an
      	anonymous union member. Write missing "on" marker for operators
      	when ABI version is at least 16.
      
      gcc/testsuite/ChangeLog:
      
      	* g++.dg/cpp0x/decltype83.C: New test.
      	* g++.dg/cpp0x/decltype83a.C: New test.
      	* g++.dg/cpp1y/lambda-ice3.C: New test.
      	* g++.dg/cpp1y/lambda-ice3a.C: New test.
      	* g++.dg/cpp2a/nontype-class67.C: New test.
      a030fcad
    • Simon Martin's avatar
      c++: Don't ICE due to artificial constructor parameters [PR116722] · d7bf5e53
      Simon Martin authored
      The following code triggers an ICE
      
      === cut here ===
      class base {};
      class derived : virtual public base {
      public:
        template<typename Arg> constexpr derived(Arg) {}
      };
      int main() {
        derived obj(1.);
      }
      === cut here ===
      
      The problem is that cxx_bind_parameters_in_call ends up attempting to
      convert a REAL_CST (the first non artificial parameter) to INTEGER_TYPE
      (the type of the __in_chrg parameter), which ICEs.
      
      This patch changes cxx_bind_parameters_in_call to return early if it's
      called with a *structor that has an __in_chrg or __vtt_parm parameter
      since the expression won't be a constant expression.
      
      Note that in the test case, the constructor is not constexpr-suitable,
      however it's OK since it's a template according to my read of paragraph
      (3) of [dcl.constexpr].
      
      	PR c++/116722
      
      gcc/cp/ChangeLog:
      
      	* constexpr.cc (cxx_bind_parameters_in_call): Leave early for
      	{con,de}structors of classes with virtual bases.
      
      gcc/testsuite/ChangeLog:
      
      	* g++.dg/cpp0x/constexpr-ctor22.C: New test.
      d7bf5e53
    • Saurabh Jha's avatar
      Add myself to write after approval · 346f767f
      Saurabh Jha authored
      ChangeLog:
      
      	* MAINTAINERS: Add myself to write after approval.
      346f767f
    • Richard Biener's avatar
      tree-optimization/116810 - out-of-bound access to matches[] · 2c04f175
      Richard Biener authored
      The following makes sure to apply forced splitting of groups for
      firced single-lane SLP only when the group being analyzed has more
      than one lane.  This avoids an out-of-bound access to matches[].
      
      	PR tree-optimization/116810
      	* tree-vect-slp.cc (vect_build_slp_instance): Onlu force
      	splitting for group_size > 1.
      2c04f175
    • Richard Biener's avatar
      tree-optimization/116796 - virtual LC SSA broken after unrolling · e97c75d6
      Richard Biener authored
      When the unroller unloops loops it tracks whether it changes any
      nesting relationship of remaining loops but when scanning a loops
      preheader it fails to pass down the LC-SSA-invalidated bitmap, losing
      the fact that an unrolled formerly inner loop can now be placed on
      an exit of its outer loop.  The following fixes that.
      
      	PR tree-optimization/116796
      	* cfgloopmanip.cc (fix_loop_placements): Get LC-SSA-invalidated
      	bitmap and pass it on.
      	(remove_path): Pass LC-SSA-invalidated to fix_loop_placements.
      e97c75d6
    • Tamar Christina's avatar
      middle-end: Insert invariant instructions before the gsi [PR116812] · 09892448
      Tamar Christina authored
      The new invariant statements should be inserted before the current
      statement and not after.  This goes fine 99% of the time but when the
      current statement is a gcond the control flow gets corrupted.
      
      gcc/ChangeLog:
      
      	PR tree-optimization/116812
      	* tree-vect-slp.cc (vect_slp_region): Fix insertion.
      
      gcc/testsuite/ChangeLog:
      
      	PR tree-optimization/116812
      	* gcc.dg/vect/pr116812.c: New test.
      09892448
    • Richard Biener's avatar
      tree-optimization/116791 - Elementwise SLP vectorization · 723f7b6d
      Richard Biener authored
      The following restricts the elementwise SLP vectorization to the
      single-lane case which is the reason I enabled it to avoid regressions
      with non-SLP.  The PR shows that multi-line SLP loads with elementwise
      accesses require work, I'll open a new bug to track this for the
      future.
      
      	PR tree-optimization/116791
      	* tree-vect-stmts.cc (get_group_load_store_type): Only
      	fall back to elementwise access for single-lane SLP, restore
      	hard failure mode for other cases.
      
      	* gcc.dg/vect/pr116791.c: New testcase.
      723f7b6d
    • Tobias Burnus's avatar
      gcn/mkoffload.cc: Re-add fprintf for #include of stdlib.h/stdbool.h · dfb75079
      Tobias Burnus authored
      In commit r15-3629-g508ef585243d4674d06b0737bfe8769fc18f824f, #embed
      was added and no longer required fprintf '#include' removed, missing
      somehow that with -mstack-size=, the generated configure_stack_size
      will use 'setenv' and 'true'.
      
      gcc/ChangeLog:
      
      	* config/gcn/mkoffload.cc (process_asm): (Re)add the fprintf
      	lines for stdlib.h/stdbool.h inclusion if gcn_stack_size is used.
      dfb75079
    • Pan Li's avatar
      Genmatch: Fix ICE for binary phi cfg mismatching [PR116795] · 999363c5
      Pan Li authored
      
      This patch would like to fix one ICE when try to match the binary
      phi for below cfg.  We check the first edge of the Phi block comes
      from b0, instead of check the only one edge of b1 comes from the
      b0 too.  Thus, it will result in some code to be recog as .SAT_SUB
      but it is not, and finally result the verify_ssa failure.
      
      +------+
      | b0:  |
      | def  |       +-----+
      | ...  |       | b1: |
      | cond |------>| def |
      +------+       | ... |
         |           +-----+
         |              |
         |              |
         v              |
      +-----+           |
      | b2: |           |
      | Phi |<----------+
      +-----+
      
      The below test suites are passed for this patch.
      * The rv64gcv fully regression test.
      * The x86 bootstrap test.
      * The x86 fully regression test.
      
      	PR target/116795
      
      gcc/ChangeLog:
      
      	* gimple-match-head.cc (match_cond_with_binary_phi): Fix the
      	incorrect cfg check as b0->b1 in above example.
      
      gcc/testsuite/ChangeLog:
      
      	* gcc.dg/torture/pr116795-1.c: New test.
      
      Signed-off-by: default avatarPan Li <pan2.li@intel.com>
      999363c5
Loading